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CANCER-LINKED GENES AS TARGETS FOR 
CHEMOTHERAPY 

5 

FIELD OF THE INVENTION 

10 

The present invention relates to methods of identifying cancer-target 
genes and screening such cancer-target genes and expression products for 
involvement in the cancer initiation and facilitation process and the use of such 
genes for screening potential anti-cancer agents, including the design of small 
1 5 organic compounds and other molecules, and in the diagnosis of cancer. 



BACKGROUND OF THE INVENTION 

20 

Cancer-linked genes are valuable in that they indicate genetic differences 
between cancer cells and normal cells, such as where a gene is expressed in a 
cancer cell but not in a non-cancer cell, or where said gene is over-expressed or 
expressed at a higher level in a cancer as opposed to normal or non-cancer cell. 

25 In addition, the expression of such a gene in a normal cell but not in a cancer 
cell, especially of the same type of tissue, can indicate important functions in the 
cancerous process. For example, screening assays for novel drugs are based on 
the response of model cell based systems in vitro to treatment with specific 
compounds. Various measures of cellular response have been utilized, including 

30 the release of cytokines, alterations in cell surface markers, activation of specific 
enzymes, as well as alterations in ion flux and/or pH. Some such screens rely on 
specific genes, such as oncogenes (or gene mutations). In accordance with the 
present invention, cancer-target genes, and encoded polypeptides, have been 
identified. Such genes are useful in the diagnosing of cancer, the screening of 



anticancer agents and the treatment of cancer using such agents, especially in 
that these genes encode polypeptides that can act as markers, such as cell 
surface markers, thereby providing ready targets for anti-tumor agents such as 
antibodies, preferably antibodies complexed to cytotoxic agents, including 
5 apoptotic agents. 



BRIEF SUMMARY OF THE INVENTION 

10 

In accordance with the present invention, there is provided herein a set of 
genes related to, or linked to, cancer, or otherwise involved in the cancer 
initiating and facilitating process and referred to as cancer-target genes, as well 
as polypeptides encoded by such genes. 

15 

In a particular embodiment, such genes are those corresponding to 
KIAA1274, NEK6, PAK2, PAK4, STK38L, ACP1, ARHC, CDC6, CDK7, CDKN3, 
CRK7, DUSP16, FIGNL1, GUK1, ITPR2, KCNK1, KCNK5, PRO2000, RFC2 and 
RIPK2 and which encode polypeptides. 

20 

More particularly, such genes whose expression is changed in cancerous, 
as compared to non-cancerous cells, from a specific tissue, for example, lung, 
where the gene would include a polynucleotide corresponding to one of the 
genes designated KIAA1274, NEK6, PAK2, PAK4, STK38L, ACP1, ARHC, 
25 CDC6, CDK7, CDKN3, CRK7. DUSP16, FIGNL1, GUK1, ITPR2, KCNK1, 
KCNK5, PRO2000, RFC2 and RIPK2 of genes substantially identical to said 
genes and/or encode the same or similar polypeptide or a polypeptide differing 
therefrom by conservative amino acid substitutions. 

30 It is another object of the present invention to provide methods of using 

such characteristic genes as a basis for assaying the potential ability of selected 
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chemical agents to modulate upward or downward the expression of said cancer 
characteristic, or related, or target genes. 

It is a further object of the present invention to provide methods of 
5 detecting the expression, or non-expression, or amount of expression, of said 
characteristic gene, or portions thereof, as a means of determining the 
cancerous, or non-cancerous, status (or potential cancerous status) of selected 
cells as grown in culture or as maintained in situ. 

10 It is a still further object of the present invention to provide methods for 

treating cancerous conditions utilizing selected chemical agents as determined 
from their ability to modulate (i.e., increase or decrease) the characteristic gene, 
or its protein product. 

15 The present invention also relates to a method for treating cancer 

comprising contacting a cancerous cell with an agent having activity against an 
expression product encoded by one or more of the genes, which process may be 
conducted either ex vivo or in vivo and which product is disclosed herein. Such 
agents may comprise an antibody or other molecule or portion that is specific for 

20 said expression product. In a preferred embodiment, the polypeptide product of 
such genes is a polypeptide encoded by one of KIAA1274, NEK6, PAK2, PAK4, 
STK38L, ACP1 , ARHC, CDC6, CDK7, CDKN3, CRK7, DUSP16, FIGNL1, GUK1, 
ITPR2, KCNK1, KCNK5, PRO2000, RFC2 and RIPK2. 

25 

DEFINITIONS 

As used herein and except as noted otherwise, all terms are defined as 
30 given below. 
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The term "draggable" or "druggable domain" refers to a gene that encodes 
a protein domain known to be modulated by chemical compounds. 

As used herein, the term "isolated" means that the material is removed from 
5 its original environment (e.g., the natural environment if it is naturally occurring). It 
could also be produced recombinants and subsequently purified. For example, a 
naturally-occurring polynucleotide or polypeptide present in a living animal is not 
isolated, but the same polynucleotide or polypeptide, separated from some or all of 
the coexisting materials in the natural system, is isolated. Such polynucleotides, for 

1 0 example, those prepared recombinant^, could be part of a vector and/or such 
polynucleotides or polypeptides could be part of a composition, and still be isolated 
in that such vector or composition is not part of its natural environment. In one 
embodiment of the present invention, such isolated, or purified, polypeptide is 
useful in generating antibodies for practicing the invention, or where said antibody 

1 5 is attached to a cytotoxic or cytolytic agent, such as an apoptotic agent. 

As known in the art "similarity" between two polypeptides is determined by 
comparing the amino acid sequence and its conserved amino acid substitutes of 
one polypeptide to the sequence of a second polypeptide. As used herein, the 

20 terms "portion," "segment," and 'fragment," when used in relation to polypeptides, 
refer to a continuous sequence of residues, such as amino acid residues, which 
sequence forms a subset of a larger sequence. For example, if a polypeptide were 
subjected to treatment with any of the common endopeptidases, such as trypsin or 
chymotrypsin, the oligopeptides resulting from such treatment would represent 

25 portions, segments or fragments of the starting polypeptide. When used in relation 
to a polynucleotides, such terms refer to the products produced by treatment of said 
polynucleotides with any of the common endonucleases. 

As used herein, the term "corresponding genes" refers to genes that 
30 encode an RNA that is at least 90% identical, preferably at least 95% identical, 
most preferably at least 98% identical, and especially identical, to an RNA 
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encoded by one of the genes of KIAA1274, NEK6, PAK2, PAK4, STK38L, ACP1, 
ARHC, CDC6, CDK7, CDKN3, CRK7, DUSP16, FIGNL1, GUK1, ITPR2, KCNK1, 
KCNK5, PRO2000, RFC2 and RIPK2. 

5 As used herein, the term "correspond" means that the gene has the same 

nucleotide sequence as a gene disclosed herein or that it encodes substantially 
the same RNA as would be encoded by the disclosed gene, the term 
"substantially" meaning at least 90% identical as defined elsewhere herein and 
includes splice variants thereof. 

10 

The term "percent identity" or "percent identical," when referring to a 
sequence, means that a sequence is compared to a claimed or described 
sequence after alignment of the sequence to be compared (the "Compared 
Sequence") with the described or claimed sequence (the "Reference Sequence"). 
1 5 The Percent Identity is then determined according to the following formula: 

Percent Identity = 100[1-(C/R)] 

wherein C is the number of differences between the Reference Sequence and the 
20 Compared Sequence over the length of alignment between the Reference 
Sequence and the Compared Sequence wherein (i) each base or amino acid in the 
Reference Sequence that does not have a corresponding aligned base or amino 
acid in the Compared Sequence and (ii) each gap in the Reference Sequence and 
(iii) each aligned base or amino acid in the Reference Sequence that is different 
25 from an aligned base or amino acid in the Compared Sequence, constitutes a 
difference; and R is the number of bases or amino acids in the Reference 
Sequence over the length of the alignment with the Compared Sequence with any 
gap created in the Reference Sequence also being counted as a base or amino 
acid. 

30 



If an alignment exists between the Compared Sequence and the Reference 
Sequence for which the percent identity as calculated above is about equal to or 
greater than a specified minimum Percent Identity then the Compared Sequence 
has the specified minimum percent identity to the Reference Sequence even 
5 though alignments may exist in which the hereinabove calculated Percent Identity is 
less than the specified Percent Identity. 

The term "expression product" means that polypeptide or protein that is the 
natural translation product of the gene and any nucleic acid sequence coding 
10 equivalents resulting from genetic code degeneracy and thus coding for the 
same amino acid(s). 

The term "active fragment," when referring to a coding sequence, means a 
portion comprising less than the complete coding region whose expression 
15 product retains essentially the same biological function or activity as the 
expression product of the complete coding region. 

The term "primer 0 means a short nucleic acid sequence that is paired with 
one strand of DNA and provides a free 3-OH end at which a DNA polymerase 
20 starts synthesis of a deoxyribonucleotide chain. 

The term "promoter" means a region of DNA involved in binding of RNA 
polymerase to initiate transcription. The term "enhancer" refers to a region of 
DNA that, when present and active, has the effect of increasing expression of a 
25 different DNA sequence that is being expressed, thereby increasing the amount 
of expression product formed from said different DNA sequence. 

The term "protein domain" refers to a discrete portion of a single 
polypeptide chain with its own function. The combination of domains in a 
30 single protein determines its overall function. Protein domains can act as 
independent units, to the extent that they can be excised from the chain, 
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and still be shown to fold correctly, and often still exhibit biological activity. 
Another property of domains is that they are regions which are usually 
conserved during recombination events. This means that along a protein 
sequence, the domains will tend to be fairly well conserved, and conversely, 
5 the interdomain regions will be more divergent 

As used herein, the term "conservative amino acid substitution" are 
defined herein as exchanges within one of the following five groups: 

I. Small aliphatic, nonpolar residues: 
10 Ala, Gly; 

II. Negatively charged residues: 

Asp, Glu 

III. Positively charged residues: 

His, Arg, Lys; 
1 5 IV. Large, aliphatic, nonpolar residues: 

Met, Leu, He, Val, Cys 

V. Aromatic residues: 

Phe, Tyr, Trp, Pro, 

VI. Polar residues 
20 Ser, Thr 

VII. Amides 

Asn, Gin 

25 

BRIEF DESCRIPTION OF THE DRAWING 

Figure 1 contains a listing of the sequences disclosed according to the 
present invention. The genes described herein are listed according to their 
30 Gencarta names or accession Numbers (which are reproduced in Table 6) and 
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each is then followed by a listing of relevant transcripts and polypeptides 
encoded thereby. The corresponding SEQ ID NOs: are provided in Table 6. 

5 

DETAILED SUMMARY OF THE INVENTION 

The present invention relates to processes for identifying and/or utilizing 
cancer-target genes, and expression products of such genes, as targets for 
1 0 chemotherapeutic agents, especially anti-cancer agents. 

Genes whose expression, or non-expression, or change in expression, are 
indicative of the cancerous or non-cancerous status of a given cell and whose 
expression is changed in cancerous, as compared to non-cancerous cells, from a 

15 specific tissue, are genes that are disclosed herein or that are identified by 
methods disclosed herein. These include genes having structural and/or 
functional similarity to the genes disclosed herein and include genes that are 
substantially identical to said genes. In terms of nucleotide sequence, such 
genes are at least about 90% identical, preferably 95% identical, most preferably 

20 at least about 98% identical and especially where such gene is a gene of 
KIAA1274, NEK6, PAK2, PAK4, STK38L, ACP1, ARHC, CDC6, CDK7, CDKN3, 
CRK7, DUSP16, FIGNL1, GUK1, ITPR2, KCNK1, KCNK5, PRO2000, RFC2 and 
RIPK2. 

25 The genes disclosed herein according to the invention were identified 

within an amplified chromosomal region in a cancer cell line(s) and exhibit RNA 
over-expression in the cell line(s) and clinical tumor tissues by Affymetrix 
microarray analysis. Each disclosed gene contains sequences that encode a 
protein domain previously described as being modulated by chemical 

30 compounds. 
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Each such gene was identified in a cancer cell line(s) for which high- 
resolution comparative genomic hybridization (CGH) data and Affymetrix U133 
chip expression data were generated. Each meets the following criteria: 

5 

1) At least 5 fold over-expressed in a cancer cell line(s) and mapped to a 
chromosomal region with a CGH ratio of 1 .25 or above. 

2) RNA expression level of at least 1.5 fold or higher in tumor tissue samples 
10 compared to corresponding normal tissue samples in a genetic database 

(with Gene Logic GX2000 database being a non-limiting example). 

3) The gene encodes a protein domain known to be modulated by chemical 
compounds (i.e., a "druggable" domain). The genes identified herein 

1 5 represent a subset of all genes in these classes. 

In accordance with the foregoing, the present invention relates to 
nucleotide sequences and derived polypeptides having the following 
characteristics: 

20 

Arbitrary Gene Name: KIAA1274 

Description: KIAA protein (similar to mouse paladin) 

UniGene: Hs.300646 

Accession: AB033100 
25 Gencarta No. AA553584 (Gene 2) 

Cytogenetic location: 10q22.1 

Affymetrix fragment: 231887_s_at 

Affymetrix ID: 262356 

Cell lines involved: SW620 
30 Druggable domain: phosphatase 



Arbitrary Gene Name: NEK6 

Description: NIMA (never in mitosis gene a)-related kinase 6 
35 UniGene: Hs.9625 
Accession: BE6 16825 
Gencarta No. T11445 (Gene 12) 
Cytogenetic location: 9q33.3 
Affymetrix fragment: 223158_s_at 
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Affymetrix ID: 253651 
Cell lines involved: Colo205 
Druggable domain: kinase 

5 NIMA-related kinases (NEKs) are mammalian serine/threonine protein kinases 
structurally related to Aspergillus NIMA (never in mitosis, gene A), which play 
essential roles in mitotic signaling. 



1 0 Arbitrary Gene Name: PAK2 

Description: p21 -activated kinase 2 

UniGene: Hs.56974 

Accession: BF796470 

Gencarta No. Z26993 (Gene 17) 
1 5 Cytogenetic location: 3q29 

Affymetrix fragment: 208875_s_at 

Affymetrix ID: 239505 

Cell lines involved: HCC1954, HCC202, HCC70, MDA_MB453, T47D 
Druggable domain: kinase 

20 

The p21-activated kinases (PAK) are critical effectors that link Rho GTPases to 
cytoskeleton reorganization and nuclear signaling. The PAK proteins are a family 
of serine/threonine kinases that serve as targets for the small GTP binding 
proteins, CDC42 and RAC1, and have been implicated in a wide range of 
25 biological activities. 



Arbitrary Gene Name: PAK4 

Description: p21 -activated kinase 4 
30 UniGene: Hs.20447 

Accession: AF005046 

Gencarta No. R09837 (Gene 8) 

Cytogenetic location: 19q13.2 

Affymetrix fragment: 33814_at 
35 Affymetrix ID: 107335 

Cell lines involved: HCC202, MDA_MB468 

Druggable domain: kinase 



40 The p21 activated kinases (PAK) are critical effectors that link Rho 

GTPases to cytoskeleton reorganization and nuclear signaling. The PAK proteins 
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are a family of serine/threonine kinases that serve as targets for the small GTP 
binding proteins, CDC42 and RAC1, and have been implicated in a wide range of 
biological activities. 

5 

Arbitrary Gene Name: STK38L 

Description: serine/threonine kinase 38 like 

UniGene: Hs.184523 

Accession: AW779556 
1 0 Gencarta No. R14324 (Gene 9) 

Cytogenetic location: 12p1 1 .23 

Affymetrix fragment: 212565_at 

Affymetrix ID: 243092 

Cell lines involved: HCC827, BEN 
1 5 Druggable domain: kinase 



Arbitrary Gene Name: ACP1 

Description: acid phosphatase 1, soluble 
20 UniGene: Hs.75393 

Accession: BE872974 

Gencarta No. HUMAAPA (Gene 6) 

Cytogenetic location: 2p25.3 

Affymetrix fragment: 201629_s_at 
25 Affymetrix ID: 232293 

Cell lines involved: BEN, NCI-H460, NCI-H522, MCF7, MDA-MB436, MDA- 

MB468 

Druggable domain: phosphatase 

30 

The product of this gene belongs to the phosphotyrosine protein 
phosphatase family of proteins. It functions as an acid phosphatase and a protein 
tyrosine phosphatase by hydrolyzing protein tyrosine phosphate to protein 
tyrosine and orthophosphate. This gene is genetically polymorphic, and three 
35 common alleles segregating at the corresponding locus give rise to six 
phenotypes. Each allele appears to encode at least two electrophoretically 
different isozymes, Bf and Bs, which are produced in allele-specific ratios. Three 
transcript variants encoding distinct isoforms have been identified for this gene 
(Bryson et al., Genomics 1995 Nov 20;30(2):1 33-40). 

40 

11 



Arbitrary Gene Name: ARHC 

Description: ras homolog gene family, member C (hypothetical protein 
5 MGC19531) 

UniGene: Hs.446391 
Accession: AW1 1 7553 
Gencarta No. AA383349 (Gene 1) 
Cytogenetic location: 1p13.2 
1 0 Affymetrix fragment: 229484_at 
Affymetrix ID: 259953 
Cell lines involved: HCC202, MDA_MB468 
Druggable domain: phosphatase 

15 

ARHC (UniGene Hs. 179735) sits next to the gene for hypothetical protein 
MGC 19531 . The two sequences are in close proximity and they are annotated as 
the same gene in GenCarta, but they are listed as two distinct genes in UCSC 
Goldenpath. The Affymetrix fragment maps within the sequence for hypothetical 
20 protein MGC 19531, but we are inferring that this fragment is detecting 
expression for ARHC. 

ARHC encodes a ras-related GTP binding protein of the rho subfamily, 
member C (RhoC) that regulates remodeling of the actin cytoskeleton during cell 

25 morphogenesis and motility. Up regulation of RhoC through increased 
expression of ARHC has been reported in breast, ovarian and pancreatic cancer 
as well as melanoma and has been associated with progression to a metastatic 
phenotype in each cancer type (van Golen et al., Cancer Res. 2000 Oct 
15;60(20):5832-8, Horiuchi A et al. Lab Invest. 2003 83(6):861-70, Suwa et al. Br 

30 J Cancer. 1998 77(1):147-52, Clark et al., Nature, 2000 406(6795):532-5). 



Arbitrary Gene Name: CDC6 
35 Description: CDC6 cell division cycle 6 homolog (S. cerivisiae) 
UniGene: Hs.69563 
Accession: U77949 
Gencarta No. T83032 (Gene 16) 
Cytogenetic location: 17q21.3 
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Affymetrix fragment: 203967_at 
Affymetrix ID: 234629 
Cell lines involved: NCI-H522, NCI-H23 
Druggable domain: AAA ATPase 

5 

Yan et al. [Proc Nat Acad Sci 94:142-147 (1998)] showed that CDC6 is 
expressed selectively in proliferating but not quiescent mammalian cells, both in 
culture and within tissues in intact animals. During the transition from a growth- 

1 0 arrested to a proliferative state, transcription of mammalian CDC6 is regulated by 
E2F proteins as revealed by a functional analysis of the promoter and by the 
ability of exogenously expressed E2F proteins to stimulate endogenous CDC6. 
Immunodepletion of CDC6 protein by microinjection of anti-CDC6 antibody 
blocked initiation of DNA replication in a human tumor cell line. The authors 

1 5 concluded that expression of human CDC6 is regulated in response to mitogenic 
signals through transcriptional control mechanisms involving E2F proteins, and 
that CDC6 protein is required for initiation of DNA replication in mammalian cells. 



20 

Arbitrary Gene Name: CDK7 

Description: Cyclin-dependent kinase 7 (M015 homolog, Xenopus laevis) 

UniGene: Hs. 184298 

Accession: X77743 
25 Gencarta No. F02366 (Gene 4) 

Cytogenetic location: 5q13.2 

Affymetrix fragment: 21 1297_s_at 

Affymetrix ID: 241855 

Cell lines involved: SW620 
30 Druggable protein domain: kinase 



The protein encoded by CDK7 is a member of the cyclin-dependent 
protein kinase (CDK) family, which are known to be important regulators of cell 
35 cycle progression. This protein forms a trimeric complex with cyclin H and MAPI, 
which functions as a Cdk-activating kinase (CAK) (Fisher and Morgan, Cell 
78:713-724,1994). It is an essential component of the transcription factor IIH 
(TFHH) that is involved in transcription initiation and DNA repair (Shiekhattar et 



13 



al., Nature 374: 283-287, 1995). This protein is thought to serve as a direct link 
between the regulation of transcription and the cell cycle. 



5 

Arbitrary Gene Name: CDKN3 

Description: cyclin-dependent kinase inhibitor 3 (CDK2-associated dual 
specificity phosphatase) 
UniGene: Hs.84113 
1 0 Accession: AF21 3033 

Gencarta No. HUMPTPB (Gene 7) 
Cytogenetic location: 14q22.2 
Affymetrix fragment: 20971 4_s at 
Affymetrix ID: 240337 

15 Cell lines involved: NCI-H460, HCC827, NCI-H23, HCC202, MCF7, MDA- 
MB453, T47D 

Druggable domain: phosphatase 



20 The protein encoded by this gene is a human dual specificity protein 

phosphatase that was identified as a cyclin-dependent kinase inhibitor, and has 
been shown to interact with and dephosphorylate CDK2 kinase and thus prevent 
the activation of CDK2 kinase. The gene has been reported to be deleted, 
mutated, or overexpressed in several kinds of cancers. 

25 

Lee et al. [Mot Cell Biol. 2000 Mar;20(5): 1723-32} identified the CDKN3 as 
an overexpressed gene in breast and prostate cancer by using a phosphatase 
domain-specific differential-display PCR strategy. They report in normal cells, 
CDKN3 protein is primarily found in the perinuclear region, but in tumor cells, a 

30 significant portion of the protein is found in the cytoplasm. Blocking CDKN3 
expression by antisense in a tetracycline-regulatable system resulted in a 
reduced population of S-phase cells and reduced Cdk2 kinase activity. 
Furthermore, lowering CDKN3 expression led to inhibition of the transformed 
phenotype, with reduced anchorage-independent growth and tumorigenic 

35 potential in athymic nude mice. They suggest that therapeutic intervention might 
be aimed at repression of CDKN3 gene overexpression in human breast and 
prostate cancer. 
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Yeh et al. [Cancer Res. 2000 Sep 1;60(17):4697-4700] analyzed CDKN3 
mRNA in hepatocellular carcinoma by reverse transcription-PCR (RT-PCR), 
followed by cloning and sequencing. They found aberrant CDKN3 transcripts in 
5 hepatocellular tumors and showed mutant proteins were defective in interacting 
with CdkZ 



Arbitrary Gene Name: CRK7 
1 0 Description: CDC2-related protein Kinase 7 

UniGene: Hs.278346 

Accession: AI651265 

Gencarta No. T60764 (Gene 14) 

Cytogenetic location: 17q12 
1 5 Affymetrix fragment: 225697_at 

Affymetrix ID: 256169 

Cell lines involved: HCC1954, HCC202, SKBR3 
Druggable domain: kinase 

20 

Ko et al. [Journal of Cell Science, 114,2591-2603 (2001)] isolated and 
characterized CrkRS, CDC2-related kinase 7, as a novel human protein with an 
arginine/serine-rich (RS) domain that is most closely related to the cyclin- 
dependent kinase family. They report CrkRS is a 1490 amino acid protein where 
25 the protein kinase domain is 89% identical to CHED protein kinase. CrkRS has 
extensive proline-rich regions that match the consensus for SH3 and WW 
domain binding sites and RS domain that is predominantly found in splicing 
factors. The authors describe CrkRS as a novel, conserved link between the 
transcription and splicing machinery of a cell. 

30 

Arbitrary Gene Name: DUSP16 
Description: dual specificity phosphatase 16 
UniGene: Hs.20281 
35 Accession: AB051487 

Gencarta No. T23935 (Gene 13) 
Cytogenetic location: 12p13.2 
Affymetrix fragment: 224832_at 
Affymetrix ID: 255305 
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Cell lines involved: HCC827 
Druggable domain: phosphatase 



5 Mitogen-activated protein kinase (MAPK) phosphatases (MKPs) 

negatively regulate MAPK activity. DUSP16 is a dual specificity phosphatase that 
functions as a MAPK phosphatase, also known as MKP7. Masuda et al. [J Biol 
Chem. 2001 276(42):39002-11J showed that MAPK7 behaves as a nuclear 
shuttle for c-Jun terminal kinase (JNK) group of MAPKs as well as a 
1 0 phosphatase. 



Arbitrary Gene Name: FIGNL1 
Description: fidgetin-like 1 

15 UniGene: Hs. 13751 6 
Accession: AA805691 
Gencarta No. H61320 (Gene 5) 
Cytogenetic location: 7p12.2 
Affymetrix fragment: 222843_at 

20 Affymetrix ID: 253337 

Cell lines involved: SW620, BEN, HCC827 
Druggable domain: AAA ATPase 



Arbitrary Gene Name: GUK1 

Description: guanylate kinase 1 

UniGene: Hs.3764 
30 Accession: BC006249 

Gencarta No. T08090 (Gene 1 1) 

Cytogenetic location: 1q42.13 

Affymetrix fragment: 200075_s_at 

Affymetrix ID: 231232 
35 Cell lines involved: HCC202, MDA_MB436, MDA_MB453, MDA_MB468 

Druggable domain: kinase 



Guanylate kinase catalyzes the phosphorylation of either GMP to GDP or 
40 dGMP to dGDP and is an essential enzyme in nucleotide metabolism pathways. 
There are several isoforms, GUK2 and GUK3, determined by different loci. Brady 
et al. [J Biol Chem. 1996 12;271 (28): 16734-40] stated that the guanylate kinases 
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are targets for cancer chemotherapy and are inhibited by the drug 6-thioguanine. 
They report a model of the tertiary structure designed to be used in the 
development of chemotherapy drugs. 

5 

Arbitrary Gene Name: ITPR2 

Description: inositol 1,4,5-triphosphate receptor, type 2 

UniGene 

Accession: 026350 
1 0 Gencarta No. Z38709 (Gene 1 8) ' 

Cytogenetic location: 12p11.23 

Affymetrix fragment: 21 1360_s_at 

Affymetrix ID: 241911 

Cell lines involved: BEN, HCC827 
1 5 Druggable domain: Ion transport 



Arbitrary Gene Name: KCNK1 
20 Description: potassium channel, subfamily K, member 1 

UniGene: Hs.79351 

Accession: U90065 

Gencarta No. Z39663 (Gene 19) 

Cytogenetic location: 1q42.2 
25 Affymetrix fragment: 204678_s_at 

Affymetrix ID: 235340 

Cell lines involved: HCC202, HCC70, MDA_MB436, MDA_MB453, MDA_MB468 
Druggable domain: potassium channel 



This gene encodes one of the members of the superfamily of potassium 
channel proteins containing two pore-forming P domains and 4 transmembrane 
segments. Potassium channels are functionally important to a large number of 
cellular processes including maintenance of the action potential, muscle 
35 contraction, hormone secretion, osmotic regulation and ion flow. 



Arbitrary Gene Name: KCNK5 
40 Description: potassium channel, subfamily K, member 5 
UniGene: Hs. 127007 
Accession: AI678413 
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Gencarta No. R25184 (Gene 10) 
Cytogenetic location: 6p21.2 
Affymetrix fragment: 69854_at 
Affymetrix ID: 153971 
5 Cell lines involved: Colo201 , Colo205 
Druggable domain: K+ channel 



This gene encodes one of the members of the superfamily of potassium 
1 0 channel proteins containing two pore-forming P domains. 



Arbitrary Gene Name: PRO2000 
1 5 Description: Hypothetical protein MGC5254 

UniGene: Hs.222088 

Accession: AI925583 

Gencarta No. Z44462 (Gene 20) 

Cytogenetic location: 8q24.13 
20 Affymetrix fragment: 222740_at 

Affymetrix ID: 253234 

Cell lines involved: BT549, HCC1954, HCC202, HCC70, Hs578t, MCF7, 
MDA_MB231, MDA_MB436, MDA_MB453, SKBR3, T47D, Colo201, HCT116, 
SW620, HT29, HCC827, NCI-H23, NCI-H460 
25 Druggable domain: AAA ATPase 



A large family of ATPases has been described, whose key feature is that 
they share a conserved region of about 220 amino acids that contains an ATP- 
30 binding site. The protein encoded by PRO2000 contains two AAA (ATPases 
Associated with diverse cellular Activities) domains as well as a bromodomain. 
AAA family proteins often perform chaperone-like functions that assist in the 
assembly, operation, or disassembly of protein complexes. The exact function of 
the PRO2000 protein is unknown. 

35 

Fellenberg et al. [Int J Cancer 105(5);636-643 (2003)] report PRO2000 is 
up-regulated >2 fold in osteosarcoma cell line (Saos-2) following treatment with 
cisplatin, methotrexate and doxorubicin. 

40 
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Arbitrary Gene Name: RFC2 
Description: replication factor C (activator 1) 2, 40kDa 
UniGene: Hs.1 39226 
5 Accession: M87338 

Gencarta No. T62520 (Gene 15) 
Cytogenetic location: 7q11.23 
Affymetrix fragment: 1053_at 
Affymetrix ID: 113880 
1 0 Cell lines involved: HCC827, NCI_H23, NCI_H522 
Druggable domain: AAA ATPase 



The elongation of primed DNA templates by DNA polymerase delta and 
1 5 epsilon requires the action of the accessory proteins proliferating cell nuclear 
antigen (PCNA) and replication factor C (RFC). RFC, also called activator 1, is a 
protein complex consisting of five distinct subunits of 145, 40, 38, 37, and 36.5 
kD. This gene encodes the 40 kD subunit, which has been shown to be 
responsible for binding ATP. Alternatively spliced transcript variants encoding 
20 distinct isoforms have been described. 



Arbitrary Gene Name: RIPK2 
25 Description: receptor-interacting serine-threonine kinase 2 

UniGene: Hs. 103755 

Accession: AF064824 

Gencarta No. D61791 (Gene 3) 

Cytogenetic location: 8q21.3 
30 Affymetrix fragment: 209545_s_at 

Affymetrix ID: 240173 

Cell lines involved: HCT116 

Druggable domain: kinase 

35 

The methods of the invention utilize these genes, designated as 
KIAA1274, NEK6, PAK2, PAK4, STK38L, ACP1, ARHC, CDC6, CDK7, CDKN3, 
CRK7, DUSP16, FIGNL1, GUK1, ITPR2, KCNK1, KCNK5, PRO2000, RFC2 and 
RIPK2. 
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The genes disclosed herein may be used in any of the methods of the 
invention and modulation, as used herein, may include modulation of the 
gene, such as an increase or decrease in transcription or translation, and may 
include differences in the amount and/or the rate of production of RNA 
and/or polypeptide. Such modulation may affect any of the transcripts 
disclosed for the genes of the invention or any of the encoded polypeptides, 
as identified in Table 6. Antibodies useful in the invention would include 
those specific for any of the polypeptides encoded by these genes, especially 
any polypeptide whose sequences are provided in Figure 1, as identified in 
Table 6. A brief summary of these genes identified by their respective 
GenBank Accession Nos. is provided in Table 1 . 



Table 1 . Brief summary of cancer target genes. 



Accession unigene affy 

AI651265 Hs.278346 256169 

U77949 Hs.69563 234629 

X77743 Hs.1 84298 241855 



AA805691 

AI925583 

D26350 

BE616825 

BF796470 

AF005046 

U90065 

AF064824 

M87338 

AW779556 

AI678413 

AF2 13033 



Hs. 13751 6 
Hs.222088 

Hs.406293 
Hs.9625 
Hs.56974 
Hs.20447 
Hs.79351 
Hs. 103755 
Hs. 139226 
Hs. 184523 
Hs. 127007 
Hs.84113 



BE872974 Hs.75393 
AW1 17553 Hs.446391 

BC006249 Hs.3764 
AB033100 Hs.300646 
AB051487 Hs.20281 



253337 
253234 
241911 
253470 
253651 
239505 
107335 
235340 
240173 
113180 
243092 
153971 
240337 

232293 
259953 

231232 
262356 
255305 



Description 

CDC2-related protein kinase 7 
CDC6 cell division cycle 6 homolog (S. 
cerevisiae) 

cyclin-dependent kinase 7 (M015 homolog, 
Xenopus laevis, cdk-activating kinase) 
fidgetin-like 1 

hypothetical protein MGC5254 
inositol 1,4,5-triphosphate receptor, type 2 
neurotrophic tyrosine kinase, receptor, type 1 
NIMA (never in mitosis gene a)-related kinase 6 
p21 (CDKNIA)-activated kinase 2 
p21(CDKN1A)-activated kinase 4 
potassium channel, subfamily K, member 1 
receptor-interacting serine-threonine kinase 2 
replication factor C (activator 1) 2, 40kDa 
serine/threonine kinase 38 like 
potassium channel, subfamily K, member 5 
cyclin-dependent kinase inhibitor 3 (CDK2- 
associated dual specificity phosphatase) 
acid phosphatase 1, soluble 
hypothetical protein MGC19531 (ras homolog 
gene family, member C) 
guanylate kinase 1 

KIAA protein (similar to mouse paladin) 
dual specificity phosphatase 16 
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Table 2 describes the location of the cancer target genes of the present 
invention while Table 3 describes primers used to locate these genes. An 
additional set of primers is provided in Table 5 while additional gene data is 
provided in Table 4. 

The nucleotides and polypeptides, as gene products, used in the methods of 
the present invention may comprise a recombinant polynucleotide or polypeptide, a 
natural polynucleotide or polypeptide, or a synthetic polynucleotide or polypeptide, 
preferably a recombinant polynucleotide or polypeptide. 

Table 2. Chromosome Location of Cancer Target Genes 



Accession 


Chromosome 


band 


Description 


indication 


Primer 


AI651265 


chr17 


q12 


kinase 




PR3869 


U77949 


chr17 


q21.2 


AAA ATPase 


breast 


PR3870 


X77743 


chr5 


q13.2 


kinase 


ovary 


PR3871 


AA805691 


chr7 


p12.2 


AAA ATPase 


lung 


PR3872 


AI925583 


chr8 


q24.13 AAA ATPase 


PR3873 


D26350 


chr12 


p11.23 


ion transport 




PR3874 




chrl 


q21.3 


tk 


melanoma 


PR3875 










and LN 












mets 




BE616825 


chr9 


q33.3 


kinase 


sarcoma 


PR3876 


BF796470 


chr3 


q29 


kinase 


ovary 


PR3877 


AF005046 


chr19 


q13.2 


kinase 


PR3878 


U90065 


chM 


q42.2 


K+ channel 


pancreas 


PR3879 


AF064824 


chr8 


q21.3 


kinase 


ovary 


PR3880 


M87338 


chr7 


q11.23 AAA ATPase 


PR3881 


AW779556 


chr12 


p11.23 


kinase 


pancreas 


PR3882 


AI678413 


chr6 


p21.2 


K+ channel 




PR3883 


AF2 13033 


chr14 


q22.2 






PR3884 


BE872974 


chr2 


p25.3 






PR3885 


AW1 17553 


chrl 


p13.2 


Phosphatase 


breast 


PR3886 


BC006249 


chM 


q42.13 






PR3887 


AB033100 


chMO 


q22.1 






PR3888 


AB051487 


chM 2 


p13.2 






PR3889 
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Fragments of such polynucleotides and polypeptides as are disclosed herein 
may also be useful in practicing the processes of the present invention. For 
example, a fragment, derivative or analog of a polypeptide encoded by one of 
KIAA1274, NEK6, PAK2, PAK4, STK38L, ACP1, ARHC, CDC6, CDK7 t CDKN3, 
5 CRK7, DUSP16, FIGNL1, GUK1, ITPR2, KCNK1, KCNK5, PRO2Q00, RFC2 and 
RIPK2 may be (i) one in which one or more of the amino acid residues are 
substituted with a conserved or non-conserved amino acid residue (preferably a 
conserved amino acid residue) and such substituted amino acid residue may or 
may not be one encoded by the genetic code, or (ii) one in which one or more of 

10 the amino acid residues includes a substituent group, or (iii) one in which the 
mature polypeptide is fused with another compound, such as a compound to 
increase the half-life of the polypeptide (for example, polyethylene glycol), or (iv) 
one in which the additional amino acids are fused to the mature polypeptide, such 
as a leader or secretory sequence or a sequence which is employed for purification 

1 5 of the mature polypeptide or a proprotein sequence. Such fragments, derivatives 
and analogs are deemed to be within the scope of those skilled in the art from the 
teachings herein. 

The genes and gene products useful in practicing the methods of the 
20 present invention may likewise be obtained in an isolated or purified form. In 
addition, the polypeptide disclosed herein as being useful in practicing the 
processes of the invention include different types of proteins in terms of function so 
that, as recited elsewhere herein, some are enzymes, some are transcription 
factors and other may be cell surface receptors. Precisely how such cancer-linked 
25 proteins are used in the processes of the invention may thus differ depending on 
the function and cellular location of the protein and therefore modification, or 
optimization, of the methods disclosed herein may be desirable in light of said 
differences. For example, a cell-surface receptor is an excellent target for cytotoxic 
antibodies whereas a transcription factor or enzyme is a useful target for a small 
30 organic compound with anti-neoplastic activity. 
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Table 3. Primers used to Identify Genes 



Accession 



Left primer 1 



Right primer 1 



AI651265 
U77949 
X77743 
AA805691 
AI925583 
D26350 

BE616825 
BF796470 
AF005046 

U90065 
AF064824 

M87338 

AW779556 
AI678413 
AF213033 
BE872974 
AW1 17553 
BC006249 
AB033100 
AB051487 



TCTTGGGCATCTCAACG 

TATTCAGCTGGCATTTAGAGAGC 

TGGACAACATTTTACTACTGAGGG 

TTAGTCCTGATAAAAATTAAAAACC 

TAAACCATTTGGGATGGCAT 

CTCTGAGGACATTCCCGTTAGAA 

ATGACATGGGGCTTGC 

TCTTCATGAATTCTAAGTAACTC 

TAACAAGCGATTCTAAACCACC 

AACTAACTCGAGGCAGGGGT 

TTCCCCTTATTTTATTGTAGCAA 

AAATGGGGACAGGAAGCC 

CAACAAACACTGCAAGGCTT 



TGCCACCAAAACATTTTTGA 

AACTACTACACACAGAAGCTGC 

CCATGTCTGAAATGTCAGTTCTC 

TCAGAGGCAAAGTGGTTCAG 

TCTTGACACATACGAAGCC 

AGGCTTGCTGTCTGTTCTCG 

CTTCTCCTCAGTCTCAAACCCAA 

ATCCCATTTTAAACAATTCTTTGA 



CACATAAAGCAGGTTGTAGAACG 

ACACTTGCAAGCACATTGGC 

TAAGTTTTCCCTAATGCATTTTCA 

AGAAAGTGCCTTCCTGATG 

TGGAACAAGCTGTTAACACCC 

CAGGTGTTTCAAGGAAGAGGAAA 

AACACCTGAGGGGGCTT 

GTCACTGACTTCATGACA 

ATGGATGCAAATTCTTTAAGCA 

CTGCCCTTATTGGGGGAC 

GGTTTATGTGTACTGGTTTGCA 

GCTTAATTGCCCTACAAAGGG 

TCTCCATCCTGGGGAAAAA 

ATGTGAGGGGATATTGCTGC 

AAGCCAGCTTCAGATGTATAT 

AAAACTTTAGGAATATCTGCACATG 

AATCAGTCGTTGGCACCTTC 

GTAGAAGCAGAGTCCCTGG 

TTTATTAGGATGTCAGCCCTGG 

ATCCATCTCTCTGACAGTGCTGA 

GCTGAACCACCAGGAACCT 



5 
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Table 4. Further Description of Cancer Target Genes 



Accession 






UniSTS 


Gencarta 










Name 


AI651265 


PR3890 


CRK7 


SHGC-58832 


T60764 


U77949 


PR3891 


CDC6 


RH70424 


T83032 


X77743 


PR3892 


CDK7 


SHGC-149358 


F02366 


AA805691 


PR3893 


FIGNL1 


RH1 03568 


H61320 


AI925583 


PR3894 


PRO2000 


RH80934 


Z44462 


D26350 


PR3895 


ITPR2 


SHGC-1 06565 


Z38709 




PR3896 




SHGC-69193 




BE616825 


PR3897 


NEK6 


RH62928 


T11445 


BF796470 


PR3898 


PAK2 


SHGC-35416 


Z26993 


AF005046 


PR3899 


PAK4 


RH39107 


R09837 


U90065 


PR3900 


KCNK1 


55164 


Z39663 


AF064824 




RIPK2 




D61791 


M87338 


PR3901 


RFC2 


47404 


T62520 


AW779556 


PR3902 


STK38L 


182659 


R14324 


AI678413 


PR3903 


KCNK5 


83108 


R25184 


AF213033 


PR3904 


CDKN3 


24341 


HUMPTPB 


BE872974 


PR3905 


ACP1 


91295 


HUMAAPA 


AW1 17553 


PR3906 


ARHC 


RH49960 


AA383349 


BC006249 


PR3907 


GUK1 


38548 


T08090 


AB033100 


PR3908 


KIAA1274 


148013 


AA553584 


AB051487 


PR3909 


DUSP16 


85676 


T23935 
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Table 5. Additional Primers for Cancer Target Genes 



Accession Left primer 2 Right primer 2 

AI651265 GTGGGGCCCAATAACTCAAA TTTTGAATCTGGCCTTGCCT 

U77949 TTATGACCCCAACGCC AAGCAAGTCCACATGGAG 

X77743 CAGAGGTTCCCTCTTAAAAATTCA AAAGTGAAGTATTGGCTGGGC 

AA805691 CCATCCATGGAATCCTAGACA TTATCCTACCACTTTGCGGG 

AI925583 AAGAGTTGGCCAAACTTCAACTATT TGTCATGTCCGCCTAATTGA 

D26350 CAAAGCCTCAAGACCTTTTTCAA AAGGTACCAGCTAAACCTCTTTGC 

GAGAAAGGGAGGGATCGTTC TGTGAGGGGCTATGCTGG 

BE616825 TTCCACTTTATCCCTTTACAACA GGCTTATGCTAACAGGAGACTTG 

BF796470 TCACTGCTGTGGCCTCATAC TCAGTCCACAATTCCTTCTGG 

AF005046 GGGGGACGCTGTCATTCAC TTCCCAGTACCGCAGAGCC 

U90065 GGTCCTCTACTTCCACAT GCTCTCTGAATTTTTGATT 
AF064824 

M87338 GCAGAGACTTCACTGACTGAC TGACCTCAGGTGATCCACCTG 

AW779556 TTTAGCAAAACTTGGAGCTGGAG AAAACCATTCTCTACTAACTACCCCC 

AI678413 TTTTGCAAGGCAACTGAGG GATACGGCAGCCTCTACTGC 

AF213033 CACATGGCCTAGTAGTTTGG GTTCCAACTGCTTAGATCAGC 

BE872974 TGAACAAAGAGCTGGGCTTT ACTGAGGCAGGTTCGTGC 

AW1 1 7553 AGCCTGTAGCCTTTATCCATG CTTCTGGCTCACAGGAAAATG 

BC006249 CTGCTCTTTACCTGGGGTTG GAGCCACAGAGGAGTGAAGG 

AB033100 ACATGTGCCCTACACACAC AGCTGTCACATAAATAGAACCC 

AB051487 ATCAGACATTCTCAAGTTTCACACA GGACCATGGCCAAGAGAAG 



5 Expression products of the genes disclosed herein for use in the methods of 

the invention may be in an isolated form. 

Methods of producing vectors comprising genes disclosed herein, or 
recombinant cells expressing such genes, are well known to those skilled in the 

10 molecular biology art. See, for example, Sambrook, et al., Molecular Cloning: A 
Laboratory Manual, Second Edition, Cold Spring Harbor, N.Y., (1989), Wu et al, 
Methods in Gene Biotechnology (CRC Press, New York, NY, 1997), and 
Recombinant Gene Expression Protocols, in Methods in Molecular Biology, Vol. 
62, (Tuan, ed., Humana Press, Totowa, NJ, 1997), the disclosures of which are 

1 5 hereby incorporated by reference. 
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In one aspect, the present invention relates to a method for identifying a 
cancer-target gene, comprising: 

a) identifying a gene that is at least 5 fold over-expressed in a cancer cell 
line and that maps to a chromosomal region with a CGH ratio of at least 1.25; 
5 b) determining an RNA expression level of said gene of at least 1 .5 fold in 

a tumor tissue compared to corresponding normal tissue in a genetic database, 

c. determining that said gene encodes a protein domain known to be 
modulated, or shown to be modulated, by chemical compounds 

wherein a gene that meets the criteria of a, b and c is considered to be a 
1 0 cancer-target gene, 

thereby identifying a cancer-target gene. 

The present invention also relates to a set of cancer-target genes 
identified using such methods. The genes disclosed herein form such a set. In 
1 5 addition, subsets of such sets are specifically contemplated by the invention. 

In another aspect, the present invention relates to a method for identifying 
an agent that modulates the activity of a cancer-target gene comprising: 

(a) contacting a test compound with a cell containing a polynucleotide that 
20 corresponds to a gene that has the properties of a, b and c of claim 1 and under 

conditions promoting the expression of said gene, and 

(b) determining a difference in expression of said gene relative to when 
said test compound is not present wherein said difference indicates gene 
modulating activity, 

25 thereby identifying an agent that modulates the activity of a cancer-related 

gene. 

In a preferred embodiment, said gene was first identified as a cancer 
target gene using one or more of the methods of the invention. 

30 
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In another preferred embodiment, the gene is a gene selected from the 
group consisting of KIAA1274, NEK6, PAK2, PAK4, STK38L. ACP1, ARHC, 
CDC6, CDK7 t CDKN3, CRK7 t DUSP16, FIGNL1, GUK1, ITPR2, KCNK1. 
KCNK5, PRO2000, RFC2 and RIPK2. These 20 genes are contained in Table 6 
5 where they are described in terms of a consensus sequence along with identified 
polynucleotide transcripts and polypeptides. 

In the methods of the invention, expression may be determined by 
determining transcription (to form RNA), as by measuring the rate or amount of 
10 RNA formed, or translation (to form protein), such as where antibodies may be 
used to determine the amount of polypeptide or protein formed from the gene in 
question or where the activity of such protein is determined, such as where the 
protein is an enzyme and the amount of enzyme activity can be determined. 

1 5 In one preferred embodiment, the cell is a cancer cell and the determined 

difference in expression is a decrease in expression. In another embodiment, the 
cell is a recombinant cell, such as one comprising a gene as disclosed herein, 
and the difference in expression is a decrease in expression. 

20 The present invention also relates to a method for identifying an anti- 

neoplastic agent comprising contacting a cell exhibiting neoplastic activity with a 
compound first identified as a cancer target gene modulator using one of the 
methods of the invention and detecting a decrease in said neoplastic activity 
after said contacting compared to when said contacting does not occur. In 

25 preferred embodiments, the neoplastic activity is accelerated cellular replication. 
In another preferred embodiment, the decrease in neoplastic activity results from 
the death of the cell. In a preferred embodiment, the compound is one that 
modulates, preferably inhibits, a gene disclosed herein, most preferably a gene 
identified in Table 6. 

30 
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The present invention also relates to a method for identifying an anti- 
neoplastic agent comprising administering to an animal exhibiting a cancerous 
condition an effective amount of a cancer target gene modulating agent by a 
method of the invention and detecting a decrease in said cancerous condition. In 
5 a preferred embodiment, the compound is one that modulates, preferably 
inhibits, a gene disclosed herein, most preferably a gene identified in Table 6. 

In accordance with the present invention, model cellular systems using 
1 0 cell lines, primary cells, or tissue samples are maintained in growth medium and 
may be treated with compounds that may be at a single concentration or at a 
range of concentrations. At specific times after treatment, cellular RNAs are 
isolated from the treated cells, primary cells or tumors, which RNAs are indicative 
of expression of selected genes. The cellular RNA is then divided and subjected 
1 5 to analysis that detects the presence and/or quantity of specific RNA transcripts, 
which transcripts may then be amplified for detection purposes using standard 
methodologies, such as, for example, reverse transcriptase polymerase chain 
reaction (RT-PCR), etc. The presence or absence, or levels, of specific RNA 
transcripts are determined from these measurements and a metric derived for the 
20 type and degree of response of the sample to the treated compound compared to 
control samples. 

In accordance with the foregoing, there are thus disclosed herein methods 
for using a cancer-linked or cancer-target gene sequence (such as that of 

25 KIAA1274, NEK6, PAK2, PAK4, STK38L, ACP1, ARHC, CDC6, CDK7, CDKN3, 
CRK7, DUSP16, FIGNLl GUK1, ITPR2, KCNK1, KCNK5, PRO2000, RFC2 and 
RIPK2) whose expression is, or can be, as a result of the methods of the present 
invention, linked to, or used to characterize, the cancerous, or non-cancerous, 
status of the cells, or tissues, to be tested. Thus, the processes of the present 

30 invention identify novel anti-neoplastic agents based on their alteration of 
expression of the polynucleotide sequence disclosed herein in specific model 
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systems. The methods of the invention may therefore be used with a variety of 
cell lines or with primary samples from tumors maintained in vitro under suitable 
culture conditions for varying periods of time, or in situ in suitable animal models. 

5 More particularly, genes have been identified that are expressed at a level 

in cancer cells that is different from the expression level in non-cancer cells. In 
one instance, the identified genes are expressed at higher levels in cancer cells 
than in normal cells. 

10 The genes useful in the methods of the invention can include fully 

operational genes with attendant control or regulatory sequences or merely a 
polynucleotide sequence encoding the corresponding polypeptide or an active 
fragment or analog thereof. 

1 5 In one embodiment of the present invention, said gene modulation is 

downward modulation, so that, as a result of exposure to the chemical agent to 
be tested, one or more genes of the cancerous cell will be expressed at a lower 
level (or not expressed at all) when exposed to the agent as compared to the 
expression when not exposed to the agent. 

20 

In a preferred embodiment a selected set of said genes are expressed in 
the reference cell, including the gene(s) identified for use according to the 
present invention, but are not expressed in the cell to be tested as a result of the 
exposure of the cell to be tested to the chemical agent. Thus, where said 
25 chemical agent causes the gene, or genes, of the tested cell to be expressed at a 
lower level than the same genes of the reference, this is indicative of downward 
modulation and indicates that the chemical agent to be tested has anti-neoplastic 
activity. 

30 The genes identified by the present disclosure are considered "cancer- 

related" genes, or cancer-target" genes, as this term is used herein, and include 
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genes expressed at higher levels (due, for example, to elevated rates of 
expression, elevated extent of expression or increased copy number) in cancer 
cells relative to expression of these genes in normal (i.e., non-cancerous) cells 
where said cancerous state or status of test cells or tissues has been determined 
5 by methods known in the art, such as by reverse transcriptase polymerase chain 
reaction (RT-PCR) as described in the Example below. In specific embodiments, 
this relates to the genes of KIAA1274, NEK6, PAK2, PAK4, STK38L, ACP1, 
ARHC, CDC6, CDK7, CDKN3, CRK7, DUSP16, FIGNL1, GUK1, ITPR2 t KCNK1, 
KCNK5, PRO200Q, RFC2 and RIPK2. 

1 0 The genes disclosed herein may be genomic in nature and thus represent 

an actual gene as found in nature, such as a human gene, or may be a cDNA 
sequence derived from a messenger RNA (mRNA) and thus represent 
contiguous exonic sequences derived from a corresponding genomic sequence 
or they may be wholly synthetic in origin for purposes of practicing the processes 

15 of the invention. Because of the processing that may take place in transforming 
the initial RNA transcript into the final mRNA, the genes disclosed herein may 
represent less than the full genomic nucleotide sequence. They may also 
represent sequences derived from ribosomal and transfer RNAs. Consequently, 
the genes present in the cell (and representing the genomic sequences) and the 

20 sequences of genes disclosed herein, which are mostly cDNA sequences, may 
be identical or may be such that the cDNAs contain less than the full genomic 
sequence. Such genes and cDNA sequences are still considered as 
corresponding to genes disclosed herein because they both encode similar RNA 
sequences. Thus, by way of non-limiting example only, a gene that encodes an 

25 RNA transcript, which is then processed into a shorter mRNA, is deemed to 
encode both such RNAs and therefore encodes an RNA complementary to 
(using the usual Watson-Crick complementarity rules), or that would otherwise be 
encoded by, a cDNA (for example, a sequence as disclosed herein). Thus, the 
sequences of genes disclosed herein correspond to genes contained in the 

30 cancerous or normal cells used to determine relative levels of expression 
because they represent the same sequences or are complementary to RNAs 



encoded by these genes. Such genes also include different alleles and splice 
variants that may occur in the cells used in the processes of the invention. 

The genes of the invention "correspond to" the genes of KIAA1274, NEK6, 
5 PAK2, PAK4, STK38L, ACP1, ARHC, CDC6, CDK7, CDKN3, CRK7, DUSP16, 
FIGNL1, GUK1, ITPR2, KCNK1 , KCNK5, PRO2000, RFC2 and RIPK2 if the 
gene encodes an RNA (processed or unprocessed, including naturally occurring 
splice variants and alleles) that is at least 90% identical, preferably at least 95% 
identical, most preferably ?* least 98% identical to, and especially identical to, an 

1 0 RNA that would be em oded by, or be complementary to, such as by 
hybridization with, a polynucleotide having the indicated sequence. In addition, 
genes including sequences at least 90% identical to a genes of KIAA1274, 
NEK6, PAK2, PAK4, STK38L, ACP1, ARHC, CDC6, CDK7, CDKN3, CRK7, 
DUSP16, FIGNL1, GUK1, ITPR2, KCNK1, KCNK5, PRO2000, RFC2 or RIPK2, 

1 5 preferably at least about 95% identical to such a sequence, more preferably at 
least about 98% identical to such sequence and most preferably comprising such 
sequence are specifically contemplated by all of the processes of the present 
invention as being genes that correspond to these sequences. In addition, genes 
encoding the same proteins as any of these genes, regardless of the percent 

20 identity of such sequences, are also specifically contemplated by any of the 
methods of the present invention that rely on any or all of said sequences, 
regardless of how they are otherwise described or limited. Thus, any such 
sequences are available for use in carrying out any of the methods disclosed 
according to the invention. 

25 

Such genes will also encode the same or similar polypeptide sequence as 
the genes KIAA1274, NEK6, PAK2, PAK4, STK38L, ACP1, ARHC, CDC6, 
CDK7, CDKN3, CRK7, DUSP16, FIGNL1, GUK1, ITPR2, KCNK1, KCNK5, 
PRO2000, RFC2 and RIPK2 but may include differences in such amino acid 
30 sequences where such differences are limited to conservative amino acid 
substitutions, such as where the same overall three dimensional structure, and 
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thus the same antigenic character, is maintained. Thus, amino acid sequences 
may be within the scope of the present invention where they react with the same 
antibodies that react with polypeptides encoded by genes disclosed herein, 
preferably KIAA1274, NEK6, PAK2, PAK4, STK38L, ACP1, ARHC, CDC6, 
5 CDK7 t CDKN3, CKK7, DUSP16, FIGNL1, GUK1, ITPR2, KCNK1, KCNK5, 
PRO2000, RFC2 and RIPK2. 

The present invention also relates to methods of assaying potential 
antitumor agents based on their modulation of the expression of the disclosed 
10 genes according to the invention and methods for diagnosing cancerous, or 
potentially cancerous, conditions as a result of the patterns of expression of the a 
gene disclosed herein as well as related genes based on common expression or 
regulation of such genes. 

1 5 In carrying out the foregoing assays, relative antineoplastic activity may be 

ascertained by the extent to which a given chemical agent modulates the 
expression of genes present in a cancerous cell. Thus, a first chemical agent that 
modulates the expression of a gene associated with the cancerous state (i.e., a 
gene that includes one of the genes of the invention as disclosed herein and 

20 present in cancerous cells) to a larger degree than a second chemical agent tested 
by the assays of the invention is thereby deemed to have higher, or more desirable, 
or more advantageous, antineoplastic activity than said second chemical agent. 

The gene expression to be measured is commonly assayed using RNA 
25 expression as an indicator. Thus, the greater the level of RNA (messenger RNA) 
detected the higher the level of expression of the corresponding gene. Thus, gene 
expression, either absolute or relative, is determined by the relative expression of 
the RNAs encoded by such genes. 

30 RNA may be isolated from samples in a variety of ways, including lysis and 

denaturation with a phenolic solution containing a chaotropic agent (e.g., triazol) 
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followed by isopropanol precipitation, ethanol wash, and resuspension in aqueous 
solution; or lysis and denaturation followed by isolation on solid support, such as a 
Qiagen resin and reconstitution in aqueous solution; or lysis and denaturation in 
non-phenolic, aqueous solutions followed by enzymatic conversion of RNA to DNA 
5 template copies. 

Normally, prior to applying the processes of the invention, steady state RNA 
expression levels for the genes, and sets of genes, disclosed herein will have been 
obtained. It is the steady state level of such expression that is affected by potential 

1 0 antineoplastic agents as determined herein. Such steady state levels of expression 
are easily determined by any methods that are sensitive, specific and accurate. 
Such methods include, but are in no way limited to, real time quantitative 
polymerase chain reaction (PCR), for example, using a Perkin-Elmer 7700 
sequence detection system with gene specific primer probe combinations as 

1 5 designed using any of several commercially available software packages, such as 
Primer Express software., solid support based hybridization array technology using 
appropriate internal controls for quantitation, including filter, bead, or microchip 
based arrays, solid support based hybridization arrays using, for example, 
chemiluminescent, fluorescent, or electrochemical reaction based detection 

20 systems. 

The gene patterns indicative of a cancerous state need not be 
characteristic of every cell found to be cancerous. Thus, the methods disclosed 
herein are useful for detecting the presence of a cancerous condition within a 
tissue where less than all cells exhibit the complete pattern. Thus, for example, a 

25 set of selected genes, corresponding to any of the genes disclosed herein, may 
be found, using appropriate probes, either DNA or RNA, to be present in as little 
as 60% of cells derived from a sample of tumorous, or malignant, tissue while 
being absent from as much as 60% of cells derived from corresponding non- 
cancerous, or otherwise normal, tissue (and thus being present in as much as 

30 40% of such normal tissue cells). In a preferred embodiment, such gene pattern 
is found to be present in at least 50% of cells drawn from a cancerous tissue, 
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such as the lung cancer disclosed herein. In an additional embodiment, such 
gene pattern is found to be present in at least 100% of cells drawn from a 
cancerous tissue and absent from at least 100% of a corresponding normal, non- 
cancerous, tissue sample, although the latter embodiment may represent a rare 
5 occurrence. 

In another aspect the present invention relates to a process for determining 
the cancerous status of a test cell, comprising determining expression in said test 
cell of a gene as disclosed herein and then comparing said expression to 
10 expression of said at least one gene in at least one cell known to be non- 
cancerous whereby a difference in said expression indicates that said cell is 
cancerous. 

In one embodiment, said change in expression is a change in copy 
1 5 number, including either an increase or decrease in copy number. In accordance 
with the present invention, said change in gene copy number may be determined 
by determining a change in expression of messenger RNA encoded by said 
gene. 

20 Changes in gene copy number may be determined by determining a 

change in expression of messenger RNA encoded by a particular gene, 
especially that of Such change in gene copy number may be determined by 
determining a change in expression of messenger RNA encoded by a particular 
gene, especially that of KIAA1274, NEK6, PAK2, PAK4, STK38L, ACP1, ARHC, 

25 CDC6, CDK7, CDKN3, CRK7, DUSP16, FIGNL1, GUK1, ITPR2, KCNK1, 
KCNK5, PRO2000 t RFC2 and RIPK2. Also in accordance with the present 
invention, said gene may be a cancer initiating gene, a cancer facilitating gene, 
or a cancer suppressing gene. In carrying out the methods of the present 
invention, a cancer facilitating gene is a gene that, while not directly initiating or 

30 suppressing tumor formation or growth, said gene acts, such as through the 
actions of its expression product, to direct, enhance, or otherwise facilitate the 



progress of the cancerous condition, including where such gene acts against 
genes, or gene expression products, that would otherwise have the effect of 
decreasing tumor formation and/or growth. 

5 Although the presence or absence of expression of a gene corresponding 

to one of KIAA1274, NEK6, PAK2, PAK4, STK38L, ACP1, ARHC, CDC6, CDK7, 
CDKN3, CRK7, DUSP16, FIGNL1, GUK1, ITPR2, KCNK1, KCNK5, PRO2000, 
RFC2 and RIPK2, may be indicative of a cancerous status for a given cell, the 
mere presence or absence of such a gene may not alone be sufficient to achieve 

1 0 a malignant condition and thus the level of expression of such gene pattern may 
also be a significant factor in determining the attainment of a cancerous state. 
Thus, while a pattern of genes may be present in both cancerous and non- 
cancerous cells, the level of expression, as determined by any of the methods 
disclosed herein, all of which are well known in the art, may differ between the 

1 5 cancerous versus the non-cancerous cells. Thus, it becomes essential to also 
determine the level of expression of a gene such as that disclosed herein, 
including substantially similar genes, as a separate means of diagnosing the 
presence of a cancerous status for a given cell, groups of cells, or tissues, either 
in culture or in situ. 

20 

The level of expression of the polypeptides disclosed herein is also a 
measure of gene expression, such as polypeptides having sequence identical, or 
similar to any polypeptide encoded by any of KIAA1274, NEK6, PAK2, PAK4, 
STK38L, ACP1, ARHC, CDC6, CDK7, CDKN3, CRK7, DUSP16, FIGNL1, GUK1, 
25 ITPR2, KCNK1 , KCNK5, PRO2000, RFC2 and RIPK2. 

Thus, the present invention further relates to a method for identifying an 
agent that modulates the activity of a cancer-target polypeptide comprising: 

(a) contacting a test compound with a cell expressing a polypeptide 
30 encoded by a polynucleotide corresponding to a gene having the properties of a, 
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b and c disclosed above for identifying a cancer-target gene and under 
conditions promoting the expression of said polypeptide; and 

(b) determining a difference in expression of said polypeptide relative to 
when said test compound is not present wherein said difference indicates 
5 cancer-target polypeptide modulating activity, 

thereby identifying a cancer-target polypeptide modulating agent. 

The present invention further relates to a method for identifying an agent 
that modulates the activity of a cancer-target polypeptide comprising: 
10 (a) contacting a test compound with a polypeptide encoded by a 

polynucleotide corresponding to a gene having the properties of a, b and c of 
claim 1 and under conditions promoting the activity of said polypeptide; and 

(b) determining a difference in activity of said polypeptide relative to when 
said test compound is not present wherein said difference indicates cancer-target 
1 5 polypeptide modulating activity, 

thereby identifying a cancer-target polypeptide modulating agent. 

In any of these methods, a preferred embodiment utilizes a gene selected 
from KIAA1274, NEK6, PAK2, PAK4, STK38L, ACP1, ARHC t CDC6, CDK7, 
20 CDKN3, CRK7, DUSP16, FIGNL1, GUK1, ITPR2, KCNK1, KCNK5, PRO2000, 
RFC2 and RIPK2. 

In accordance with the foregoing, the present invention further relates to a 
process for determining the cancerous status of a cell to be tested, comprising 

25 determining the level of expression in said cell of at least one gene of KIAA1274, 
NEK6, PAK2, PAK4, STK38L, ACP1, ARHC, CDC6, CDK7, CDKN3, CRK7, 
DUSP16, FIGNL1, GUK1, ITPR2, KCIMK1, KCNK5, PRO2Q00, RFC2 and RIPK2, 
including genes substantially identical to said sequences, or characteristic 
fragments thereof, or the complements of any of the foregoing and then 

30 comparing said expression to that of a cell known to be non-cancerous whereby 
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the difference in said expression indicates that said cell to be tested is 
cancerous. 

In accordance with the invention, although gene expression for a gene 
5 useful in the methods of the invention is preferably determined by use of a probe 
that is a fragment of such nucleotide sequence, it is to be understood that the 
probe may be formed from a different portion of the gene. Expression of the gene 
may be determined by use of a nucleotide probe that hybridizes to messenger 
RNA (mRNA) transcribed from a portion of the gene. 

10 

It should be noted that there are a variety of different contexts in which 
genes have been evaluated as being involved in the cancerous process. Thus, 
some genes may be oncogenes and encode proteins that are directly involved in 
the cancerous process and thereby promote the occurrence of cancer in an 

1 5 animal. In addition, other genes may serve to suppress the cancerous state in a 
given cell or cell type and thereby work against a cancerous condition forming in 
an animal. Other genes may simply be involved either directly or indirectly in the 
cancerous process or condition and may serve in an ancillary capacity with 
respect to the cancerous state. All such types of genes are deemed with those to 

20 be determined in accordance with the invention as disclosed herein. Thus, the 
gene determined by said process of the invention may be an oncogene, or the 
gene determined by said process may be a cancer facilitating gene, the latter 
including a gene that directly or indirectly affects the cancerous process, either in 
the promotion of a cancerous condition or in facilitating the progress of 

25 cancerous growth or otherwise modulating the growth of cancer cells, either in 
vivo or ex vivo. In addition, the gene determined by said process may be a 
cancer suppressor gene, which gene works either directly or indirectly to 
suppress the initiation or progress of a cancerous condition. Such genes may 
work indirectly where their expression alters the activity of some other gene or 

30 gene expression product that is itself directly involved in initiating or facilitating 
the progress of a cancerous condition. For example, a gene that encodes a 
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polypeptide, either wild or mutant in type, which polypeptide acts to suppress of 
tumor suppressor gene, or its expression product, will thereby act indirectly to 
promote tumor growth. 

5 In accordance with the foregoing, the methods of the present invention 

includes cancer modulating agents that are themselves either polypeptides, or 
small chemical entities, that affect the cancerous process, including initiation, 
suppression or facilitation of tumor growth, either in vivo or ex vivo. Said cancer 
modulating agent may have the effect of increasing gene expression or said 
1 0 cancer modulating agent may have the effect of decreasing gene expression as 
such terms have been described herein. 

Thus, the present invention relates to a method for treating cancer 
comprising contacting a cancerous cell with an effective amount of an agent that 
1 5 can reduce the activity of a cancer-target gene (i.e., a gene having the properties 
of a, b and c disclosed herein for identifying a cancer-target gene). In a preferred 
embodiment, said gene is one of KIAA1274, NEK6, PAK2, PAK4, STK38L, 
ACP1, ARHC, CDC6, CDK7, CDKN3, CRK7 t DUSP16, FIGNL1, GUK1, ITPR2, 
KCNK1 , KCNK5, PRO20Q0, RFC2 and RIPK2. 

20 

Such embodiments include use of any of the agents having activity in one 
or more of the screening methods disclosed herein, most preferably wherein the 
agent was first identified as having such activity using one or more of said 
methods. In a preferred embodiment, the cancerous cell is contacted in vivo. 

25 

In an additional preferred embodiment, the agent has affinity for an 
expression product of said gene, such as where the agent is an antibody, 
preferably one disclosed according to the present invention. 

30 The proteins encoded by the genes disclosed herein due to their 

expression, or elevated expression, in cancer cells, represent highly useful 
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therapeutic targets for "targeted therapies" utilizing such affinity structures as, for 
example, antibodies coupled to some cytotoxic agent. In such methodology, it is 
advantageous that nothing need be known about the endogenous ligands or 
binding partners for such cell surface molecules. Rather, an antibody or 
5 equivalent molecule that can specifically recognize the cell surface molecule 
(which could include an artificial peptide, a surrogate ligand, and the like) that is 
coupled to some agent that can induce cell death or a block in cell cycling offers 
therapeutic promise against these proteins. Thus, such approaches include the 
use of so-called suicide "bullets" against intracellular proteins 

10 

With the advent of methods of molecular biology and recombinant 
technology, it is now possible to produce antibody molecules by recombinant 
means and thereby generate gene sequences that code for specific amino 
acid sequences found in the polypeptide structure of the antibodies. Such 

15 antibodies can be produced by either cloning the gene sequences encoding 
the polypeptide chains of said antibodies or by direct synthesis of said 
polypeptide chains, with in vitro assembly of the synthesized chains to form 
active tetrameric (H 2 L 2 ) structures with affinity for specific epitopes and 
antigenic determinants. This has permitted the ready production of 

20 antibodies having sequences characteristic of neutralizing antibodies from 
different species and sources. 

Regardless of the source of the antibodies, or how they are 
recombinantly constructed, or how they are synthesized, in vitro or in vivo, 

25 using transgenic animals, such as cows, goats and sheep, using large cell 
cultures of laboratory or commercial size, in bioreactors or by direct chemical 
synthesis employing no living organisms at any stage of the process, all 
antibodies have a similar overall 3 dimensional structure. This structure is 
often given as H 2 L 2 and refers to the fact that antibodies commonly comprise 

30 2 light (L) amino acid chains and 2 heavy (H) amino acid chains. Both chains 

39 



have regions capable of interacting with a structurally complementary 
antigenic target. The regions interacting with the target are referred to as 
"variable" or "V" regions and are characterized by differences in amino acid 
sequence from antibodies of different antigenic specificity. 

5 

The variable regions of either H or L chains contains the amino acid 
sequences capable of specifically binding to antigenic targets. Within these 
sequences are smaller sequences dubbed "hypervariable" because of their 
extreme variability between antibodies of differing specificity. Such 
10 hypervariable regions are also referred to as "complementarity determining 
regions" or "CDR" regions. These CDR regions account for the basic 
specificity of the antibody for a particular antigenic determinant structure. 

The CDRs represent non-contiguous stretches of amino acids within 
15 the variable regions but, regardless of species, the positional locations of 
these critical amino acid sequences within the variable heavy and light chain 
regions have been found to have similar locations within the amino acid 
sequences of the variable chains. The variable heavy and light chains of all 
antibodies each have 3 CDR regions, each non-contiguous with the others 
20 (termed L1, L2, L3, H1, H2, H3) for the respective light (L) and heavy (H) 
chains. The accepted CDR regions have been described by Kabat et al, J. 
Biol. Chem. 252:6609-6616 (1977). The numbering scheme is shown in the 
figures, where the CDRs are underlined and the numbers follow the Kabat 
scheme. 

25 

In all mammalian species, antibody polypeptides contain constant (i.e., 
highly conserved) and variable regions, and, within the latter, there are the 
CDRs and the so-called "framework regions" made up of amino acid 
sequences within the variable region of the heavy or light chain but outside 
30 the CDRs. 



40 



The antibodies disclosed according to the invention may also be wholly 
synthetic, wherein the polypeptide chains of the antibodies are synthesized and, 
possibly, optimized for binding to the polypeptides disclosed herein as being 
5 receptors. Such antibodies may be chimeric or humanized antibodies and may 
be fully tetrameric in structure, or may be dimeric and comprise only a single 
heavy and a single light chain. Such antibodies may also include fragments, such 
as Fab and F(ab 2 )' fragments, capable of reacting with and binding to any of the 
polypeptides disclosed herein as being receptors. 

10 

In one aspect, the present invention relates to immunoglobulins, or 
antibodies, as described herein, that react with, especially where they are 
specific for, the polypeptides encoded by a gene identified by the methods of 
the invention, preferably one of KIAA1274, NEK6, PAK2, PAK4, STK38L, 
15 ACP1, ARHC, CDC6, CDK7, CDKN3, CRK7, DUSP16, FIGNL1, GUK1, ITPR2, 
KCNK1, KCNK5, PRO2000, RFC2 and RIPK2. Such antibodies may commonly 
be in the form of a composition, especially a pharmaceutical composition. 

Thus, the present invention contemplates an antibody that binds to a polypeptide 
20 encoded by a cancer-target gene (i.e., a gene having the properties of a, b and c 
disclosed above for identifying a cancer-target gene). In a preferred embodiment, 
the polypeptide or protein is encoded by one or more of genes KIAA1274, NEK6, 
PAK2, PAK4, STK38L, ACP1, ARHC, CDC6, CDK7 t CDKN3, CRK7, DUSP16, 
FIGNL1, GUK1, ITPR2. KCNK1, KCNK5, PRO2000, RFC2 and RIPK2. In 
25 specific embodiments, this may include a naturally occurring antibody, such as 
polyclonal antibodies, or more preferably a monoclonal antibody, or a 
recombinant antibody or a partly or wholly synthetic antibody. In additional 
preferred embodiments, the antibody further comprises a cytotoxic agent, such 
as an apoptotic agent. 

30 
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The pharmaceutical compositions useful herein also contain a 
pharmaceutical^ acceptable carrier, including any suitable diluent or 
excipient, which includes any pharmaceutical agent that does not itself 
induce the production of antibodies harmful to the individual receiving the 
5 composition, and which may be administered without undue toxicity. 
Pharmaceutical^ acceptable carriers include, but are not limited to, liquids 
such as water, saline, glycerol and ethanol, and the like, including carriers 
useful in forming sprays for nasal and other respiratory tract delivery or for 
delivery to the ophthalmic system. A thorough discussion of 
10 pharmaceutical^ acceptable carriers, diluents, and other excipients is 
presented in REMINGTON'S PHARMACEUTICAL SCIENCES (Mack Pub. Co., 
N.J. current edition). 

The process of the present invention includes embodiments of the above- 
1 5 recited processes wherein the cancer cell is contacted in vivo as well as ex vivo, 
preferably wherein said agent comprises a portion, or is part of an overall 
molecular structure, having affinity for said expression product. In one such 
embodiment, said portion having affinity for said expression product is an 
antibody, especially where said expression product is a polypeptide or 
20 oligopeptide or comprises an oligopeptide portion, or comprises a polypeptide. 

Such an agent can therefore be a single molecular structure, comprising 
both affinity portion and anti-cancer activity portions, wherein said portions are 
derived from separate molecules, or molecular structures, possessing such 

25 activity when separated and wherein such agent has been formed by combining 
said portions into one larger molecular structure, such as where said portions are 
combined into the form of an adduct. Said anti-cancer and affinity portions may 
be joined covalently, such as in the form of a single polypeptide, or polypeptide- 
like, structure or may be joined non-covalently, such as by hydrophobic or 

30 electrostatic interactions, such structures having been formed by means well 
known in the chemical arts. Alternatively, the anti-cancer and affinity portions 
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may be formed from separate domains of a single molecule that exhibits, as part 
of the same chemical structure, more than one activity wherein one of the 
activities is against cancer cells, or tumor formation or growth, and the other 
activity is affinity for an expression product produced by expression of genes 
5 related to the cancerous process or condition. 

In one embodiment of the present invention, a chemical agent, such as a 
protein or other polypeptide, is joined to an agent, such as an antibody, having 
affinity for an expression product of a cancerous cell, such as a polypeptide or 

10 protein encoded by a gene related to the cancerous process, especially a gene 
as disclosed herein according to the present invention. Thus, where the presence 
of said expression product is essential to tumor initiation and/or growth, binding 
of said agent to said expression product will have the effect of negating said 
tumor promoting activity. In one such embodiment, said agent is an apoptosis- 

1 5 inducing agent that induces cell suicide, thereby killing the cancer cell and halting 
tumor growth.. 

Other genes within the cancer cell that are regulated in a manner similar 
to that of the genes disclosed herein and thus change their expression in a 

20 coordinated way in response to chemical compounds represent genes that are 
located within a common metabolic, signaling, physiological, or functional 
pathway so that by analyzing and identifying such commonly regulated groups of 
genes (groups that include the gene, or similar sequences, disclosed according 
to the invention, one can (a) assign known genes and novel genes to specific 

25 pathways and (b) identify specific functions and functional roles for novel genes 
that are grouped into pathways with genes for which their functions are already 
characterized or described. For example, one might identify a group of 10 genes, 
at least one of which is the gene as disclosed herein, that change expression in a 
coordinated fashion and for which the function of one, such as the polypeptide 

30 encoded by a gene disclosed herein, is known then the other genes are thereby 
implicated in a similar function or pathway and may thus play a role in the 
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cancer-initiating or cancer-facilitating process. In the same way, if a gene were 
found in normal cells but not in cancer cells, or happens to be expressed at a 
higher level in normal as opposed to cancer cells, then a similar conclusion may 
be drawn as to its involvement in cancer, or other diseases. Therefore, the 
5 processes disclosed according to the present invention at once provide a novel 
means of assigning function to genes, i.e. a novel method of functional 
genomics, and a means for identifying chemical compounds that have potential 
therapeutic effects on specific cellular pathways. Such chemical compounds may 
have therapeutic relevance to a variety of diseases outside of cancer as well, in 
10 cases where such diseases are known or are demonstrated to involve the 
specific cellular pathway that is affected. 

The polypeptides contemplated by the invention, preferably those 
encoded by one or more of KIAA1274, NEK6, PAK2, PAK4. STK38L, ACP1, 

1 5 ARHC, CDC6, CDK7, CDKN3, CRK7, DUSP16, FIGNL1, GUK1, ITPR2, KCNK1, 
KCNK5, PRO2000, RFC2 and RIPK2, also find use as vaccines in that, where 
the polypeptide represents a surface protein present on a cancer cell, such 
polypeptide may be administered to an animal, especially a human being, for 
purposes of activating cytotoxic T lymphocytes (CTLs) that will be specific for, 

20 and act to lyze, cancer cells in said animal. Where used as vaccines, such 
polypeptides are present in the form of a pharmaceutical composition. The 
present invention may also employ polypeptides that have the same, or similar, 
immunogenic character as the polypeptides encoded by one or more of 
KIAA1274, NEK6, PAK2. PAK4, STK38L, ACP1, ARHC, CDC6, CDK7, CDKN3, 

25 CRK7, DUSP16, FIGNL1, GUK1 , ITPR2, KCNK1 , KCNK5, PRO2000, RFC2 and 
RIPK2, and thereby elicit the same, or similar, immunogenic response after 
administration to an animal, such as an animal at risk of developing cancer, or 
afflicted therewith. Thus, the polypeptides disclosed according to the invention 
will commonly find use as immunogenic compositions. 

30 
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The present invention also relates to a process that comprises a 
method for producing a product, such as by generating test data to facilitate 
identification of such product, comprising identifying an agent according to 
one of the disclosed processes for identifying such an agent (i.e., the 
5 therapeutic agents identified according to the assay procedures disclosed 
herein) wherein said product is the data collected with respect to said agent 
as a result of said identification process, or assay, and wherein said data is 
sufficient to convey the chemical character and/or structure and/or 
properties of said agent. For example, the present invention specifically 

10 contemplates a situation whereby a user of an assay of the invention may 
use the assay to screen for compounds having the desired enzyme 
modulating activity and, having identified the compound, then conveys that 
information (i.e., information as to structure, dosage, etc) to another user 
who then utilizes the information to reproduce the agent and administer it for 

15 therapeutic or research purposes according to the invention. For example, 
the user of the assay (user 1) may screen a number of test compounds 
without knowing the structure or identity of the compounds (such as where 
a number of code numbers are used the first user is simply given samples 
labeled with said code numbers) and, after performing the screening process, 

20 using one or more assay processes of the present invention, then imparts to 
a second user (user 2), verbally or in writing or some equivalent fashion, 
sufficient information to identify the compounds having a particular 
modulating activity (for example, the code number with the corresponding 
results). This transmission of information from user 1 to user 2 is specifically 

25 contemplated by the present invention. 

The genes useful in the methods of the invention disclosed herein are 
genes corresponding to one of KIAA1274, NEK6, PAK2, PAK4, STK38L, ACP1, 
ARHC, CDC6, CDK7, CDKN3, CRK7 t DUSP16, FIGNL1, GUK1, ITPR2, KCNK1, 
30 KCNK5, PRO2000, RFC2 and RIPK2 and represent genes that may be over- 
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expressed in malignant cancer. In addition, in any given sample, not all cancer 
cells may express this gene a substantial expression thereof in a substantial 
number of such cells is sufficient to warrant a determination of a cancerous, or 
potentially cancerous, condition. 

5 

Thus, the genes disclosed according to the present invention are 
expressed in cancer compared to normal tissue samples or may be expressed at 
a higher level in cancer as compared to normal tissues. Further, such 
polynucleotide, or gene, sequence expression in normal tissues may correlate 
1 0 with individuals having a family history of cancer. 

Such genes may play a direct role in cancer progression, such as in 
cancer initiation or cancer cell proliferation/survival. For example, one or more 
genes encoding the same polypeptide as one or more of the sequences 

15 disclosed herein represent novel individual gene targets for screening and 
discovery of small molecules that inhibit enzyme or other cellular functions, e.g. 
kinase inhibitors. Such molecules represent valuable therapeutics for cancer. In 
addition, small molecules or agents, such as small organic molecules, that down- 
regulate the expression of these genes in cancer would represent valuable anti- 

20 cancer therapeutics. Expression of the gene in normal tissues may indicate a 
predisposition towards development of lung cancer. The encoded polypeptide 
might represent a potentially useful cell surface target for therapeutic molecules 
such as cytolytic antibodies, or antibodies attached to cytotoxic, or cytolytic, 
agents. 

25 

Expression of a gene corresponding to a polynucleotide disclosed herein, 
when in normal tissues, may indicate a predisposition towards development of 
colon cancer. The encoded polypeptide might then present a potentially useful 
cell surface target for therapeutic molecules such as cytolytic antibodies, or 
30 antibodies attached to cytotoxic, or cytolytic, agents. 
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The present invention specifically contemplates use of antibodies against 
polypeptides encoded by the genes disclosed herein, whereby said antibodies 
are conjugated to one or more cytotoxic agents so that the antibodies serve to 
target the conjugated immunotoxins to a region of cancerous activity, such as a 
5 solid tumor. For many known cytotoxic agents, lack of selectivity has presented a 
drawback to their use as therapeutic agents in the treatment of malignancies. For 
example, the class of two-chain toxins, consisting of a binding subunit (or B- 
chain) linked to a toxic subunit (A-chain) are extremely cytotoxic. Thus, such 
agents as ricin, a protein isolated from castor beans, kills cells at very low 

10 concentrations (even less than 10' 11 M) by inactivating ribosomes in said cells 
(see, for example, Lord et al., Ricin: structure, mode of action, and some current 
applications. Faseb J, 8. 201-208 (1994), and Biattler et al., Realizing the full 
potential of immunotoxins. Cancer Cells, 1: 50-55 (1989)). While isolated A- 
chains of protein toxins that functionally resemble ricin A-chain are only weakly 

1 5 cytotoxic for intact cells (in the concentration range of 10" 7 to 10" 6 M), they are 
very potent cytotoxic agents inside the cells. Thus, a single molecule of the A- 
subunit of diphtheria toxin can kill a cell once inside (see: Yamaizumi et al., One 
molecule of diphtheria toxin fragment A introduced into a cell can kill the cell. 
Cell, 15: 245-250, 1978). 

20 

The present invention solves this selectivity problem by using antibodies 
specific for antigens present on cancer cells to target the cytotoxins to said cells. 
In addition, use of antibodies decreases toxicity because the antibodies are non- 
toxic until they reach the tumor and, because the cytotoxin is bound to the 
25 antibody, it is presented with less opportunity to cause damage to non-targeted 
tissues. 

In addition, use of such antibodies alone can provide therapeutic effects 
on the tumor through the antibody-dependent cellular cytotoxic response (ADCC) 
30 and complement-mediated cell lysis mechanisms. 
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A number of recombinant immunotoxins (for example, consisting of Fv 
regions of cancer specific antibodies fused to truncated bacterial toxins) are well 
known (see, for example, Smyth et al., Specific targeting of chlorambucil to 
tumors with the use of monoclonal antibodies, J. Natl. Cancer Inst, 76(3):503- 
5 510 (1986); Cho et al., Single-chain Fv/folate conjugates mediate efficient lysis of 
folate-receptor-positive tumor cells, Bioconjug. Chem., 8(3):338-346 (1997)). As 
noted in the literature, these may contain, for example, a truncated version of 
Pseudomonas exotoxin as a toxic moiety but the toxin is modified in such a 
manner that by itself it does not bind to normal human cells, but it retains all other 

10 functions of cytotoxicity. Here, recombinant antibody fragments target the 
modified toxin to cancer cells which are killed, such as by direct inhibition of 
protein synthesis, or by concomitant induction of apoptosis. Cells that are not 
recognized by the antibody fragment, because they do not carry the cancer 
antigen, are not affected. Good activity and specificity has been observed for 

1 5 many recombinant immunotoxins in in vitro assays using cultured cancer cells as 
well as in animal tumor models. Ongoing clinical trials provide examples where 
the promising pre-clinical data correlate with successful results in experimental 
cancer therapy, (see, for example, Brinkmann U., Recombinant antibody 
fragments and immunotoxin fusions for cancer therapy, In Vivo (2000) 14:21-27). 

20 

While the safety of employing immunoconjugates in humans has been 
established, in vivo therapeutic results have been less impressive. Because 
clinical use of mouse MAbs in humans is limited by the development of a foreign 
anti-globulin immune response by the human host, genetically engineered 

25 chimeric human-mouse MAbs have been developed by replacing the mouse Fc 
region with the human constant region. In other cases, the mouse antibodies 
have been "humanized" by replacing the framework regions of variable domains 
of rodent antibodies by their human equivalents. Such humanized and 
engineered antibodies can even be structurally arranged to have specificities and 

30 effector functions determined by design and which characteristics do not appear 
in nature. The development of bispecific antibodies, having different binding ends 
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so that more than one antigenic site can be bound, have proven useful in 
targeting cancer cells. Thus, such antibody specificity has been improved by 
chemical coupling to various agents such as bacterial or plant toxins, 
radionuclides or cytotoxic drugs and other agents, (see, for example, Bodey, B. 
5 et al). Genetically engineered monoclonal antibodies for direct anti-neoplastic 
treatment and cancer cell specific delivery of chemotherapeutic agents. Curr 
Pharm Des (2000) Feb;6(3):261-76). See also, Garnett, M. C., Targeted drug 
conjugates: principles and progress. Adv. Drug Deliv. Rev. (2001 Dec 17) 
53(2):171-216; Brinkmann et al., Recombinant immunotoxins for cancer therapy. 
1 0 Expert Opin Biol Ther. (2001 ) 1 (4):693-702. 

Among the cytotoxic agents specifically contemplated for use as 
immunoconjugates according to the present invention are Calicheamicin, a 
highly toxic enediyne antibiotic isolated from Micromonospora echinospora 

15 ssp. Calichensis, and which binds to the minor groove of DNA to induce 
double strand breaks and cell death (see: Lee et al., Calicheamicins, a novel 
family of antitumor antibiotics. 1. Chemistry and partial structure of calichemicin 
gi. J Am Chem Soc, 109; 3464-3466 (1987); Zein et al., Calicheamicin gamma 
11: an antitumor antibiotic that cleaves double-stranded DNA site specifically, 

20 Science, 240: 1198-1201 (1988)). Useful derivatives of the calicheamicins 
include mylotarg and 138H11-Cam8. Mylotarg is an immunoconjugate of a 
humanized anti-CD33 antibody (CD33 being found in leukemic cells of most 
patients with acute myeloid leukemia) and N-acetyl gamma colicheamicin 
dimethyl hydrazide, the latter of which is readily coupled to an antibody of the 

25 present invention (in place of the anti-CD33 but which can also be humanized by 
substitution of human framework regions into the antibody during production as 
described elsewhere herein) to form an immunoconjugate of the invention, (see: 
Hamann et al. Gemtuzumab Ozogamicin, A Potent and Selective Anti-CD33 
Antibody-Calicheamicin Conjugate for Treatment of Acute Myeloid Leukemia, 

30 Bioconjug. Chem. 13, 47-58 (2002)) For use with 138H11-Cam9, 138H11 is an 
anti-y-glutamyl transferase antibody coupled to theta calicheamicin through a 



disulfide linkage and found useful in vitro against cultured renal cell carcinoma 
cells, (see: Knoll et al., Targeted therapy of experimental renal cell carcinoma 
with a novel conjugate of monoclonal antibody 138H11 and calicheamicin Bu 
Cancer Res, 60: 6089-6094 (2000) The same linkage may be utilized to link this 
5 cytotoxic agent to an antibody of the present invention, thereby forming a 
targeting structure for colon cancer cells. 

Also useful in forming the immunoconjugates of the invention is DC1, a 
disulfide-containing analog of adozelesin, that kills cells by binding to the minor 

10 groove of DNA, followed by alkylation of adenine bases. Adozelesin is a 
structural analog of CC-1065, an anti-tumor antibiotic isolated from microbial 
fermentation of Streptomyces zelensis, and is about 1,000 fold more toxic to 
cultured cell lines that other DNA interacting agents, such as cis-platin and 
doxorubicin. This agent is readily linked to antibodies through the disulfide bond 

15 of adozelesin. (see: Chari et al., Enhancement of the selectivity and antitumor 
efficacy of a CC-1065 analogue through immunoconjugate formation, Cancer 
Res, 55:4079-4084(1995)). 

Maytansine, a highly cytotoxic microtubular inhibitor isolated from the 
20 shrub Maytenus serrata found to have little value in human clinical trials, is much 
more effective in its derivatized form, denoted DM1, containing a disulfide bond 
to facilitate linkage to antibodies, is up to 10-fold more cytotoxic (see: Chari et al., 
Immunoconjugates containing novel maytansinoids: promising anticancer drugs, 
Cancer Res, 52: 127-131 (1992)). These same in vitro studies showed that up to 
25 four DM1 molecules could be linked to a single immunoglobulin without 
destroying the binding affinity. Such conjugates have been used against breast 
cancer antigens, such as the neu/HER2Jert)B-2 antigen, (see: Goldmacher et al., 
Immunogen, Inc., (2002) in press)] also see Liu, C. et al., Eradication of large 
colon tumor xenografts by targeted delivery of maytansinoids, Proc. Natl. Acad. 
30 Sci. USA, 93, 8618-8623 (1996)). For example, Liu et al. (1996) describes 
formation of an immunoconjugate of the maytansinoid cytotoxin DM1 and C242 
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antibody, a murine lgG1 immunoglobulin, available from Pharmacia and which 
has affinity for a mucin-like glycoprotein variably expressed by human colorectal 
cancers. The latter immunoconjugate was prepared according to Chari et al., 
Cancer Res., 52:127-131 (1992) and was found to be highly cytotoxic against 
5 cultured colon cancer cells as well as showing anti-tumor effects in vivo in mice 
bearing subcutaneous COLO 205 human colon tumor xenografts using doses 
well below the maximum tolerated dose. 

In accordance with the foregoing, a preferred embodiment of the present 
1 0 invention includes where the cytotoxic agent is a calicheamicin, a maytansinoid, 
an adozelesin, DC1, a cytotoxic protein, a taxol, a taxotere, or a taxoid. In 
especially preferred embodiments, the calicheamicin is calicheamicin yu N-acetyl 
gamma calicheamicin dimethyl hydrazide or calicheamicin 9i', the maytansinoid 
is DM1, the cytotoxic protein is ricin, abrin, gelonin, pseudomonas exotoxin or 
1 5 diphtheria toxin, the taxol is paclitaxel, and the taxotere is docetaxel. 

In addition, there are a variety of protein toxins (cytotoxic proteins), which 
include a number of different classes, such as those that inhibit protein synthesis: 
ribosome-inactivating proteins of plant origin, such as ricin, abrin, gelonin, and a 
20 number of others, and bacterial toxins such as pseudomonas exotoxin and 
diphtheria toxin. 

Another useful class is the one including taxol, taxotere, and taxoids. 
Specific examples include paclitaxel (taxol), its analog docetaxel (taxotere), and 
25 derivatives thereof. The first two are clinical drugs used in treating a number of 
tumors while the taxoids act to induce cell death by inhibiting the de- 
polymerization of tubulin. Such agents are readily linked to antibodies through 
disulfide bonds without disadvantageous effects on binding specificity. 

30 In one instance, a truncated Pseudomonas exotoxin was fused to an anti- 

CD22 variable fragment and used successfully to treat patients with 
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chemotherapy-resistant hairy-cell leukemia, (see: Kreitman et al., Efficacy of the 
anti-CD22 recombinant immunotoxin BL22 in chemotherapy-resistant hairy-cell 
leukemia, N Engl J Med, 345: 241-247 (2001)) Conversely, the cancer-linked 
peptides of the present invention offer the opportunity to prepare antibodies, 
5 recombinant or otherwise, against the appropriate antigens to target solid tumors, 
preferably those of malignancies of colon tissue, using the same or similar 
cytotoxic conjugates. Thus, many of the previously used immunoconjugates have 
been formed using antibodies against general antigenic sites linked to cancers 
whereas the antibodies formed using the peptides disclosed herein are more 
10 specific and target the antibody-cytotoxic agent to a particular tissue or organ, 
thus further reducing toxicity and other undesirable side effects. 

In addition, the immunoconjugates formed using the antibodies prepared 
against the cancer-linked antigens disclosed herein can be formed by any type of 
15 chemical coupling. Thus, the cytotoxic agent of choice, along with the 
immunoglobulin, can be coupled by any type of chemical linkage, covalent or 
non-covalent, including electrostatic linkage, to form the immunoconjugates of 
the present invention. 

20 When used as immunoconjugates, the antitumor agents of the present 

invention represent a class of pro-drugs that are relatively non-toxic when first 
administered to an animal (due mostly to the stability of the immunoconjugate), 
such as a human patient, but which are targeted by the conjugated 
immunoglobulin to a cancer cell where they then exhibit good toxicity. The tumor- 

25 related, associated, or linked, antigens, preferably those presented herein, serve 
as targets for the antibodies (monoclonal, recombinant, and the like) specific for 
said antigens. The end result is the release of active cytotoxic agent inside the 
cell after binding of the immunoglobulin portion of the immunoconjugate. 

30 The cited references describe a number of useful procedures for the 

chemical linkage of cytotoxic agents to immunoglobulins and the disclosures of 
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all such references cited herein are hereby incorporated by reference in their 
entirety. For other reviews see Ghetie et al., Immunotoxins in the therapy of 
cancer: from bench to clinic, Pharmacol Ther, 63: 209-234 (1994), Pietersz et al. 
The use of monoclonal antibody immunoconjugates in cancer therapy, Adv Exp 
5 Med Biol, 353:169-179 (1994), and Pietersz, G. A. The linkage of cytotoxic drugs 
to monoclonal antibodies for the treatment of cancer, Bioconjug Chem, 1:89-95 
(1990). 

Thus, the present invention provides highly useful cancer-associated 
1 0 antigens for generation of antibodies for linkage to a number of different cytotoxic 
agents which are already known to have some in vitro toxicity and possess 
chemical groups available for linkage to antibodies. 

It should be cautioned that, in carrying out the procedures of the present 
15 invention as disclosed herein, any reference to particular buffers, media, 
reagents, cells, culture conditions and the like are not intended to be limiting, but 
are to be read so as to include all related materials that one of ordinary skill in the 
art would recognize as being of interest or value in the particular context in which 
that discussion is presented. For example, it is often possible to substitute one 
20 buffer system or culture medium for another and still achieve similar, if not 
identical, results. Those of skill in the art will have sufficient knowledge of such 
systems and methodologies so as to be able, without undue experimentation, to 
make such substitutions as will optimally serve their purposes in using the 
methods and procedures disclosed herein. 

25 

EXAMPLE 

SW480 cells are grown to a density of 10 5 cells/cm 2 in Leibovitz's L-15 
30 medium supplemented with 2 mM L-glutamine (90%) and 10% fetal bovine 
serum. The cells are collected after treatment with 0.25% trypsin, 0.02% EDTA at 
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37°C for 2 to 5 minutes. The trypsinized cells are then diluted with 30 ml growth 
medium and plated at a density of 50,000 cells per well in a 96 well plate (200 
lil/well). The following day, cells are treated with either compound buffer alone, or 
compound buffer containing a chemical agent to be tested, for 24 hours. The 
media is then removed, the cells lysed and the RNA recovered using the 
RNAeasy reagents and protocol obtained from Qiagen. RNA is quantitated and 
10 ng of sample in 1 ul are added to 24 ul of Taqman reaction mix containing 1X 
PCR buffer, RNAsin, reverse transcriptase, nucleoside triphosphates, amplitaq 
gold, tween 20, glycerol, bovine serum albumin (BSA) and specific PCR primers 
and probes for a reference gene (18S RNA) and a test gene (Gene X). Reverse 
transcription is then earned out at 48°C for 30 minutes. The sample is then 
applied to a Periin Elmer 7700 sequence detector and heat denatured for 10 
minutes at 95°C. Amplification is performed through 40 cycles using 15 seconds 
annealing at 60°C followed by a 60 second extension at 72°C and 30 second 
denaturation at 95°C. Data files are then captured and the data analyzed with the 
appropriate baseline windows and thresholds. 

The quantitative difference between the target and reference gene is then 
calculated and a relative expression value determined for all of the samples 
20 used. This procedure is then repeated for other genes functionally related to the 
gene as disclosed herein and the level of function, or expression, noted. The 
relative expression ratios for each pair of genes is determined (i.e., a ratio of 
expression is determined for each target gene versus each of the other genes for 
which expression is measured, where each gene's absolute expression is 
• 25 determined relative to the reference gene for each compound, or chemical agent, 
to be screened). The samples are then scored and ranked according to the 
degree of alteration of the expression profile in the treated samples relative to the 
control. The overall expression of the particular gene relative to the controls, as 
modulated by one chemical agent relative to another, is also ascertained. 
30 Chemical agents having the most effect on a given gene, or set of genes, are 
considered the most anti-neoplastic. 
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Table 6 below contains a listing of the genes (numbered 1 to 20) along 
with their Gencarta names (or accessions). Each gene is represented as a 
consensus sequence followed by predicted mRNA transcripts and then predicted 
polypeptides. All of the sequences, with additional information, are presented in 
Figure 1 . 

The present invention also relates to an isolated cancer target gene 
wherein said gene is a gene identified in Table 6. Thus, the present invention 
encompasses isolated genes identified herein as KIAA1274, NEK6, PAK2, 
PAK4, STK38L, ACP1, ARHC, CDC6, CDK7, CDKN3, CRK7, DUSP16, FIGNL1, 
GUK1, ITPR2, KCNK1, KCNK5, PRO2000, RFC2 and RIPK2 and uses of these 
genes, whether isolated or not, in any of the methods of the invention. 
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Table 6. Sequence Identification Numbers for genes, transcripts and 
polypeptides* 



Gene 


Accession No. 


Consensus 


Transcripts 


Polypeptides 


1 


AA383349 


1 


2-10 ! 


11-16 


2 


AA553584 


17 


18-22 


23-27 


3 


D61791 


28 


29-36 


37-40 


4 


F02366 


41 


42-56 


57-68 


5 


H61320 


69 


70-75 


76-78 


6 


HUMAAPA 


79 


80-104 


105-115 


7 


HUMPTPB 


116 


117-125 


126-131 


8 


R03897 


132 


133-150 


151-158 


9 


R14324 


159 


160-165 


166-170 


10 


R25184 


171 


172-176 


177-179 


11 


T08090 


180 


181-235 


236-254 


12 


T11445 


255 


256-282 


283-294 


13 


T23935 


295 


296-310 


311-315 


14 


T60764 


316 


317-329 


330-334 


15 


T62520 


335 


336-365 


366-378 


16 


T83032 


379 


380-389 


390-392 


17 


Z26993 


393 


394-415 


416-421 


18 


Z38709 


422 


423-439 


440-449 


19 


Z39663 


450 


451-461 


462-470 


20 


Z44462 


471 


472-490 


491-501 



'Accession numbers in Table 6 are for the Gencarta database. 
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WHAT IS CLAIMED IS: 



1. A method for identifying a cancer-target gene, comprising: 

a) identifying a gene that is at least 5 fold over-expressed in a cancer cell 
5 line and that maps to a chromosomal region with a CGH ratio of at least 1 .25; 

b) determining an RNA expression level of said gene of at least 1 .5 fold in 
a tumor tissue compared to corresponding normal tissue in a genetic database, 
and 

c) determining that said gene encodes a protein domain that is modulated 
1 0 by chemical compounds, 

wherein a gene that meets the criteria of steps a, b and c is considered to 
be a cancer-target gene, 

thereby identifying a cancer-target gene. 

15 2. A set of cancer-target genes identified by the method of claim 1 . 

3. A method for identifying an agent that modulates the activity of a 
cancer-target gene comprising: 

(a) contacting a test compound with a cell that expresses a polynucleotide 
20 that corresponds to a gene that has the properties of a, b and c of claim 1 and 

under conditions supporting said expression; and 

(b) determining a difference in expression of said gene relative to when 
said test compound is not present wherein said difference indicates gene 
modulating activity, 

25 thereby identifying said test compound as an agent that modulates the 

activity of said cancer-related gene. 

4. The method of claim 3 wherein said gene was first identified as a 
cancer target gene using the method of claim 1 or 2. 

30 
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5. The method of claim 4 wherein said gene is a gene selected from the 
group consisting of KIAA1274, NEK6, PAK2, PAK4, STK38L, ACP1, ARHC, 
CDC6, CDK7, CDKN3, CRK7, DUSP16, FIGNL1, GUK1, ITPR2, KCNK1, 
KCNK5, PRO2000, RFC2 and RIPK2. 

5 

6. The method of claim 3 wherein said expression is transcription to form 

RNA. 

7. The method of claim 3 wherein said expression is translation to form 
1 0 protein. 

8. The method of claim 4 wherein the cell is a cancer cell and the 
determined difference in expression is a decrease in expression. 

1 5 9. The method of claim 4 wherein the cell is a recombinant cell and the 

difference in expression is a decrease in expression. 

10. A method for identifying an anti-neoplastic agent comprising 
contacting a cell exhibiting neoplastic activity with a compound first identified as a 

20 cancer target gene modulator using the method of claim 3 and detecting a 
decrease in said neoplastic activity after said contacting compared to when said 
contacting does not occur. 

1 1 . The process of claim 10 wherein said neoplastic activity is accelerated 
25 cellular replication. 

12. The process of claim 10 wherein said decrease in neoplastic activity 
results from the death of the cell. 

30 13. A method for identifying an anti-neoplastic agent comprising 

contacting a cell exhibiting neoplastic activity with a compound that modulates 



58 



expression of at least one of genes 1 to 20 of Table 6 and detecting a decrease 
in said neoplastic activity after said contacting compared to when said contacting 
does not occur. 

5 14. The process of claim 13 wherein said neoplastic activity is accelerated 

cellular replication. 

15. The process of claim 13 wherein said decrease in neoplastic activity 
results from the death of the cell. 

10 

16. A method for identifying an antineoplastic agent comprising 
administering to an animal exhibiting a cancerous condition an effective amount 
of an agent that modulates expression of at least one of genes 1 to 20 of Table 6 
and detecting a decrease in said cancerous condition. 

15 

17. A method for identifying an antineoplastic agent comprising 
administering to an animal exhibiting a cancerous condition an effective amount 
of a cancer target gene modulating agent by the method of claim 3 and detecting 
a decrease in said cancerous condition. 

20 

18. A method for determining the cancerous status of a cell, comprising 
determining an increase in the level of expression in said cell of at least one gene 
that meets the criteria of a, b and c of claim 1 . 

25 19. The method of claim 14 wherein said gene is a gene selected from the 

group consisting of KIAA1274, NEK6, PAK2, PAK4, STK38L, ACP1, ARHC, 
CDC6, CDK7, CDKN3, CRK7, DUSP16, FIGNL1, GUK1, ITPR2, KCNK1, 
KCNK5, PRO2000, RFC2 and RIPK2. 

30 20. A method for identifying an agent that modulates the activity of a 

cancer-target polypeptide comprising: 
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(a) contacting a test compound with a cell expressing a polypeptide 
encoded by a polynucleotide corresponding to a gene having the properties of a, 
b and c of claim 1 and under conditions promoting the expression of said 
polypeptide; and 

5 (b) determining a difference in expression of said polypeptide relative to 

when said test compound is not present wherein said difference indicates 
cancer-target polypeptide modulating activity, 

thereby identifying a cancer-target polypeptide modulating agent 

10 21. The method of claim 20 wherein said polypeptide is encoded by a 

gene selected from the group consisting of KIAA1274, NEK6, PAK2, PAK4, 
STK38L, ACP1 , ARHC, CDC6, CDK7, CDKN3, CRK7, DUSP16, FIGNL1, GUK1, 
ITPR2, KCNK1 , KCNK5, PRO2000, RFC2 and RIPK2. 

15 22. A method for identifying an agent that modulates the activity of a 

cancer-target polypeptide comprising: 

(a) contacting a test compound with a polypeptide encoded by a 
polynucleotide corresponding to a gene having the properties of a, b and c of 
claim 1 and under conditions promoting the activity of said polypeptide; and 
20 (b) determining a difference in activity of said polypeptide relative to when 

said test compound is not present wherein said difference indicates cancer-target 
polypeptide modulating activity, 

thereby identifying a cancer-target polypeptide modulating agent 

25 23. The method of claim 22 wherein said polypeptide is encoded by a 

gene selected from the group consisting of KIAA1274, NEK6, PAK2, PAK4, 
STK38L, ACP1, ARHC, CDC6, CDK7, CDKN3, CRK7 t DUSP16, FIGNL1, GUK1, 
ITPR2, KCNK1, KCNK5, PRO2000, RFC2 and RIPKZ 

30 24. An antibody that binds to a polypeptide encoded by a gene having the 

properties of a, b and c of claim 1 . 
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25. The antibody of claim 24 wherein said gene is a gene selected from 
the group consisting of KIAA1274, NEK6, PAK2, PAK4, STK38L, ACP1, ARHC, 
CDC6, CDK7, CDKN3, CRK7, DUSP16, FIGNL1, GUK1, ITPR2, KCNK1, 
KCNK5, PRO2000, RFC2 and RIPK2. 

5 

26. The antibody of claim 24 wherein said antibody is a monoclonal 
antibody. 

27. The antibody of claim 24 wherein said antibody is a recombinant 
1 0 antibody. 

28. The antibody of claim 24 wherein said antibody is a synthetic antibody. 

29. The antibody of claim 24 wherein said antibody further comprises a 
1 5 cytotoxic agent. 

30. The antibody of claim 29 wherein said cytotoxic agent is an apoptotic 

agent. 

20 31. A method for treating cancer comprising contacting a cancerous cell 

with an effective amount of an agent that can reduce the activity of a gene having 
the properties of a, b and c of claim 1 . 

32. The method of claim 31 wherein said agent having activity in the 
25 method of claim 3. 

33. The method of claim 31 wherein said agent was first identified as 
having such activity using the method of claim 3. 

30 34. The method of claim 31 wherein said agent having activity in the 

method of claim 16. 
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35. The method of claim 31 wherein said agent was first identified as 
having such activity using the method of claim 16. 

36. The method of claim 31 wherein said agent having activity in the 
5 method of claim 22. 

37. The method of claim 31 wherein said agent was first identified as 
having such activity using the method of claim 22. 

1 0 38. The method of claim 31 wherein said gene is a gene selected from the 

group consisting of KIAA1274, NEK6, PAK2, PAK4, STK38L, ACP1, ARHC, 
CDC6, CDK7, CDKN3, CRK7, DUSP16, FIGNL1, GUK1, /7PR2, KCNK1, 
KCNK5, PRO2000, RFC2 and RIPK2. 

1 5 39. The method of claim 31 wherein said cancerous cell is contacted in 

vivo. 

40. The method of claim 31 wherein said agent has affinity for an 
expression product of said gene. 

20 

41. The method of claim 40 wherein said agent is an antibody of claim 24 

-30. 

42. A cancer target gene wherein said gene is a gene identified in Table 6. 

25 

43. A method for producing test data with respect to the gene 
modulating activity of a compound comprising: 

(a) contacting a compound with a cell containing a polynucleotide 
comprising a nucleotide sequence corresponding to a gene whose expression is 
30 increased in a cancerous cell over that in a non-cancerous cell and under 
conditions wherein said polynucleotide is being expressed, 

62 



(b) determining a change in expression of polynucleotides as a result of 
said contacting, and 

(c) producing test data with respect to the gene modulating activity of said 
compound based on a decrease in the expression of the determined gene whose 
expression is otherwise increased in a cancerous cell over that in a non- 
cancerous cell indicating gene modulating activity. 
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ABSTRACT 

Cancer-linked gene sequences, and derived amino acid sequences, are 
disclosed along with processes for assaying potential antitumor agents based on 
5 their modulation of the expression of these cancer-linked genes. Also disclosed 
are antibodies that react with the disclosed polypeptides and methods of 
diagnosing and treating cancer using the gene sequences. A novel gene and 
polypeptide are also disclosed. 

10 
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Figur 1 



SEQ ID N0:1. >AA383349 # TY Consensus # Length 3600 # Number of exons 23 

ggccggggacccgtgggccggttcctggggtggcccagagcaagggcaagtttcgcccgg 

gacccgcccagctggccgggagccagtagcagggaagggccggctggcgcgagcaccgcc 

cacgcggagccatgggggcggggcctgggcgggggcggggccgcggccgagggcggggca 

gggaggcagcatgctaaaccgggtgcgctcggccgtggcgcacctggtgagctccggggg 

cgctccgcctccgcgccccaaatccccggacctgcccaacgccgcctcggcgccgcccgc 

cgccgctccagaagcgcccaggagccctcccgcgaaggctgggagcgggagcgcgacgcc 

cgcgaaggctgttgaggctcgagcgagcttctccagaccgacctttctgcagctgagccc 

cggggggctgcgacgcgccgatgaccacgcgggccgggctgtgcaaagccccccggacac 

gggccgccgcctgccctggagcacaggctacgccgagtgagcgccccctggggcacccaa 

accaggatggggctcccacccctctccccagctccgcatccccggcgctaggacgcgttc 

cccacgccgcgtccgggccaggagctcccttttccgtggacctttgctatcctctggtct 

tcgggccgcaccccctcccaacccattttccagtggggggcagcctgtgtcaccttcttc 

acgtccttcccgctcattgactgccctcgcccacgccgcctcaggaccctgttctgcccc 

agagcccggagggcggagagcccggcgaaggatgagttggccagttccccgtcgcggccc 

ggcagcttaaaggctaagggaaaaggggtttcacgaaggagcggggttctttttaatagg 

ggacatagcggttgggaagactcgctcacccgcttcccggctccagcgccccagttccct 

gtccctcttaccgtagttcccctccccctccacacccagaaatagcccgcgacaccagga 

ggccgccagcttccccaggagcggggagggggacgcccggggtagaggagggtcccattt 

agatgcccttcagcctgccaactcgtgctggcctggcaaagaagcggaccccctgcccgg 

agcggccggctggcccccgggctgtgtgtattttaaatgcatctgccgggaacgcagagc 

accgagggagatgggggcgctcagttcgctgaggaaggtggctggtggcccatggaccca 

ccaccacctcccttagcctcctgtgtgggaggagtttatgggtatgtggctcctgcccag 

tccaggtgggctttcacttctactctatttcagttcctctttcccgatctgggctggaga 

gcttcctcattgttaaggcagcagaaactttcgctggatggttttaggataaggggtcat 

caatgctggcaagagtcggcacaatgaggaccaggcttgctgtgaagtggtgtatgtgga 

aggtcggaggagtgttacaggagtacctagggagcctagccgaggccagggactctgctt 

ctactactggggcctatttgatgggcatgcagggggcggagctgctgaaatggcctcacg 

gctcctgcatcgccatatccgagagcagctaaaggacctggtagagatacttcaggaccc 

ttcgccaccacccctctgcctcccaaccactccggggaccccagattcctccgatccctc 

tcacttgcttggccctcagtcctgctggtcttcacagaaggaagtgagccacgagagcct 

ggtagtgggggccattgagaatgccttccagctcatggatgagcagatggcccgggagcg 

gcgtggccaccaagtggaggggggctgctgtgcactggttgtgatctacctgctaggcaa 

ggtgtacgtggccaatgcaggcgatagcagggccatcattgtccggaatggtgaaatcat 

tccaatgtcccgggagtttaccccggagactgagcgccagcgtcttcagctgcttggtct 

gctgtagtcaacgtcatctgtgtgccaccgtgtgctgtgagcttaatgccccctggctct 

ctgttctgatcatccccaactccagggcttcctgaaaccagagctgctaggcagtgaatt 

cacccaccttgagttcccccgcagagttctgcccaaggagctggggcagaggatgttgta 

ccgggaccagaacatgaccggctgggcctacaaaaagatcgagctggaggatctcaggtt 

tcctctggtctgtggggagggcaaaaaggctcgggtgatggccaccattggggtgacccg 

aggcttgggagaccacagccttaaggtctgcagttccaccctgcccatcaagccctttct 

ctcctgcttccctgaggtacgagtgtatgacctgacacaatatgagcactgcccagatga 

tgtgctagtcctgggaacagatggcctgtgggatgtcactactgactgtgaggtagctgc 

cactgtggacagggtgctgtcggcctatgagcctaatgaccacagcagcacatgtgggcc 

tgtgtatctgtcttcccacatacatgttggtacatgaatgtgcatgtgtgctcctggcag 

gtatacagctctggcccaagctctggtcctgggggcccggggtaccccccgagaccgtgg 

ctggcgtctccccaacaacaagctgggttccggggatgacatctctgtcttcgtcatccc 

cctgggagggccaggcagttactcctgaggggctgaacaccatccctcccactagcctct 

ccatacttactcctctcacagcccaaattctgaagttgtctccctgacccttctttagtg 

gcaacttaactgaagaagggatgtccgctatatccaaaattacagctattggcaaataaa 

cgagatggataaaggccccaccatggctgcaatccgaaagaagctggtgatcgttgggga 

tggtgcctgtgggaagacctgcctcctcatcgtcttcagcaaggatcagtttccggaggt 

ctacgtccctactgtctttgagaactatattgcggacattgaggtggacggcaagcaggt 
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ggagctggctctgtgggacacagcagggcaggaagactatgatcgactgcggcctctctc 
ctacccggacactgatgtcatcctcatgtgcttctccatcgacagccctgacagcctgga 
aaacattcctgagaagtggaccccagaggtgaagcacttctgccccaacgtgcccatcat 
cctggtggggaataagaagaaaactcgtgccgttttagcgggtacaagatctaccccgga 
cacgggaggcgctacgncagggaccgacgggaaaggtttttncagtttcttaatgcgaaa 
tgcgagtcgggttttctttgccgcatacgtcgcactatcctctggatacgtcatcactct 
cgcagggcacagtcgacagtggtgctccttgaccattccttagtgaagaataaactaaac 
aacggatgcgcgcggatactcccgaagcacaggtaatggctgattgcgcgaatggatcat 

SEQ ID N0:2. >AA383349_T1 # TY Transcript # LN 2676 # Source Gene: AA383349 # 
Encoded protein: AA383349_P1 

ggccggggacccgtgggccggttcctggggtggcccagagcaagggcaagtttcgcccgg 
gacccgcccagctggccgggagccagtagcagggaagggccggctggcgcgagcaccgcc 
cacgcggagccatgggggcggggcctgggcgggggcggggccgcggccgagggcggggca 
gggaggcagcatgctaaaccgggtgcgctcggccgtggcgcacctggtgagctccggggg 
cgctccgcctccgcgccccaaatccccggacctgcccaacgccgcctcggcgccgcccgc 
cgccgctccagaagcgcccaggagccctcccgcgaaggctgggagcgggagcgcgacgcc 
cgcgaaggctgttgaggctcgagcgagcttctccagaccgacctttctgcagctgagccc 
cggggggctgcgacgcgccgatgaccacgcgggccgggctgtgcaaagccccccggacac 
gggccgccgcctgccctggagcacaggctacgccgagtgagcgccccctggggcacccaa 
accaggatggggctcccacccctctccccagctccgcatccccggcgctaggacgcgttc 
cccacgccgcgtccgggccaggagctcccttttccgtggacctttgctatcctctggtct 
tcgggccgcaccccctcccaacccattttccagtggggggcagcctgtgtcaccttcttc 
acgtccttcccgctcattgactgccctcgcccacgccgcctcaggaccctgttctgcccc 
agagcccggagggcggagagcccggcgaaggatgagttggccagttccccgtcgcggccc 
ggcagcttaaaggctaagggaaaaggggtttcacgaaggagcggggttctttttaatagg 
ggacatagcggttgggaagactcgctcacccgcttcccggctccagcgccccagttccct 
gtccctcttaccgtagttcccctccccctccacacccagaaatagcccgcgacaccagga 
ggccgccagcttccccaggagcggggagggggacgcccggggtagaggagggtcccattt 
agatgcccttcagcctgccaactcgtgctggcctggcaaagaagcggaccccctgcccgg 
agcggccggctggcccccgggctgtgtgtattttaaatgcatctgccgggaacgcagagc 
accgagggagatgggggcgctcagttcgctgaggaaggtggctggtggcccatggaccca 
ccaccacctcccttagcctcctgtgtgggaggagtttatgggtatgtggctcctgcccag 
tccaggtgggctttcacttctactctatttcagttcctctttcccgatctgggctggaga 
gcttcctcattgttaaggcagcagaaactttcgctggatggttttaggataaggggtcat 
caatgctggcaagagtcggcacaatgaggaccaggcttgctgtgaagtggtgtatgtgga 
aggtcggaggagtgttacaggagtacctagggagcctagccgaggccagggactctgctt 
ctactactggggcctatttgatgggcatgcagggggcggagctgctgaaatggcctcacg 
gctcctgcatcgccatatccgagagcagctaaaggacctgaaggaagtgagccacgagag 
cctggtagtgggggccattgagaatgccttccagctcatggatgagcagatggcccggga 
gcggcgtggccaccaagtggaggggggctgctgtgcactggttgtgatctacctgctagg 
caaggtgtacgtggccaatgcaggcgatagcagggccatcattgtccggaatggtgaaat 
cattccaatgtcccgggagtttaccccggagactgagcgccagcgtcttcagctgcttgg 
cttcctgaaaccagagctgctaggcagtgaattcacccaccttgagttcccccgcagagt 
tctgcccaaggagctggggcagaggatgttgtaccgggaccagaacatgaccggctgggc 
ctacaaaaagatcgagctggaggatctcaggtttcctctggtctgtggggagggcaaaaa 
ggctcgggtgatggccaccattggggtgacccgaggcttgggagaccacagccttaaggt 
ctgcagttccaccctgcccatcaagccctttctctcctgcttccctgaggtacgagtgta 
tgacctgacacaatatgagcactgcccagatgatgtgctagtcctgggaacagatggcct 
gtgggatgtcactactgactgtgaggtagctgccactgtggacagggtgctgtcggccta 
tgagcctaatgaccacagcaggtatacagctctggcccaagctctggtcctgggggcccg 
gggtaccccccgagaccgtggctggcgtctccccaacaacaagctgggttccggggatga 
catctctgtcttcgtcatccccctgggagggccaggcagttactcctgaggggctgaaca 
ccatccctcccactagcctctccatacttactcctctcacagcccaaattctgaagttgt 
ctccctgacccttctttagtggcaacttaactgaagaagggatgtccgctatatccaaaa 
ttacagctattggcaaataaacgagatggataaagg 



2/365 



Figure 1 (Cont'd) 



SEQ ID NO: 3. >AA383349_T2 # TY Transcript # LN 1875 # Source Gene: 

AA383349 # Encoded protein: AA383349_P4 

ggccggggacccgtgggccggttcctggggtggcccagagcaagggcaagtttcgcccgg 
gacccgcccagctggccgggagccagtagcagggaagggccggctggcgcgagcaccgcc 
cacgcggagccatgggggcggggcctgggcgggggcggggccgcggccgagggcggggca 
gggaggcagcatgctaaaccgggtgcgctcggccgtggcgcacctggtgagctccggggg 
cgctccgcctccgcgccccaaatccccggacctgcccaacgccgcctcggcgccgcccgc 
cgccgctccagaagcgcccaggagccctcccgcgaaggctgggagcgggagcgcgacgcc 
cgcgaaggctgttgaggctcgagcgagcttctccagaccgacctttctgcagctgagccc 
cggggggctgcgacgcgccgatgaccacgcgggccgggctgtgcaaagccccccggacac 
gggccgccgcctgccctggagcacaggctacgccgaggtcatcaatgctggcaagagtcg 
gcacaatgaggaccaggcttgctgtgaagtggtgtatgtggaaggtcggaggagtgttac 
aggagtacctagggagcctagccgaggccagggactctgcttctactactg'gggcctatt 
tgatgggcatgcagggggcggagctgctgaaatggcctcacggctcctgcatcgccatat 
ccgagagcagctaaaggacctggtagagatacttcaggacccttcgccaccacccctctg 
cctcccaaccactccggggaccccagattcctccgatccctctcacttgcttggccctca 
gtcctgctggtcttcacagaaggaagtgagccacgagagcctggtagtgggggccattga 
gaatgccttccagctcatggatgagcagatggcccgggagcggcgtggccaccaagtgga 
ggggggctgctgtgcactggttgtgatctacctgctaggcaaggtgtacgtggccaatgc 
aggcgatagcagggccatcattgtccggaatggtgaaatcattccaatgtcccgggagtt 
taccccggagactgagcgccagcgtcttcagctgcttggcttcctgaaaccagagctgct 
aggcagtgaattcacccaccttgagttcccccgcagagttctgcccaaggagctggggca 
gaggatgttgtaccgggaccagaacatgaccggctgggcctacaaaaagatcgagctgga 
ggatctcaggtttcctctggtctgtggggagggcaaaaaggctcgggtgatggccaccat 
tggggtgacccgaggcttgggagaccacagccttaaggtctgcagttccaccctgcccat 
caagccctttctctcctgcttccctgaggtacgagtgtatgacctgacacaatatgagca 
ctgcccagatgatgtgctagtcctgggaacagatggcctgtgggatgtcactactgactg 
tgaggtagctgccactgtggacagggtgctgtcggcctatgagcctaatgaccacagcag 
gtatacagctctggcccaagctctggtcctgggggcccggggtaccccccgagaccgtgg 
ctggcgtctccccaacaacaagctgggttccggggatgacatctctgtcttcgtcatccc 
cctgggagggccaggcagttactcctgaggggctgaacaccatccctcccactagcctct 
ccatacttactcctctcacagcccaaattctgaagttgtctccctgacccttctttagtg 
gcaacttaactgaagaagggatgtccgctatatccaaaattacagctattggcaaataaa 
cgagatggataaagg 

SEQ ID NO: 4. >AA383349_T3 # TY Transcript # LN 1995 # Source Gene: 

AA38334 9 # Encoded protein: AA38334 9_P4 

ggccggggacccgtgggccggttcctggggtggcccagagcaagggcaagtttcgcccgg 
gacccgcccagctggccgggagccagtagcagggaagggccggctggcgcgagcaccgcc 
cacgcggagccatgggggcggggcctgggcgggggcggggccgcggccgagggcggggca 
gggaggcagcatgctaaaccgggtgcgctcggccgtggcgcacctggtgagctccggggg 
cgctccgcctccgcgccccaaatccccggacctgcccaacgccgcctcggcgccgcccgc 
cgccgctccagaagcgcccaggagccctcccgcgaaggctgggagcgggagcgcgacgcc 
cgcgaaggctgttgaggctcgagcgagcttctccagaccgacctttctgcagctgagccc 
cggggggctgcgacgcgccgatgaccacgcgggccgggctgtgcaaagccccccggacac 
gggccgccgcctgccctggagcacaggctacgccgaggtcatcaatgctggcaagagtcg 
gcacaatgaggaccaggcttgctgtgaagtggtgtatgtggaaggtcggaggagtgttac 
aggagtacctagggagcctagccgaggccagggactctgcttctactactggggcctatt 
tgatgggcatgcagggggcggagctgctgaaatggcctcacggctcctgcatcgccatat 
ccgagagcagctaaaggacctggtagagatacttcaggacccttcgccaccacccctctg 
cctcccaaccactccggggaccccagattcctccgatccctctcacttgcttggccctca 
gtcctgctggtcttcacagaaggaagtgagccacgagagcctggtagtgggggccattga 
gaatgccttccagctcatggatgagcagatggcccgggagcggcgtggccaccaagtgga 
ggggggctgctgtgcactggttgtgatctacctgctaggcaaggtgtacgtggccaatgc 
aggcgatagcagggccatcattgtccggaatggtgaaatcattccaatgtcccgggagtt 
taccccggagactgagcgccagcgtcttcagctgcttggcttcctgaaaccagagctgct 
aggcagtgaattcacccaccttgagttcccccgcagagttctgcccaaggagctggggca 
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gaggatgttgtaccgggaccagaacatgaccggctgggcctacaaaaagatcgagctgga 
ggatctcaggtttcctctggtctgtggggagggcaaaaaggctcgggtgatggccaccat 
tggggtgacccgaggcttgggagaccacagccttaaggtctgcagttccaccctgcccat 
caagccctttctctcctgcttccctgaggtacgagtgtatgacctgacacaatatgagca 
ctgcccagatgatgtgctagtcctgggaacagatggcctgtgggatgtcactactgactg 
tgaggtagctgccactgtggacagggtgctgtcggcctatgagcctaatgaccacagcag 
gtatacagctctggcccaagctctggtcctgggggcccggggtaccccccgagaccgtgg 
ctggcgtctccccaacaacaagctgggttccggggatgacatctctgtcttcgtcatccc 
cctgggagggccaggcagttactcctgaggggctgaacaccatccctcccactagcctct 
ccatacttactcctctcacagcccaaattctgaagttgtctccctgacccttctttagtg 
gcaacttaactgaagaagggatgtccgctatatccaaaattacagctattggcaaataaa 
cgagatggataaaggaaaactc'gtgccgttttagcgggtacaagatctaccccggacacg 
ggaggcgctacgncagggaccgacgggaaaggtttttncagtttcttaatgcgaaatgcg 
agtcgggttttcttt 

SEQ ID NO: 5. >AA383349_T4 # TY Transcript # LN 1965 # Source Gene: 

AA383349 # Encoded protein: AA383349_P3 

ggccggggacccgtgggccggttcctggggtggcccagagcaagggcaagtttcgcccgg 
gacccgcccagctggccgggagccagtagcagggaagggccggctggcgcgagcaccgcc 
cacgcggagccatgggggcggggcctgggcgggggcggggccgcggccgagggcggggca 
gggaggcagcatgctaaaccgggtgcgctcggccgtggcgcacctggtgagctccggggg 
cgctccgcctccgcgccccaaatccccggacctgcccaacgccgcctcggcgccgcccgc 
cgccgctccagaagcgcccaggagccctcccgcgaaggctgggagcgggagcgcgacgcc 
cgcgaaggctgttgaggctcgagcgagcttctccagaccgacctttctgcagctgagccc 
cggggggctgcgacgcgccgatgaccacgcgggccgggctgtgcaaagccccccggacac 
gggccgccgcctgccctggagcacaggctacgccgaggtcatcaatgctggcaagagtcg 
gcacaatgaggaccaggcttgctgtgaagtggtgtatgtggaaggtcggaggagtgttac 
aggagtacctagggagcctagccgaggccagggactctgcttctactactggggcctatt 
tgatgggcatgcagggggcggagctgctgaaatggcctcacggctcctgcatcgccatat 
ccgagagcagctaaaggacctggtagagatacttcaggacccttcgccaccacccctctg 
cctcccaaccactccggggaccccagattcctccgatccctctcacttgcttggccctca 
gtcctgctggtcttcacagaaggaagtgagccacgagagcctggtagtgggggccattga 
gaatgccttccagctcatggatgagcagatggcccgggagcggcgtggccaccaagtgga 
ggggggctgctgtgcactggttgtgatctacctgctaggcaaggtgtacgtggccaatgc 
aggcgatagcagggccatcattgtccggaatggtgaaatcattccaatgtcccgggagtt 
taccccggagactgagcgccagcgtcttcagctgcttggtctgctgtagtcaacgtcatc 
tgtgtgccaccgtgtgctgtgagcttaatgccccctggctctctgttctgatcatcccca 
actccagggcttcctgaaaccagagctgctaggcagtgaattcacccaccttgagttccc 
ccgcagagttctgcccaaggagctggggcagaggatgttgtaccgggaccagaacatgac 
cggctgggcctacaaaaagatcgagctggaggatctcaggtttcctctggtctgtgggga 
gggcaaaaaggctcgggtgatggccaccattggggtgacccgaggcttgggagaccacag 
ccttaaggtctgcagttccaccctgcccatcaagccctttctctcctgcttccctgaggt 
acgagtgtatgacctgacacaatatgagcactgcccagatgatgtgctagtcctgggaac 
agatggcctgtgggatgtcactactgactgtgaggtagctgccactgtggacagggtgct 
gtcggcctatgagcctaatgaccacagcaggtatacagctctggcccaagctctggtcct 
gggggcccggggtaccccccgagaccgtggctggcgtctccccaacaacaagctgggttc 
cggggatgacatctctgtcttcgtcatccccctgggagggccaggcagttactcctgagg 
ggctgaacaccatccctcccactagcctctccatacttactcctctcacagcccaaattc 
tgaagttgtctccctgacccttctttagtggcaacttaactgaagaagggatgtccgcta 
tatccaaaattacagctattggcaaataaacgagatggataaagg 

SEQ ID NO: 6. >AA38334 9_T5 S TY Transcript # LN 1969 # Source Gene: 

AA383349 # Encoded protein: AA383349_P4 

ggccggggacccgtgggccggttcctggggtggcccagagcaagggcaagtttcgcccgg 
gacccgcccagctggccgggagccagtagcagggaagggccggctggcgcgagcaccgcc 
cacgcggagccatgggggcggggcctgggcgggggcggggccgcggccgagggcggggca 
gggaggcagcatgctaaaccgggtgcgctcggccgtggcgcacctggtgagctccggggg 
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cgctccgcctccgcgccccaaatccccggacctgcccaacgccgcctcggcgccgcccgc 
cgccgctccagaagcgcccaggagccctcccgcgaaggctgggagcgggagcgcgacgcc 
cgcgaaggctgttgaggctcgagcgagcttctccagaccgacctttctgcagctgagccc 
cggggggctgcgacgcgccgatgacc.acgcgggccgggctgtgcaaagccccccggacac 
gggccgccgcctgccctggagcacaggctacgccgagaccctgttctgccccagagcccg 
gagggcggagagcccggcgaaggatgagttggccagttccccgtcgcggcccggcagctt 
aaaggctaagggtcatcaatgctggcaagagtcggcacaatgaggaccaggcttgctgtg 
aagtggtgtatgtggaaggtcggaggagtgttacaggagtacctagggagcctagccgag 
gccagggactctgcttctactactggggcctatttgatgggcatgcagggggcggagctg 
ctgaaatggcctcacggctcctgcatcgccatatccgagagcagctaaaggacctggtag 
agatacttcaggacccttcgccaccacccctctgcctcccaaccactccggggaccccag 
attcctccgatccctctcacttgcttggccctcagtcctgctggtcttcacagaaggaag 
tgagccacgagagcctggtagtgggggccattgagaatgccttccagctcatggatgagc 
agatggcccgggagcggcgtggccaccaagtggaggggggctgctgtgcactggttgtga 
tctacctgctaggcaaggtgtacgtggccaatgcaggcgatagcagggccatcattgtcc 
ggaatggtgaaatcattccaatgtcccgggagtttaccccggagactgagcgccagcgtc 
ttcagctgcttggcttcctgaaaccagagctgctaggcagtgaattcacccaccttgagt 
tcccccgcagagttctgcccaaggagctggggcagaggatgttgtaccgggaccagaaca 
tgaccggctgggcctacaaaaagatcgagctggaggatctcaggtttcctctggtctgtg 
gggagggcaaaaaggctcgggtgatggccaccattggggtgacccgaggcttgggagacc 
acagccttaaggtctgcagttccaccctgcccatcaagccctttctctcctgcttccctg 
aggtacgagtgtatgacctgacacaatatgagcactgcccagatgatgtgctagtcctgg 
gaacagatggcctgtgggatgtcactactgactgtgaggtagctgccactgtggacaggg 
tgctgtcggcctatgagcctaatgaccacagcaggtatacagctctggcccaagctctgg 
tcctgggggcccggggtaccccccgagaccgtggctggcgtctccccaacaacaagctgg 
gttccggggatgacatctctgtcttcgtcatccccctgggagggccaggcagttactcct 
gaggggctgaacaccatccctcccactagcctctccatacttactcctctcacagcccaa 
attctgaagttgtctccctgacccttctttagtggcaacttaactgaagaagggatgtcc 
gctatatccaaaattacagctattggcaaataaacgagatggataaagg 

SEQ ID NO: 7. >AA383349JT6 # TY Transcript # LN 2545 # Source Gene: 

AA383349 # Encoded protein: AA383349_P4 

ggccggggacccgtgggccggttcctggggtggcccagagcaagggcaagtttcgcccgg 
gacccgcccagctggccgggagccagtagcagggaagggccggctggcgcgagcaccgcc 
cacgcggagccatgggggcggggcctgggcgggggcggggccgcggccgagggcggggca 
gggaggcagcatgctaaaccgggtgcgctcggccgtggcgcacctggtgagctccggggg 
cgctccgcctccgcgccccaaatccccggacctgcccaacgccgcctcggcgccgcccgc 
cgccgctccagaagcgcccaggagccctcccgcgaaggctgggagcgggagcgcgacgcc 
cgcgaaggctgttgaggctcgagcgagcttctccagaccgacctttctgcagctgagccc 
cggggggctgcgacgcgccgatgaccacgcgggccgggctgtgcaaagccccccggacac 
gggccgccgcctgccctggagcacaggctacgccgagaccctgttctgccccagagcccg 
gagggcggagagcccggcgaaggatgagttggccagttccccgtcgcggcccggcagctt 
aaaggctaagggaaaaggggtttcacgaaggagcggggttctttttaataggggacatag 
cggttgggaagactcgctcacccgcttcccggctccagcgccccagttccctgtccctct 
taccgtagttcccctccccctccacacccagaaatagcccgcgacaccaggaggccgcca 
gcttccccaggagcggggagggggacgcccggggtagaggagggtcccatttagatgccc 
ttcagcctgccaactcgtgctggcctggcaaagaagcggaccccctgcccggagcggccg 
gctggcccccgggctgtgtgtattttaaatgcatctgccgggaacgcagagcaccgaggg 
agatgggggcgctcagttcgctgaggaaggtggctggtggcccatggacccaccaccacc 
tcccttagcctcctgtgtgggaggagtttatgggtatgtggctcctgcccagtccaggtg 
ggctttcacttctactctatttcagttcctctttcccgatctgggctggagagcttcctc 
attgttaaggcagcagaaactttcgctggatggttttaggataaggggtcatcaatgctg 
gcaagagtcggcacaatgaggaccaggcttgctgtgaagtggtgtatgtggaaggtcgga 
ggagtgttacaggagtacctagggagcctagccgaggccagggactctgcttctactact 
ggggcctatttgatgggcatgcagggggcggagctgctgaaatggcctcacggctcctgc 
atcgccatatccgagagcagctaaaggacctggtagagatacttcaggacccttcgccac 
cacccctctgcctcccaaccactccggggaccccagattcctccgatccctctcacttgc 
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Figure 1 (Cont'd) 



ttggccctcagtcctgctggtcttcacagaaggaagtgagccacgagagcctggtagtgg 
gggccattgagaatgccttccagctcatggatgagcagatggcccgggagcggcgtggcc 
accaagtggaggggggctgctgtgcactggttgtgatctacctgctaggcaaggtgtacg 
tggccaatgcaggcgatagcagggccatcattgtccggaatggtgaaatcattccaatgt 
cccgggagtttaccccggagactgagcgccagcgtcttcagctgcttggcttcctgaaac 
cagagctgctaggcagtgaattcacccaccttgagttcccccgcagagttctgcccaagg 
agctggggcagaggatgttgtaccgggaccagaacatgaccggctgggcctacaaaaaga 
tcgagctggaggatctcaggtttcctctggtctgtggggagggcaaaaaggctcgggtga 
tggccaccattggggtgacccgaggcttgggagaccacagccttaaggtctgcagttcca 
ccctgcccatcaagccctttctctcctgcttccctgaggtacgagtgtatgacctgacac 
aatatgagcactgcccagatgatgtgctagtcctgggaacagatggcctgtgggatgtca 
ctactgactgtgaggtagctgccactgtggacagggtgctgtcggcctatgagcctaatg 
accacagcaggtatacagctctggcccaagctctggtcctgggggcccggggtacccccc 
gagaccgtggctggcgtctccccaacaacaagctgggttccggggatgacatctctgtct 
tcgtcatccccctgggagggccaggcagttactcctgaggggctgaacaccatccctccc 
actagcctctccatacttactcctctcacagcccaaattctgaagttgtctccctgaccc 
ttctttagtggcaacttaactgaagaagggatgtccgctatatccaaaattacagctatt 
ggcaaataaacgagatggataaagg 

SEQ ID NO: 8. >AA38334 9_T7 # TY Transcript # LN 1876 # Source Gene: 

AA383349 # Encoded protein: AA383349_P2 

ggccggggacccgtgggccggttcctggggtggcccagagcaagggcaagtttcgcccgg 
gacccgcccagctggccgggagccagtagcagggaagggccggctggcgcgagcaccgcc 
cacgcggagccatgggggcggggcctgggcgggggcggggccgcggccgagggcggggca 
gggaggcagcatgctaaaccgggtgcgctcggccgtggcgcacctggtgagctccggggg 
cgctccgcctccgcgccccaaatccccggacctgcccaacgccgcctcggcgccgcccgc 
cgccgctccagaagcgcccaggagccctcccgcgaaggctgggagcgggagcgcgacgcc 
cgcgaaggctgttgaggctcgagcgagcttctccagaccgacctttctgcagctgagccc 
cggggggctgcgacgcgccgatgaccacgcgggccgggctgtgcaaagccccccggacac 
gggccgccgcctgccctggagcacaggctacgccgaggtcatcaatgctggcaagagtcg 
gcacaatgaggaccaggcttgctgtgaagtggtgtatgtggaaggtcggaggagtgttac 
aggagtacctagggagcctagccgaggccagggactctgcttctactactggggcctatt 
tgatgggcatgcagggggcggagctgctgaaatggcctcacggctcctgcatcgccatat 
ccgagagcagctaaaggacctggtagagatacttcaggacccttcgccaccacccctctg 
cctcccaaccactccggggaccccagattcctccgatccctctcacttgcttggccctca 
gtcctgctggtcttcacagaaggaagtgagccacgagagcctggtagtgggggccattga 
gaatgccttccagctcatggatgagcagatggcccgggagcggcgtggccaccaagtgga 
ggggggctgctgtgcactggttgtgatctacctgctaggcaaggtgtacgtggccaatgc 
aggcgatagcagggccatcattgtccggaatggtgaaatcattccaatgtcccgggagtt 
taccccggagactgagcgccagcgtcttcagctgcttggcttcctgaaaccagagctgct 
aggcagtgaattcacccaccttgagttcccccgcagagttctgcccaaggagctggggca 
gaggatgttgtaccgggaccagaacatgaccggctgggcctacaaaaagatcgagctgga 
ggatctcaggtttcctctggtctgtggggagggcaaaaaggctcgggtgatggccaccat 
tggggtgacccgaggcttgggagaccacagccttaaggtctgcagttccaccctgcccat 
caagccctttctctcctgcttccctgaggtacgagtgtatgacctgacacaatatgagca 
ctgcccagatgatgtgctagtcctgggaacagatggcctgtgggatgtcactactgactg 
tgaggtagctgccactgtggacagggtgctgtcggcctatgagcctaatgaccacagcag 
gtatacagctctggcccaagctctggtcctgggggcccggggtaccccccgagaccgtgg 
ctggcgtctccccaacaacaagctgggttccggggatgacatctctgtcttcgtcatccc 
cctgggagggccaggcagttactcctgaggggctggccgcatacgtcgcactatcctctg 
gatacgtcatcactctcgcagggcacagtcgacagtggtgctccttgaccattccttagt 
gaagaataaactaaacaacggatgcgcgcggatactcccgaagcacaggtaatggctgat 
tgcgcgaatggatcat 

SEQ ID N0:9. >AA383349_T8 # TY Transcript # LN 1950 # Source Gene: 

AA38334 9 # Encoded protein: AA38334 9_P5 

ggccggggacccgtgggccggttcctggggtggcccagagcaagggcaagtttcgcccgg 
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Figur 1 (Conf d) 



gacccgcccagctggccgggagccagtagcagggaagggccggctggcgcgagcaccgcc 
cacgcggagccatgggggcggggcctgggcgggggcggggccgcggccgagggcggggca 
gggaggcagcatgctaaaccgggtgcgctcggccgtggcgcacctggtgagctccggggg 
cgctccgcctccgcgccccaaatccccggacctgcccaacgccgcctcggcgccgcccgc 
cgccgctccagaagcgcccaggagccctcccgcgaaggctgggagcgggagcgcgacgcc 
cgcgaaggctgttgaggctcgagcgagcttctccagaccgacctttctgcagctgagccc 
cggggggctgcgacgcgccgatgaccacgcgggccgggctgtgcaaagccccccggacac 
gggccgccgcctgccctggagcacaggctacgccgaggtcatcaatgctggcaagagtcg 
gcacaatgaggaccaggcttgctgtgaagtggtgtatgtggaaggtcggaggagtgttac 
aggagtacctagggagcctagccgaggccagggactctgcttctactactggggcctatt 
tgatgggcatgcagggggcggagctgctgaaatggcctcacggctcctgcatcgccatat 
ccgagagcagctaaaggacctggtagagatacttcaggacccttcgccaccacccctctg 
cctcccaaccactccggggaccccagattcctccgatccctctcacttgcttggccctca 
gtcctgctggtcttcacagaaggaagtgagccacgagagcctggtagtgggggccattga 
gaatgccttccagctcatggatgagcagatggcccgggagcggcgtggccaccaagtgga 
ggggggctgctgtgcactggttgtgatctacctgctaggcaaggtgtacgtggccaatgc 
aggcgatagcagggccatcattgtccggaatggtgaaatcattccaatgtcccgggagtt 
taccccggagactgagcgccagcgtcttcagctgcttggtctgctgtagtcaacgtcatc 
tgtgtgccaccgtgtgctgtgagcttaatgccccctggctctctgttctgatcatcccca 
actccagggcttcctgaaaccagagctgctaggcagtgaattcacccaccttgagttccc 
ccgcagagttctgcccaaggagctggggcagaggatgttgtaccgggaccagaacatgac 
cggctggctcgggtgatggccaccattggggtgacccgaggcttgggagaccacagcctt 
aaggtctgcagttccaccctgcccatcaagccctttctctcctgcttccctgaggtacga 
gtgtatgacctgacacaatatgagcactgcccagatgatgtgctagtcctgggaacagat 
ggcctgtgggatgtcactactgactgtgaggtagctgccactgtggacagggtgctgtcg 
gcctatgagcctaatgaccacagcagccccaccatggctgcaatccgaaagaagctggtg 
atcgttggggatggtgcctgtgggaagacctgcctcctcatcgtcttcagcaaggatcag 
tttccggaggtctacgtccctactgtctttgagaactatattgcggacattgaggtggac 
ggcaagcaggtggagctggctctgtgggacacagcagggcaggaagactatgatcgactg 
cggcctctctcctacccggacactgatgtcatcctcatgtgcttctccatcgacagccct 
gacagcctggaaaacattcctgagaagtggaccccagaggtgaagcacttctgccccaac 
gtgcccatcatcctggtggggaataagaag 

SEQ ID NO: 10. >AA383349_T9 # TY Transcript # LN 387 # Source Gene: 

AA38334 9 # Encoded protein: AA38334 9_P6 

cacatgtgggcctgtgtatctgtcttcccacatacatgttggtacatgaatgtgcatgtg 
tgctcctggcaggtatacagctctggcccaagctctggtcctgggggcccggggtacccc 
ccgagaccgtggctggcgtctccccaacaacaagctgggttccggggatgacatctctgt 
cttcgtcatccccctgggagggccaggcagttactcctgaggggctgaacaccatccctc 
ccactagcctctccatacttactcctctcacagcccaaattctgaagttgtctccctgac 
ccttctttagtggcaacttaactgaagaagggatgtccgctatatccaaaattacagcta 
ttggcaaataaacgagatggataaagg 

SEQ ID NO: 11. >AA38334 9_P1 # TY Protein # CC #LN 299 # Source Gene: 

AA38334 9 # Encoding Transcript: 1 

MASRLLHRHIREQLKDLKEVSHESLWGAIENAFQLMDEQMARERRGHQVEGGCCALVVI 
YLLGKVYVANAGDSRAI I VRNGE I I PMSREFTPETERQRLQLLGFLKPELLGSEFTHLEF 
PRRVLPKELGQRMLYRDQNMTGWAYKKIELEDLRFPLVCGEGKKARVMATIGVTRGLGDH 
SLKVCSSTLPIKPFLSCFPEVRVYDLTQYEHCPDDVLVLGTDGLWDVTTDCEVAATVDRV 
LSAYEPNDHSRYTALAQALVLGARGTPRDRGWRLPNNKLGSGDDISVFVIPLGGPGSYS 

SEQ ID NO: 12. >AA383349_P4 # TY Protein # CC #LN 338 # Source Gene: 

AA38334 9 # Encoding Transcript: 2 

MASRLLHRHIREQLKDLVEILQDPSPPPLCLPTTPGTPDSSDPSHLLGPQSCWSSQKEVS 
HESLWGAI ENAFQLMDEQMARERRGHQVEGGCCALW I YLLGKVYVANAGDSRAI I VRN 
GEIIPMSREFTPETERQRLQLLGFLKPELLGSEFTHLEFPRRVLPKELGQRMLYRDQNMT 
GWAYKKIELEDLRFPLVCGEGKKARVMATIGVTRGLGDHSLKVCSSTLPIKPFLSCFPEV 
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Figure 1 (Cont'd) 



RVYDLTQYEHCPDDVLVLGTDGLWDVTTDCEVAATVDRVLSAYEPNDHSRYTALAQALVL 
GARGTPRDRGWRLPNNKLGSGDDISVFVI PLGGPGSYS 

SEQ ID NO: 13. >AA383349_P3 # TY Protein # CC #LN 535 # Source Gene: 

AA383349 # Encoding Transcript: 4 

MLNRVRSAVAHLVSSGGAPPPRPKSPDLPNAASAPPAAAPEAPRSPPAKAGSGSATPAKA 
VEARASFSRPTFLQLSPGGLRRADDHAGRAVQSPPDTGRRLPWSTGYAEVINAGKSRHNE 
DQACCEWYVEGRRSVTGVPREPSRGQGLCFYYWGLFDGHAGGGAAEMASRLLHRHIREQ 
LKDLVEILQDPSPPPLCLPTTPGTPDSSDPSHLLGPQSCWSSQKEVSHESLWGAIENAF 
QLMDEQMARERRGHQVEGGCCALWI YLLGKVYVANAGDSRAI I VRNGE 1 1 PMSRE FT PE 
TERQRLQLLGLLYQRHLCATVCCELNAPWLSVLIIPNIQGFLKPELLGSEFTHLEFPRRV 
LPKELGQRMLYRDQNMTGWAYKKIELEDLRFPLVCGEGKKARVMATIGVTRGLGDHSLKV 
CSSTLPIKPFLSCFPEVRVYDLTQYEHCPDDVLVLGTDGLWDVTTDCEVAATVDRVLSAY 
EPNDHSRYTAIAQALVLGARGTPRDRGWRLPNNKLGSGDDISVFVIPLGGPGSYS 

SEQ ID NO: 14. >AA383349_P2 # TY Protein # CC #LN 505 # Source Gene: 

AA383349 # Encoding Transcript: 7 

MLNRVRSAVAHLVSSGGAPPPRPKSPDLPNAASAPPAAAPEAPRSPPAKAGSGSATPAKA 
VEARASFSRPTFLQLSPGGLRRADDHAGRAVQSPPDTGRRLPWSTGYAEVINAGKSRHNE 
DQACCEWYVEGRRSVTGVPREPSRGQGLCFYYWGLFDGHAGGGAAEMASRLLHRHIREQ 
LKDLVEILQDPSPPPLCLPTTPGTPDSSDPSHLLGPQSCWSSQKEVSHESLWGAI ENAF 
QLMDEQMARERRGHQVEGGCCALWI YLLGKVYVANAGDSRAI I VRNGE 1 1 PMSRE FT PE 
TERQRLQLLGFLKPELLGSEFTHLEFPRRVLPKELGQRMLYRDQNMTGWAYKKIELEDLR 
FPLVCGEGKKARVMATIGVTRGLGDHSLKVCSSTLPIKPFLSCFPEVRVYDLTQYEHCPD 
DVLVLGTDGLWDVTTDCEVAATVDRVLSAYEPNDHSRYTALAQALVLGARGTPRDRGWRL 
PNNKLGSGDDISVFVI PLGGPGSYS 

SEQ ID NO:15. >AA383349_P5 # TY Protein # CC #LN 312 # Source Gene: 

AA383349 # Encoding Transcript: 8 

MLNRVRSAVAHLVSSGGAPPPRPKSPDLPNAASAPPAAAPEAPRSPPAKAGSGSATPAKA 
VEARAS FS RPT FLQLS PGGLRRADDHAGRAVQS P PDTGRRLPWSTGYAE VINAGKSRHNE 
DQACCEWYVEGRRSVTGVPREPSRGQGLCFYYWGLFDGHAGGGAAEMASRLLHRHIREQ 
LKDLVEILQDPSPPPLCLPTTPGTPDSSDPSHLLGPQSCWSSQKEVSHESLWGAIENAF 
QLMDEQMARERRGHQVEGGCCALWIYLLGKVYVANAGDSRAIIVRNGEIIPMSREFTPE 
TERQRLQLLGLL 

SEQ ID NO: 16. >AA383349_P6 # TY Protein # CC #LN 56 # Source Gene: 

AA383349 # Encoding Transcript: 9 

MCMCAPGRYTALAQALVLGARGTPRDRGWRLPNNKLGSGDDISVFVI PLGGPGSYS 



SEQ ID NO: 17. >AA553584 # TY Consensus # Length 5315 # Number of exons 28 

cggggygsgcaggtcccgtcgccggtgagcacgggctccctctcgcgtggcctcgccggg 

tccgcctggcctgcccacctccggagccacctctgcccccgcatgggctggcgaagttgg 

gaggagcgagctggagccagagcgcgcgccgggcgcgccccgtcgctgcctgactcggcg 

cccgcagttcgggcgcagcacgccggccgcaggagcacggatgccccccggagccgcggg 

ctggcaggtctggggtcctgaggctgctggcagactatgggtacaacggccagcacagcc 

cagcagacggtctcggcaggcaccccatttgagggcctacagggcagtggcacgatggac 

agtcggcactccgtcagcatccactccttccagagcactagcttgcataacagcaaggcc 

aagtccatcatccccaacaaggtggcccctgttgtgatcacgtacaactgcaaggaggag 

ttccagatccatgatgagctgctcaaggctcattacacgttgggccggctctcggacaac 

acccctgagcactacctggtgcaaggccgctacttcctggtgcgggatgtcactgagaag 

atggatgtgctgggcaccgtgggaagctgtggggcccccaacttccggcaggtgcagggt 

gggctcactgtgttcggcatgggacagcccagcctctcagggttcaggcgggtcctccag 
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Figur 1 (Cont'd) 



aaactccagaaggacggacatagggagtgtgtcatcttctgtgtgcgggaggaacctgtg 
cttttcctgcgtgcagatgaggactttgtgtcctacacacctcgagacaagcagaacctt 
catgagaacctccagggccttggacccggggtccgggtggagagcctggagctggccatc 
cggaaagagatccacgactttgcccagctgagcgagaacacataccatgtgtaccataac 
accgaggacctgtggggggagccccatgctgtggccatccatggtgaggacgacttgcat 
gtgacggaggaggtgtacaagcggcccctcttcctgcagcccacctacaggtaccaccgc 
ctgcccctgcccgagcaagggagtcccctggaggcccagttggacgcctttgtcagtgtt 
ctccggctacgggaccccccccgcccgccgccgccccctcatgccaccaggaggggtggt 
accgggccggtatttcgcgaaagcgtcgacccgcgcgtcctgagccccgtcccagacana 
gagacccccagcctgctgcagctccgtgatgcccacgggcctcccccagccctcgtcttc 
agctgccagatgggcgtgggcaggaccaacctgggcatggtcctgggcaccctcatcctg 
cttcaccgcagtgggaccacctcccagccagaggctgcccccacgcaggccaagcccctg 
cctatggagcagttccaggtgatccagagctttctccgcatggtgccccagggaaggagg 
atggtggaagaggtggacagagccatcactgcctgtgccgagttgcatgacctgaaagaa 
gtggtcttggaaaaccagaagaagttagaaggtatccgaccggagagcccagcccaggga 
agcggcagccgacacagcgtctggcagagggcgctgtggagcctggagcgatacttctac 
ctgatcctgtttaactactaccttcatgagcagtacccgctggcctttgccctcagtttc 
agccgctggctgtgtgcccaccctgagctgtaccgcctgcccgtgacgctgagctcagca 
ggccctgtggctccgagggacctcatcgccaggggctccctacgggaggacgatctggtc 
tccccggacgcgctcagcactgtcagagagatggatgtggccaacttccggcgggtgccc 
cgcatgcccatctacggcacggcccagcccagcgccaaggccctggggagcatcctggcc 
tacctgacggacgccaagaggaggctgcggaaggttgtctgggtgagccttcgggaggag 
gccgtgttggagtgtgacgggcacacctacagcctgcggtggcctgggccccctgtggct 
cctgaccagctggagaccctggaggcccagctgaaggcccatctaagcgagcctccccca 
ggcaaggagggccccctgacctacaggttccagacctgccttaccatgcaggaggtcttc 
agccagcaccgcagggcctgtcctggcctcacctaccaccgcatccccatgccggacttc 
tgtgccccccgagaggaggtgagggtgctgctcaggctgaggcctctggggagggacagg 
aggcctccctgtctcccctctggtctgggctctgggagaatgaaacgaccccagcttctc 
tacggggcccgggaatggtctcacgaggttttccgatttggcccaagtggtttattcaca 
ttccttccaagctttgtgtgtgtgtacgtgtgtttattatgtgttatagggtatatgtgt 
gtgtattgttataggtgtgtgtgtattggagtgtgtattctgttacgtatgtgcatatgt 
gttataagatgtgtgtacatgtgtttattatgttgtagggtgtatgtgtgtgtattgtgt 
tatgggtgtgtgttataggtgtgtgtgtgttatcggggtgtgtgtgtattctgttacaga 
cacgggaccgtaggcagagacactagtgaatgggcgtcatggtctcgctgacagtgtcca 
gcgcgcggcgtccacctccttgcatcacacggagcgcatcgggtagcaggactttgacca 
gctgctggaggccctgcgggccgccctctccaaggacccaggcactggcttcgtgttcag 
ctgcctcagcggccagggccgtaccacaactgcgatggtggtggctgtcctggccttctg 
gcacatccaaggcttccccgaggtgggtgaggaggagctcgtgagtgtgcctgatgccaa 
gttcactaagggtgaatttcaggtgcctgtaatcccagctactcgggaggctgaggcggg 
agaatttcttgaacctaagaggcagaggttgcagtgagccgagatctcaccactacactc 
cagactgggtgacaggtagtaatgaaggtggtgcagctgctacccgatgggcaccgtgtg 
aagaaggaggtggacgcagcgctggacactgtcagcgagaccatgacgcccatgcactac 
cacctgcgggagatcatcatctgcacctaccgccaggcgaaggcagcgaaagaggcgcag 
gaaatgcggaggctgcagctgcggagcctgcagtacttggagcgctatgtctgcctgatt 
ctcttcaacgcgtacctccacctggagaaggccgactcctggcagaggcccttcagcacc 
tggatgcaggaggtggcatcgaaggctggcatctacgagatccttaacgagctgggcttc 
cccgagctggagagcggggaggaccagcccttctccaggctgcgctaccggtggcaggag 
cagagctgcagcctcgagccctctgcccccgaggacttgctgtagggggccttactccct 
gtccccccacccacagggccccacgcaggcctggggtgtctgaggtgctcttggctggga 
gcggccctgaggggtgctggccttgaaatgattcccccacttcctggagagactgagcgg 
agttgggagcctttttagaaagaactttttataggacagggagacagcacagccatccct 
tgcaaaccaccaaggtgtgtggctgacctccagggaggagcactcactggagtgctcaca 
aggtgcacactgctgtgtgtaccttgcagacaggccggcgttcagcctccaaggggctca 
ctcccccagttgccaaacactgtggatctctctgtcctcttctcccctctctcagattgg 
cctggcagcccctggcacagagcagacccggccactggtagctccccacttccttactcc 
tgctgctctgccattgccgctccccttcttgctgcccaagcactgccctcgggcgtctgg 
cagcctgaggtgggtggaggggacagtgttctggatagatctattatgtgaaaggcagct 
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Figure 1 (Cont'd) 



tcacccagttttctggactctcatgcccccatctccgacctgggagacttcaggaatgac 
aacctacccagcctggtggggctggcaggatggtggaggtttctcaaggagctggagact 
tcagggagcccctctcatggggaggaaagagcttccagggggcgaacgcagcacagagga 
agaggcctgctccacttgtctgggaacctgggcaggaggcacagaggaagccaaggcctg 
gagctgcaggtcccccggcatctctctctgtcccggcagcccaggatggcctggtgcccc 
cacctgctgcagcaggagccccaaggagtgctagctgagggtggttgctggggtggtcct 
catggacagtgaggtgtgcaagggtgcactgagggtggtgggaggggatcacctgggttc 
caggccatccttgctgagcatctttgagcctgccttccggtgggagcagaaaaggccaga 
ccctgctgagttagaggctgctgggatccactgtttccacacagcgggaaggctgctggg 
aacaggtggcagagaagtgccatgtttgcgttgagccttgcagctcttccagctggggac 
tggtgcttgctgaaacccaggagctgaacagtgaggaggctgtccaccttgcttggctca 
ctgggaccaggaaagcctgtctttggttaggctcgtgtacttctgcaggaaaaaaaaaaa 
aggatgtgtcattggtcatgatatttgaaaaggggaggaggccgaagttgttcccattta 
tccagtattggaaaatatttgacccccttggctgaattcttttgcagaactactgtgtgt 
ctgttcactaccttttcaggtttattgtttttatttttgcatgaattaagacgttttaat 
ttctttgcagacaaggtctagatgcggagtcagagatgggactgaatggggagggatcct 
ttgtgttctcatggttggctctgactttcagctgtgttgggaccactggctgatcacatc 
acctctctgcctcagtttccccatctgtaaaatgggagaataatacttgcctacctacct 
cacaggggtgttgtgaggattcatttgtgatttttttttttttttttgtacagagctttt 
aagcattaaaaacagctaaatgtgaaaaaaaatgg 

SEQ ID NO: 18. >AA553584_T1 # TY Transcript # LN 4577 # Source Gene: 

AA553584 # Encoded protein: AA553584JP1 

cggggygsgcaggtcccgtcgccggtgagcacgggctccctctcgcgtggcctcgccggg 
tccgcctggcctgcccacctccggagccacctctgcccccgcatgggctggcgaagttgg 
gaggagcgagctggagccagagcgcgcgccgggcgcgccccgtcgctgcctgactcggcg 
cccgcagttcgggcgcagcacgccggccgcaggagcacggatgccccccggagccgcggg 
ctggcaggtctggggtcctgaggctgctggcagactatgggtacaacggccagcacagcc 
cagcagacggtctcggcaggcaccccatttgagggcctacagggcagtggcacgatggac 
agtcggcactccgtcagcatccactccttccagagcactagcttgcataacagcaaggcc 
aagtccatcatccccaacaaggtggcccctgttgtgatcacgtacaactgcaaggaggag 
ttccagatccatgatgagctgctcaaggctcattacacgttgggccggctctcggacaac 
acccctgagcactacctggtgcaaggccgctacttcctggtgcgggatgtcactgagaag 
atggatgtgctgggcaccgtgggaagctgtggggcccccaacttccggcaggtgcagggt 
gggctcactgtgttcggcatgggacagcccagcctctcagggttcaggcgggtcctccag 
aaactccagaaggacggacatagggagtgtgtcatcttctgtgtgcgggaggaacctgtg 
cttttcctgcgtgcagatgaggactttgtgtcctacacacctcgagacaagcagaacctt 
catgagaacctccagggccttggacccggggtccgggtggagagcctggagctggccatc 
cggaaagagatccacgactttgcccagctgagcgagaacacataccatgtgtaccataac 
accgaggacctgtggggggagccccatgctgtggccatccatggtgaggacgacttgcat 
gtgacggaggaggtgtacaagcggcccctcttcctgcagcccacctacaggtaccaccgc 
ctgcccctgcccgagcaagggagtcccctggaggcccagttggacgcctttgtcagtgtt 
ctccgggagacccccagcctgctgcagctccgtgatgcccacgggcctcccccagccctc 
gtcttcagctgccagatgggcgtgggcaggaccaacctgggcatggtcctgggcaccctc 
atcctgcttcaccgcagtgggaccacctcccagccagaggctgcccccacgcaggccaag 
cccctgcctatggagcagttccaggtgatccagagctttctccgcatggtgccccaggga 
aggaggatggtggaagaggtggacagagccatcactgcctgtgccgagttgcatgacctg 
aaagaagtggtcttggaaaaccagaagaagttagaaggtatccgaccggagagcccagcc 
cagggaagcggcagccgacacagcgtctggcagagggcgctgtggagcctggagcgatac 
ttctacctgatcctgtttaactactaccttcatgagcagtacccgctggcctttgccctc 
agtttcagccgctggctgtgtgcccaccctgagctgtaccgcctgcccgtgacgctgagc 
tcagcaggccctgtggctccgagggacctcatcgccaggggctccctacgggaggacgat 
ctggtctccccggacgcgctcagcactgtcagagagatggatgtggccaacttccggcgg 
gtgccccgcatgcccatctacggcacggcccagcccagcgccaaggccctggggagcatc 
ctggcctacctgacggacgccaagaggaggctgcggaaggttgtctgggtgagccttcgg 
gaggaggccgtgttggagtgtgacgggcacacctacagcctgcggtggcctgggccccct 
gtggctcctgaccagctggagaccctggaggcccagctgaaggcccatctaagcgagcct 
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Figure 1 (Cont'd) 



cccccaggcaaggagggccccctgacctacaggttccagacctgccttaccatgcaggag 
gtcttcagccagcaccgcagggcctgtcctggcctcacctaccaccgcatccccatgccg 
gacttctgtgccccccgagaggaggactttgaccagctgctggaggccctgcgggccgcc 
ctctccaaggacccaggcactggcttcgtgttcagctgcctcagcggccagggccgtacc 
acaactgcgatggtggtggctgtcctggccttctggcacatccaaggcttccccgaggtg 
ggtgaggaggagctcgtgagtgtgcctgatgccaagttcactaagggtgaatttcaggta 
gtaatgaaggtggtgcagctgctacccgatgggcaccgtgtgaagaaggaggtggacgca 
gcgctggacactgtcagcgagaccatgacgcccatgcactaccacctgcgggagatcatc 
atctgcacctaccgccaggcgaaggcagcgaaagaggcgcaggaaatgcggaggctgcag 
ctgcggagcctgcagtacttggagcgctatgtctgcctgattctcttcaacgcgtacctc 
cacctggagaaggccgactcctggcagaggcccttcagcacctggatgcaggaggtggca 
tcgaaggctggcatctacgagatccttaacgagctgggcttccccgagctggagagcggg 
gaggaccagcccttctccaggctgcgctaccggtggcaggagcagagctgcagcctcgag 
ccctctgcccccgaggacttgctgtagggggccttactccctgtccccccacccacaggg 
ccccacgcaggcctggggtgtctgaggtgctcttggctgggagcggccctgaggggtgct 
ggccttgaaatgattcccccacttcctggagagactgagcggagttgggagcctttttag 
aaagaactttttataggacagggagacagcacagccatcccttgcaaaccaccaaggtgt 
gtggctgacctccagggaggagcactcactggagtgctcacaaggtgcacactgctgtgt 
gtaccttgcagacaggccggcgttcagcctccaaggggctcactcccccagttgccaaac 
actgtggatctctctgtcctcttctcccctctctcagattggcctggcagcccctggcac 
agagcagacccggccactggtagctccccacttccttactcctgctgctctgccattgcc 
gctccccttcttgctgcccaagcactgccctcgggcgtctggcagcctgaggtgggtgga 
ggggacagtgttctggatagatctattatgtgaaaggcagcttcacccagttttctggac 
tctcatgcccccatctccgacctgggagacttcaggaatgacaacctacccagectggtg 
gggctggcaggatggtggaggtttctcaaggagctggagacttcagggagcccctctcat 
ggggaggaaagagcttccagggggcgaacgcagcacagaggaagaggcctgctccacttg 
tctgggaacctgggcaggaggcacagaggaagccaaggcctggagctgcaggtcccccgg 
catctctctctgtcccggcagcccaggatggcctggtgcccccacctgctgcagcaggag 
ccccaaggagtgctagctgagggtggttgctggggtggtcctcatggacagtgaggtgtg 
caagggtgcactgagggtggtgggaggggatcacctgggttccaggccatccttgctgag 
catctttgagcctgccttccggtgggagcagaaaaggccagaccctgctgagttagaggc 
tgctgggatccactgtttccacacagcgggaaggctgctgggaacaggtggcagagaagt 
gccatgtttgcgttgagccttgcagctcttccagctggggactggtgcttgctgaaaccc 
aggagctgaacagtgaggaggctgtccaccttgcttggctcactgggaccaggaaagcct 
gtctttggttaggctcgtgtacttctgcaggaaaaaaaaaaaaggatgtgtcattggtca 
tgatatttgaaaaggggaggaggccgaagttgttcccatttatccagtattggaaaatat 
ttgacccccttggctgaattcttttgcagaactactgtgtgtctgttcactaccttttca 
ggtttattgtttttatttttgcatgaattaagacgttttaatttctttgcagacaaggtc 
tagatgcggagtcagagatgggactgaatggggagggatcctttgtgttctcatggttgg 
ctctgactttcagctgtgttgggaccactggctgatcacatcacctctctgcctcagttt 
ccccatctgtaaaatgggagaataatacttgcctacctacctcacaggggtgttgtgagg 
attcatttgtgatttttttttttttttttgtacagagcttttaagcattaaaaacagcta 
aatgtgaaaaaaaatgg 

SEQ ID NO: 19. >AA553584_T2 # TY Transcript # LN 3393 # Source Gene: 

AA553584 # Encoded protein: AA553584_P2 

ctacgggaccccccccgcccgccgccgccccctcatgccaccaggaggggtggtaccggg 
ccggtatttcgcgaaagcgtcgacccgcgcgtcctgagccccgtcccagacanaggctgc 
ccccacgcaggccaagcccctgcctatggagcagttccaggtgatccagagctttctccg 
catggtgccccagggaaggaggatggtggaagaggtggacagagccatcactgcctgtgc 
cgagttgcatgacctgaaagaagtggtcttggaaaaccagaagaagttagaaggtatccg 
accggagagcccagcccagggaagcggcagccgacacagcgtctggcagagggcgctgtg 
gagcctggagcgatacttctacctgatcctgtttaactactaccttcatgagcagtaccc 
gctggcctttgccctcagtttcagccgctggctgtgtgcccaccctgagctgtaccgcct 
gcccgtgacgctgagctcagcaggccctgtggctccgagggacctcatcgccaggggctc 
cctacgggaggacgatctggtctccccggacgcgctcagcactgtcagagagatggatgt 
ggccaacttccggcgggtgccccgcatgcccatctacggcacggcccagcccagcgccaa 
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Figur 1 (Cont'd) 



ggccctggggagcatcctggcctacctgacggacgccaagaggaggctgcggaaggttgt 
ctgggtgagccttcgggaggaggccgtgttggagtgtgacgggcacacctacagcctgcg 
gtggcctgggccccctgtggctcctgaccagctggagaccctggaggcccagctgaaggc 
ccatctaagcgagcctcccccaggcaaggagggccccctgacctacaggttccagacctg 
ccttaccatgcaggaggtcttcagccagcaccgcagggcctgtcctggcctcacctacca 
ccgcatccccatgccggacttctgtgccccccgagaggaggactttgaccagctgctgga 
ggccctgcgggccgccctctccaaggacccaggcactggcttcgtgttcagctgcctcag 
cggccagggccgtaccacaactgcgatggtggtggctgtcctggccttctggcacatcca 
aggcttccccgaggtgggtgaggaggagctcgtgagtgtgcctgatgccaagttcactaa 
gggtgaatttcaggtagtaatgaaggtggtgcagctgctacccgatgggcaccgtgtgaa 
gaaggaggtggacgcagcgctggacactgtcagcgagaccatgacgcccatgcactacca 
cctgcgggagatcatcatctgcacctaccgccaggcgaaggcagcgaaagaggcgcagga 
aatgcggaggctgcagctgcggagcctgcagtacttggagcgctatgtctgcctgattct 
cttcaacgcgtacctccacctggagaaggccgactcctggcagaggcccttcagcacctg 
gatgcaggaggtggcatcgaaggctggcatctacgagatccttaacgagctgggcttccc 
cgagctggagagcggggaggaccagcccttctccaggctgcgctaccggtggcaggagca 
gagctgcagcctcgagccctctgcccccgaggacttgctgtagggggccttactccctgt 
ccccccacccacagggccccacgcaggcctggggtgtctgaggtgctcttggctgggagc 
ggccctgaggggtgctggccttgaaatgattcccccacttcctggagagactgagcggag 
ttgggagcctttttagaaagaactttttataggacagggagacagcacagccatcccttg 
caaaccaccaaggtgtgtggctgacctccagggaggagcactcactggagtgctcacaag 
gtgcacactgctgtgtgtaccttgcagacaggccggcgttcagcctccaaggggctcact 
cccccagttgccaaacactgtggatctctctgtcctcttctcccctctctcagattggcc 
tggcagcccctggcacagagcagacccggccactggtagctccccacttccttactcctg 
ctgctctgccattgccgctccccttcttgctgcccaagcactgccctcgggcgtctggca 
gcctgaggtgggtggaggggacagtgttctggatagatctattatgtgaaaggcagcttc 
acccagttttctggactctcatgcccccatctccgacctgggagacttcaggaatgacaa 
cctacccagcctggtggggctggcaggatggtggaggtttctcaaggagctggagacttc 
agggagcccctctcatggggaggaaagagcttccagggggcgaacgcagcacagaggaag 
aggcctgctccacttgtctgggaacctgggcaggaggcacagaggaagccaaggcctgga 
gctgcaggtcccccggcatctctctctgtcccggcagcccaggatggcctggtgccccca 
cctgctgcagcaggagccccaaggagtgctagctgagggtggttgctggggtggtcctca 
tggacagtgaggtgtgcaagggtgcactgagggtggtgggaggggatcacctgggttcca 
ggccatccttgctgagcatctttgagcctgccttccggtgggagcagaaaaggccagacc 
ctgctgagttagaggctgctgggatccactgtttccacacagcgggaaggctgctgggaa 
caggtggcagagaagtgccatgtttgcgttgagccttgcagctcttccagctggggactg 
gtgcttgctgaaacccaggagctgaacagtgaggaggctgtccaccttgcttggctcact 
gggaccaggaaagcctgtctttggttaggctcgtgtacttctgcaggaaaaaaaaaaaag 
gatgtgtcattggtcatgatatttgaaaaggggaggaggccgaagttgttcccatttatc 
cagtattggaaaatatttgacccccttggctgaattcttttgcagaactactgtgtgtct 
gttcactaccttttcaggtttattgtttttatttttgcatgaattaagacgttttaattt 
ctttgcagacaaggtctagatgcggagtcagagatgggactgaatggggagggatccttt 
gtgttctcatggttggctctgactttcagctgtgttgggaccactggctgatcacatcac 
ctctctgcctcagtttccccatctgtaaaatgggagaataatacttgcctacctacctca 
caggggtgttgtgaggattcatttgtgatttttttttttttttttgtacagagcttttaa 
gcattaaaaacagctaaatgtgaaaaaaaatgg 

SEQ ID NO: 20. >AA553584_T3 # TY Transcript # LN 2602 # Source Gene: 

AA553584 # Encoded protein: AA553584_P3 

cggggygsgcaggtcccgtcgccggtgagcacgggctccctctcgcgtggcctcgccggg 
tccgcctggcctgcccacctccggagccacctctgcccccgcatgggctggcgaagttgg 
gaggagcgagctggagccagagcgcgcgccgggcgcgccccgtcgctgcctgactcggcg 
cccgcagttcgggcgcagcacgccggccgcaggagcacggatgccccccggagccgcggg 
ctggcaggtctggggtcctgaggctgctggcagactatgggtacaacggccagcacagcc 
cagcagacggtctcggcaggcaccccatttgagggcctacagggcagtggcacgatggac 
agtcggcactccgtcagcatccactccttccagagcactagcttgcataacagcaaggcc 
aagtccatcatccccaacaaggtggcccctgttgtgatcacgtacaactgcaaggaggag 



12/365 



Figure 1 (Cont'd) 



ttccagatccatgatgagctgctcaaggctcattacacgttgggccggctctcggacaac 
acccctgagcactacctggtgcaaggccgctacttcctggtgcgggatgtcactgagaag 
atggatgtgctgggcaccgtgggaagctgtggggcccccaacttccggcaggtgcagggt 
gggctcactgtgttcggcatgggacagcccagcctctcagggttcaggcgggtcctccag 
aaactccagaaggacggacatagggagtgtgtcatcttctgtgtgcgggaggaacctgtg 
cttttcctgcgtgcagatgaggactttgtgtcctacacacctcgagacaagcagaacctt 
catgagaacctccagggccttggacccggggtccgggtggagagcctggagctggccatc 
cggaaagagatccacgactttgcccagctgagcgagaacacataccatgtgtaccataac 
accgaggacctgtggggggagccccatgctgtggccatccatggtgaggacgacttgcat 
gtgacggaggaggtgtacaagcggcccctcttcctgcagcccacctacaggtaccaccgc 
ctgcccctgcccgagcaagggagtcccctggaggcccagttggacgcctttgtcagtgtt 
ctccgggagacccccagcctgctgcagctccgtgatgcccacgggcctcccccagccctc 
gtcttcagctgccagatgggcgtgggcaggaccaacctgggcatggtcctgggcaccctc 
atcctgcttcaccgcagtgggaccacctcccagccagaggctgcccccacgcaggccaag 
cccctgcctatggagcagttccaggtgatccagagctttctccgcatggtgccccaggga 
aggaggatggtggaagagtggatagatctattatgtgaaaggcagcttcacccagttttc 
tggactctcatgcccccatctccgacctgggagacttcaggaatgacaacctacccagcc 
tggtggggctggcaggatggtggaggtttctcaaggagctggagacttcagggagcccct 
ctcatggggaggaaagagcttccagggggcgaacgcagcacagaggaagaggcctgctcc 
acttgtctgggaacctgggcaggaggcacagaggaagccaaggcctggagctgcaggtcc 
cccggcatctctctctgtcccggcagcccaggatggcctggtgcccccacctgctgcagc 
aggagccccaaggagtgctagctgagggtggttgctggggtggtcctcatggacagtgag 
gtgtgcaagggtgcactgagggtggtgggaggggatcacctgggttccaggccatccttg 
ctgagcatctttgagcctgccttccggtgggagcagaaaaggccagaccctgctgagtta 
gaggctgctgggatccactgtttccacacagcgggaaggctgctgggaacaggtggcaga 
gaagtgccatgtttgcgttgagccttgcagctcttccagctggggactggtgcttgctga 
aacccaggagctgaacagtgaggaggctgtccaccttgcttggctcactgggaccaggaa 
agcctgtctttggttaggctcgtgtacttctgcaggaaaaaaaaaaaaggatgtgtcatt 
ggtcatgatatttgaaaaggggaggaggccgaagttgttcccatttatccagtattggaa 
aatatttgacccccttggctgaattcttttgcagaactactgtgtgtctgttcactacct 
tttcaggtttattgtttttatttttgcatgaattaagacgttttaatttctttgcagaca 
aggtctagatgcggagtcagagatgggactgaatggggagggatcctttgtgttctcatg 
gttggctctgactttcagctgtgttgggaccactggctgatcacatcacctctctgcctc 
agtttccccatctgtaaaatgggagaataatacttgcctacctacctcacaggggtgttg 
tgaggattcatttgtgatttttttttttttttttgtacagagcttttaagcattaaaaac 
agctaaatgtgaaaaaaaatgg 

SEQ ID NO: 21. >AA553584_T4 # TY Transcript # LN 2432 # Source Gene: 

AA553584 # Encoded protein: AA553584_P4 

acacgggaccgtaggcagagacactagtgaatgggcgtcatggtctcgctgacagtgtcc 
agcgcgcggcgtccacctccttgcatcacacggagcgcatcgggtagcagctgcctcagc 
ggccagggccgtaccacaactgcgatggtggtggctgtcctggccttctggcacatccaa 
ggcttccccgaggtgggtgaggaggagctcgtgagtgtgcctgatgccaagttcactaag 
ggtgaatttcaggtagtaatgaaggtggtgcagctgctacccgatgggcaccgtgtgaag 
aaggaggtggacgcagcgctggacactgtcagcgagaccatgacgcccatgcactaccac 
ctgcgggagatcatcatctgcacctaccgccaggcgaaggcagcgaaagaggcgcaggaa 
atgcggaggctgcagctgcggagcctgcagtacttggagcgctatgtctgcctgattctc 
ttcaacgcgtacctccacctggagaaggccgactcctggcagaggcccttcagcacctgg 
atgcaggaggtggcatcgaaggctggcatctacgagatccttaacgagctgggcttcccc 
gagctggagagcggggaggaccagcccttctccaggctgcgctaccggtggcaggagcag 
agctgcagcctcgagccctctgcccccgaggacttgctgtagggggccttactccctgtc 
cccccacccacagggccccacgcaggcctggggtgtctgaggtgctcttggctgggagcg 
gccctgaggggtgctggccttgaaatgattcccccacttcctggagagactgagcggagt 
tgggagcctttttagaaagaactttttataggacagggagacagcacagccatcccttgc 
aaaccaccaaggtgtgtggctgacctccagggaggagcactcactggagtgctcacaagg 
tgcacactgctgtgtgtaccttgcagacaggccggcgttcagcctccaaggggctcactc 
ccccagttgccaaacactgtggatctctctgtcctcttctcccctctctcagattggcct 
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Figur 1 (Conf d) 



ggcagcccctggcacagagcagacccggccactggtagctccccacttccttactcctgc 
tgctctgccattgccgctccccttcttgctgcccaagcactgccctcgggcgtctggcag 
cctgaggtgggtggaggggacagtgttctggatagatctattatgtgaaaggcagcttca 
cccagttttctggactctcatgcccccatctccgacctgggagacttcaggaatgacaac 
ctacccagcctggtggggctggcaggatggtggaggtttctcaaggagctggagacttca 
gggagcccctctcatggggaggaaagagcttccagggggcgaacgcagcacagaggaaga 
ggcctgctccacttgtctgggaacctgggcaggaggcacagaggaagccaaggcctggag 
ctgcaggtcccccggcatctctctctgtcccggcagcccaggatggcctggtgcccccac 
ctgctgcagcaggagccccaaggagtgctagctgagggtggttgctggggtggtcctcat 
ggacagtgaggtgtgcaagggtgcactgagggtggtgggaggggatcacctgggttccag 
gccatccttgctgagcatctttgagcctgccttccggtgggagcagaaaaggccagaccc 
tgctgagttagaggctgctgggatccactgtttccacacagcgggaaggctgctgggaac 
aggtggcagagaagtgccatgtttgcgttgagccttgcagctcttccagctggggactgg 
tgcttgctgaaacccaggagctgaacagtgaggaggctgtccaccttgcttggctcactg 
ggaccaggaaagcctgtctttggttaggctcgtgtacttctgcaggaaaaaaaaaaaagg 
atgtgtcattggtcatgatatttgaaaaggggaggaggccgaagttgttcccatttatcc 
agtattggaaaatatttgacccccttggctgaattcttttgcagaactactgtgtgtctg 
ttcactaccttttcaggtttattgtttttatttttgcatgaattaagacgttttaatttc 
tttgcagacaaggtctagatgcggagtcagagatgggactgaatggggagggatcctttg 
tgttctcatggttggctctgactttcagctgtgttgggaccactggctgatcacatcacc 
tctctgcctcagtttccccatctgtaaaatgggagaataatacttgcctacctacctcac 
aggggtgttgtgaggattcatttgtgatttttttttttttttttgtacagagcttttaag 
cattaaaaacagctaaatgtgaaaaaaaatgg 

SEQ ID NO: 22. >AA553584_T5 # TY Transcript # LN 2698 # Source Gene: 

AA553584 # Encoded protein: AA553584_P5 

cggggygsgcaggtcccgtcgccggtgagcacgggctccctctcgcgtggcctcgccggg 
tccgcctggcctgcccacctccggagccacctctgcccccgcatgggctggcgaagttgg 
gaggagcgagctggagccagagcgcgcgccgggcgcgccccgtcgctgcctgactcggcg 
cccgcagttcgggcgcagcacgccggccgcaggagcacggatgccccccggagccgcggg 
ctggcaggtctggggtcctgaggctgctggcagactatgggtacaacggccagcacagcc 
cagcagacggtctcggcaggcaccccatttgagggcctacagggcagtggcacgatggac 
agtcggcactccgtcagcatccactccttccagagcactagcttgcataacagcaaggcc 
aagtccatcatccccaacaaggtggcccctgttgtgatcacgtacaactgcaaggaggag 
ttccagatccatgatgagctgctcaaggctcattacacgttgggccggctctcggacaac 
acccctgagcactacctggtgcaaggccgctacttcctggtgcgggatgtcactgagaag 
atggatgtgctgggcaccgtgggaagctgtggggcccccaacttccggcaggtgcagggt 
gggctcactgtgttcggcatgggacagcccagcctctcagggttcaggcgggtcctccag 
aaactccagaaggacggacatagggagtgtgtcatcttctgtgtgcgggaggaacctgtg 
cttttcctgcgtgcagatgaggactttgtgtcctacacacctcgagacaagcagaacctt 
catgagaacctccagggccttggacccggggtccgggtggagagcctggagctggccatc 
cggaaagagatccacgactttgcccagctgagcgagaacacataccatgtgtaccataac 
accgaggacctgtggggggagccccatgctgtggccatccatggtgaggacgacttgcat 
gtgacggaggaggtgtacaagcggcccctcttcctgcagcccacctacaggtaccaccgc 
ctgcccctgcccgagcaagggagtcccctggaggcccagttggacgcctttgtcagtgtt 
ctccgggagacccccagcctgctgcagctccgtgatgcccacgggcctcccccagccctc 
gtcttcagctgccagatgggcgtgggcaggaccaacctgggcatggtcctgggcaccctc 
atcctgcttcaccgcagtgggaccacctcccagccagaggctgcccccacgcaggccaag 
cccctgcctatggagcagttccaggtgatccagagctttctccgcatggtgccccaggga 
aggaggatggtggaagaggtggacagagccatcactgcctgtgccgagttgcatgacctg 
aaagaagtggtcttggaaaaccagaagaagttagaaggtatccgaccggagagcccagcc 
cagggaagcggcagccgacacagcgtctggcagagggcgctgtggagcctggagcgatac 
ttctacctgatcctgtttaactactaccttcatgagcagtacccgctggcctttgccctc 
agtttcagccgctggctgtgtgcccaccctgagctgtaccgcctgcccgtgacgctgagc 
tcagcaggccctgtggctccgagggacctcatcgccaggggctccctacgggaggacgat 
ctggtctccccggacgcgctcagcactgtcagagagatggatgtggccaacttccggcgg 
gtgccccgcatgcccatctacggcacggcccagcccagcgccaaggccctggggagcatc 
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ctggcctacctgacggacgccaagaggaggctgcggaaggttgtctgggtgagccttcgg 
gaggaggccgtgttggagtgtgacgggcacacctacagcctgcggtggcctgggccccct 
gtggctcctgaccagctggagaccctggaggcccagctgaaggcccatctaagcgagcct 
cccccaggcaaggagggccccctgacctacaggttccagacctgccttaccatgcaggag 
gtcttcagccagcaccgcagggcctgtcctggcctcacctaccaccgcatccccatgccg 
gacttctgtgccccccgagaggaggtgagggtgctgctcaggctgaggcctctggggagg 
gacaggaggcctccctgtctcccctctggtctgggctctgggagaatgaaacgaccccag 
cttctctacggggcccgggaatggtctcacgaggttttccgatttggcccaagtggttta 
ttcacattccttccaagctttgtgtgtgtgtacgtgtgtttattatgtgttatagggtat 
atgtgtgtgtattgttataggtgtgtgtgtattggagtgtgtattctgttacgtatgtgc 
atatgtgttataagatgtgtgtacatgtgtttattatgttgtagggtgtatgtgtgtgta 
ttgtgttatgggtgtgtgttataggtgtgtgtgtgttatcggggtgtgtgtgtattctgt 
tacaggtgcctgtaatcccagctactcgggaggctgaggcgggagaatttcttgaaccta 
agaggcagaggttgcagtgagccgagatctcaccactacactccagactgggtgacag 

SEQ ID NO: 23. >AA553584_P1 # TY Protein # CC #LN 856 # Source Gene: 

AA553584 # Encoding Transcript: 1 

MGTTASTAQQTVSAGTPFEGLQGSGTMDSRHSVSIHSFQSTSLHNSKAKSIIPNKVAPW 
ITYNCKEEFQIHDELLKAHYTLGRLSDNTPEHYLVQGRYFLVRDVTEKMDVLGTVGSCGA 
PNFRQVQGGLTVFGMGQPSLSGFRRVLQKLQKDGHRECVIFCVREEPVLFLRADEDFVSY 
TPRDKQNLHENLQGLGPGVRVESLELAIRBCEIHDFAQLSENTYHVYHNTEDLWGEPHAVA 
IHGEDDLHVTEEVYKRPLFLQPTYRYHRLPLPEQGSPLEAQLDAFVSVLRETPSLLQLRD 
AHGPPPALVFSCQMGVGRTNLGMVLGTLILLHRSGTTSQPEAAPTQAKPLPMEQFQVIQS 
FLRMVPQGRRMVEEVDRAITACAELHDLKEWLENQKKLEGIRPESPAQGSGSRHSVWQR 
ALWSLERYFYLILFNYYLHEQYPLAFALSFSRWLCAHPELYRLPVTLSSAGPVAPRDLIA 
RGSLREDDLVSPDALSTVREMDVANFRRVPRMPIYGTAQPSAKALGSILAYLTDAKRRLR 
KWWVSLREEAVLECDGHTYSLRWPGPPVAPDQLETLEAQLKAHLSEPPPGKEGPLTYRF 
QTCLTMQEVFSQHRRACPGLTYHRIPMPDFCAPREEDFDQLLEALRAALSKDPGTGFVFS 
CLSGQGRTTTAMWAVLAFWHIQGFPEVGEEELVSVPDAKFTKGEFQWMKWQLLPDGH 
RVKKEV DAAL DT VS ETMT PMH YH LRE 1 1 ICT YRQAKAAKEAQEMRRLQLRSLQYLERYVC 
LILFNAYLHLEKADSWQRPFSTWMQEVASKAGIYEILNELGFPELESGEDQPFSRLRYRW 
QEQSCSLEPSAPEDLL 

SEQ ID NO: 24. >AA553584_P2 # TY Protein # CC #LN 505 # Source Gene: 

AA553584 # Encoding Transcript: 2 

MEQFQVIQSFLRMVPQGRRMVEEVDRAITACAELHDLKEWLENQKKLEGIRPESPAQGS 
GS RH S VWQ RALWS LER YF YL I LFNYYLHEQYPLAFALSFSRWLCAH PEL YRLPVTLS SAG 
PVAPRDLIARGSLREDDLVSPDALSTVREMDVANFRRVPRMPIYGTAQPSAKALGSILAY 
LTDAKRRLRKWWVSLREEAVLECDGHTYSLRWPGPPVAPDQLETLEAQLKAHLSEPPPG 
KEGPLTYRFQTCLTMQEVFSQHRRACPGLTYHRIPMPDFCAPREEDFDQLLEALRAALSK 
DPGTGFVFSCLSGQGRTTTAMWAVLAFWHIQGFPEVGEEELVSVPDAKFTKGEFQWMK 
WQLLPDGHRVKKEVDAALDTVSETMTPMHYHLREIIICTYRQAKAAKEAQEMRRLQLRS 
LQYLERYVCLILFNAYLHLEKADSWQRPFSTWMQEVASKAGIYEILNELGFPELESGEDQ 
PFSRLRYRWQEQSCSLEPSAPEDLL 

SEQ ID NO:25. >AA553584_P3 # TY Protein # CC #LN 494 # Source Gene: 

AA553584 # Encoding Transcript: 3 

MGTTASTAQQTVSAGTPFEGLQGSGTMDSRHSVSIHSFQSTSLHNSKAKSIIPNKVAPW 
ITYNCKEEFQIHDELLKAHYTLGRLSDNTPEHYLVQGRYFLVRDVTEKMDVLGTVGSCGA 
PNFRQVQGGLTVFGMGQPSLSGFRRVLQKLQKDGHRECVIFCVREEPVLFLRADEDFVSY 
TPRDKQNLHENLQGLGPGVRVESLELAIRKEIHDFAQLSENTYHVYHNTEDLWGEPHAVA 
IHGEDDLHVTEEVYKRPLFLQPTYRYHRLPLPEQGSPLEAQLDAFVSVLRETPSLLQLRD 
AHGPPPALVFSCQMGVGRTNLGMVLGTLILLHRSGTTSQPEAAPTQAKPLPMEQFQVIQS 
FLRMVPQGRRMVEEWIDLLCERQLHPVFWTLMPPSPTWETSGMTTYPAWWGWQDGGGFSR 
SWRLQGAPLMGRKELPGGERSTEEEACSTCLGTWAGGTEEAKAWSCRS PG I S LCPGS PGW 
PGAPTCCSRSPKEC 
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SEQ ID NO: 26. >AA553584_P4 ft TY Protein # CC #LN 223 # Source Gene: 

AA553584 # Encoding Transcript: 4 

MGVMVSLTVSSARRPPPCITRSASGSSCLSGQGRTTTAMWAVLAFWHIQGFPEVGEEEL 
VSVPDAKFTKGEFQWMKWQLLPDGHRVKKEVDAALDTVSETMTPMHYHLREIIICTYR 
QAKAAKEAQEMRRLQLRSLQYLERYVCLILFNAYLHLEKADSWQRPFSTWMQEVASKAGI 
YEILNELGFPELESGEDQPFSRLRYRWQEQSCSLEPSAPEDLL 

SEQ ID NO: 27. >AA553584_P5 # TY Protein # CC #LN 755 # Source Gene: 

AA553584 # Encoding Transcript: 5 

MGTTASTAQQTVSAGTPFEGLQGSGTMDSRHSVSIHSFQSTSLHNSKAKSIIPNKVAPW 
ITYNCKEEFQIHDELLKAHYTLGRLSDNTPEHYLVQGRYFLVRDVTEKMDVLGTVGSCGA 
PNFRQVQGGLTVFGMGQPSLSGFRRVLQKLQKDGHRECVIFCVREEPVLFLRADEDFVSY 
TPRDKQNLHENLQGLGPGVRVESLELAIRKEIHDFAQLSENTYHVYHNTEDLWGEPHAVA 
IHGEDDLHVTEEVYKRPLFLQPTYRYHRLPLPEQGSPLEAQLDAFVSVLRETPSLLQLRD 
AHGPPPALVFSCQMGVGRTNLGMVLGTLILLHRSGTTSQPEAAPTQAKPLPMEQFQVIQS 
FLRMV PQGRRMVE E V DRAI T ACAE LH DL KE WLENQKKLE G I RPE S P AQG S G S RH S VWQR 
7VLWSLERYFYLILFNYYLHEQYPLAFALSFSRWLCAHPELYRLPVTLSSAGPVAPRDLIA 
RGSLREDDLVSPDALSTVREMDVANFRRVPRMPIYGTAQPSAKALGSILAYLTDAKRRLR 
KWWVSLREEAVLECDGHTYSLRWPGPPVAPDQLETLEAQLKAHLSEPPPGKEGPLTYRF 
QTCLTMQEVFSQHRRACPGLTYHRIPMPDFCAPREEVRVLLRLRPLGRDRRPPCLPSGLG 
SGRMKRPQLLYGAREWSHEVFRFGPSGLFTFLPSFVCVYVCLLCVIGYMCVYCYRCVCIG 
VCILLRMCICVIRCVYMCLLCCRVYVCVLCYGCVL 



SEQ ID NO: 28. >D61791 # TY Consensus # Length 3383 # Number of exons 17 

tgcgaagtttggggagggaggagctcgggcggccggagctggtcgctgcgcgcgccgggg 

ggagggccggggctgcgagagaggaagctctttcgcggcgctacggcgttggcaccagtc 

tctagaaaagaagtcagctctggttcggagaagcagcggctggcgtgggccatccgggga 

atgggcgccctcgtgacctagtgttgcggggcaaaaagggtcttgccggcctcgctcgtg 

caggggcgtatctgggcgcctgagcgcggcgtgggagccttgggagccgccgcagcaggg 

ggcacacccggaaccggcctgagcgcccgggaccatgaacggggaggccatctgcagcgc 

cctgcccaccattccctaccacaaactcgccgacctgcgctacctgagccgcggcgcctc 

tggcactgtgtcgtccgcccgccacgcagactggcgcgtccaggtggccgtgaagcacct 

gcacatccacactccgctgctcgacagtgaaagaaaggatgtcttaagagaagctgaaat 

tttacacaaagctagatttagttacattcttccaattttgggaatttgcaatgagcctga 

atttttgggaatagttactgaatacatgccaaatggatcattaaatgaactcctacatag 

gaaaactgaatatcctgatgttgcttggccattgagatttcgcatcctgcatgaaattgc 

ccttggtgtaaattacctgcacaatatgactcctcctttacttcatcatgacttgaagac 

tcagaatatcttattggacaatgaatttcatgttaagattgcagattttggtttatcaaa 

gtggcgcatgatgtccctctcacagtcacgaagtagcaaatctgcaccagaaggagggac 

aattatctatatgccacctgaaaactatgaacctggacaaaaatcaagggccagtatcaa 

gcacgatatatatagctatgcagttatcacatgggaagtgttatccagaaaacagccttt 

tgaagatgtcaccaatcctttgcagataatgtatagtgtgtcacaaggacatcgacctgt 

tattaatgaagaaagtttgccatatgatatacctcaccgagcacgtatgatctctctaat 

agaaagtggatgggcacaaaatccagatgaaagaccatctttcttaaaatgtttaataga 

acttgaaccagttttgagaacatttgaagagataacttttcttgaagctgttattcagct 

aaagaaaacaaagacctagtttataccagatatttcatatttataagctcaatctctaga 

accatcctgcatgtaggtaattgttagatttcgaggatgaagctctgagacactgagaga 

aggtaaccaattaatatgagccaggattcaaagtctgtgtttctaaagttatttcccgtt 

ctacactgtcttttccttcagttatatgttatattctcaactctttatattttctttcca 

tgtattttgtacaaacatataaaatgtgttagattgtattttacttctataatttcctaa 

tcatctccagttaaagtgtatatatatttatgtattcattacagttacagagtgtttcaa 
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gtgccattcacctatgtgacaagaagaaaatggaattatctctgaacatacctgtaaatc 
atggtccacaagaggaatcatgtggatcctctcagctccatgaaaatagtggttctcctg 
aaacttcaaggtccctgccagctcctcaagacaatgattttttatctagaaaagctcaag 
actgttattttatgaagctgcatcactgtcctggaaatcacagttgggatagcaccattt 
ctggatctcaaagggctgcattctgtgatcacaagaccactccatgctcttcagcaataa 
taaatccactctcaactgcaggaaactcagaacgtctgcagcctggtatagcccagcagt 
ggatccagagcaaaagggaagacattgtgaaccaaatgacagaagcctgccttaaccagt 
cgctagatgcccttctgtccagggacttgatcatgaaagaggactatgaacttgttagta 
ccaagcctacaaggacctcaaaagtcagacaattactagacactactgacatccaaggag 
aagaatttgccaaagttatagtacaaaaattgaaagataacaaacaaatgggtcttcagc 
cttacccggaaatacttgtggtttctagatcaccatctttaaatttacttcaaaataaaa 
gcatgtaagtgactgtttttcaagaagaaatgtgtttcataaaaggatatttatatctct 
gttgctttgactttttttatataaaatccgtgagtattaaagctttattgaaggttcttt 
gggtaaatattagtctccctccatgacactgcagtattttttttaattaatacaagtaaa 
aagtttgaattttgctacatagttcaatttttatgtctcttttgttaacagaaaccactt 
ttaaaggatagtaattattcttgtttataacagtgccttaaggtatgatgtatttctgat 
ggaagccattttcacattcatgttcttcatggattatttgttacttgtctaagatgcaat 
ttgattttatgaagtatataccctttacccaccagagacagtacagaatccctgccctaa 
aatcccaggcttaattgccctacaaagggttattaatttaaaactccattattaggatta 
cattttaaagttttatttatgaattccctttaaaaatgatatttcaaaggtaaaacaata 
caatataaagaaaaaaataaatatattaataccggcttcctgtccccatttttaacctca 
gccttccctactgtcaccaacaaccaagctaaataaagtcaacagcctgatgtgtatctt 
tctgtccctttcttcctgcttatatttagaacatatgctcatttgagaaaggatctttct 
gcatattattattataattatacatcatactgcaacctgctttttgcatttaatagtaca 
gccttccagatcagatatggctctaacttactctttttacttgctgcctaatacccacag 
tttttattttcttaacttgtccatatcaaatttagcccaaacaataaaagtacggtgtga 
gcacttttttggttaataggttatattactataggagaacttacaaggaataatagccct 
tttctgaaactgcgtagcgaagtcagcagaaagtgagatacagaataaatttgaaaagat 
gaactttcaagaagcaacagacccacaacctgaaaagtttgaataaaagaaattagatgc 
cattgaggtaattaaatattaat 

SEQ ID NO: 29. >D61791_T1 # TY Transcript # LN 2003 # Source Gene: D61791 # 
Encoded protein: D61791_P1 

tgcgaagtttggggagggaggagctcgggcggccggagctggtcgctgcgcgcgccgggg 
ggagggccggggctgcgagagaggaagctctttcgcggcgctacggcgttggcaccagtc 
tctagaaaagaagtcagctctggttcggagaagcagcggctggcgtgggccatccgggga 
atgggcgccctcgtgacctagtgttgcggggcaaaaagggtcttgccggcctcgctcgtg 
caggggcgtatctgggcgcctgagcgcggcgtgggagccttgggagccgccgcagcaggg 
ggcacacccggaaccggcctgagcgcccgggaccatgaacggggaggccatctgcagcgc 
cctgcccaccattccctaccacaaactcgccgacctgcgctacctgagccgcggcgcctc 
tggcactgtgtcgtccgcccgccacgcagactggcgcgtccaggtggccgtgaagcacct 
gcacatccacactccgctgctcgacagtgaaagaaaggatgtcttaagagaagctgaaat 
tttacacaaagctagatttagttacattcttccaattttgggaatttgcaatgagcctga 
atttttgggaatagttactgaatacatgccaaatggatcattaaatgaactcctacatag 
gaaaactgaatatcctgatgttgcttggccattgagatttcgcatcctgcatgaaattgc 
ccttggtgtaaattacctgcacaatatgactcctcctttacttcatcatgacttgaagac 
tcagaatatcttattggacaatgaatttcatgttaagattgcagattttggtttatcaaa 
gtggcgcatgatgtccctctcacagtcacgaagtagcaaatctgcaccagaaggagggac 
aattatctatatgccacctgaaaactatgaacctggacaaaaatcaagggccagtatcaa 
gcacgatatatatagctatgcagttatcacatgggaagtgttatccagaaaacagccttt 
tgaagatgtcaccaatcctttgcagataatgtatagtgtgtcacaaggacatcgacctgt 
tattaatgaagaaagtttgccatatgatatacctcaccgagcacgtatgatctctctaat 
agaaagtggatgggcacaaaatccagatgaaagaccatctttcttaaaatgtttaataga 
acttgaaccagttttgagaacatttgaagagataacttttcttgaagctgttattcagct 
aaagaaaacaaagttacagagtgtttcaagtgccattcacctatgtgacaagaagaaaat 
ggaattatctctgaacatacctgtaaatcatggtccacaagaggaatcatgtggatcctc 
tcagctccatgaaaatagtggttctcctgaaacttcaaggtccctgccagctcctcaaga 
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caatgattttttatctagaaaagctcaagactgttattttatgaagctgcatcactgtcc 
tggaaatcacagttgggatagcaccatttctggatctcaaagggctgcattctgtgatca 
caagaccactccatgctcttcagcaataataaatccactctcaactgcaggaaactcaga 
acgtctgcagcctggtatagcccagcagtggatccagagcaaaagggaagacattgtgaa 
ccaaatgacagaagcctgccttaaccagtcgctagatgcccttctgtccagggacttgat 
catgaaagaggactatgaacttgttagtaccaagcctacaaggacctcaaaagtcagaca 
attactagacactactgacatccaaggagaagaatttgccaaagttatagtacaaaaatt 
gaaagataacaaacaaatgggtcttcagccttacccggaaatacttgtggtttctagatc 
accatctttaaatttacttcaaaataaaagcatgtaagtgactgtttttcaagaagaaat 
gtgtttcataaaaggatatttat 

SEQ ID NO:30. >D61791_T2 # TY Transcript # LN 3052 # Source Gene: D61791 # 
Encoded protein: D61791_P1 

tgcgaagtttggggagggaggagctcgggcggccggagctggtcgctgcgcgcgccgggg 
ggagggccggggctgcgagagaggaagctctttcgcggcgctacggcgttggcaccagtc 
tctagaaaagaagtcagctctggttcggagaagcagcggctggcgtgggccatccgggga 
atgggcgccctcgtgacctagtgttgcggggcaaaaagggtcttgccggcctcgctcgtg 
caggggcgtatctgggcgcctgagcgcggcgtgggagccttgggagccgccgcagcaggg 
ggcacacccggaaccggcctgagcgcccgggaccatgaacggggaggccatctgcagcgc 
cctgcccaccattccctaccacaaactcgccgacctgcgctacctgagccgcggcgcctc 
tggcactgtgtcgtccgcccgccacgcagactggcgcgtccaggtggccgtgaagcacct 
gcacatccacactccgctgctcgacagtgaaagaaaggatgtcttaagagaagctgaaat 
tttacacaaagctagatttagttacattcttccaattttgggaatttgcaatgagcctga 
atttttgggaatagttactgaatacatgccaaatggatcattaaatgaactcctacatag 
gaaaactgaatatcctgatgttgcttggccattgagatttcgcatcctgcatgaaattgc 
ccttggtgtaaattacctgcacaatatgactcctcctttacttcatcatgacttgaagac 
tcagaatatcttattggacaatgaatttcatgttaagattgcagattttggtttatcaaa 
gtggcgcatgatgtccctctcacagtcacgaagtagcaaatctgcaccagaaggagggac 
aattatctatatgccacctgaaaactatgaacctggacaaaaatcaagggccagtatcaa 
gcacgatatatatagctatgcagttatcacatgggaagtgttatccagaaaacagccttt 
tgaagatgtcaccaatcctttgcagataatgtatagtgtgtcacaaggacatcgacctgt 
tattaatgaagaaagtttgccatatgatatacctcaccgagcacgtatgatctctctaat 
agaaagtggatgggcacaaaatccagatgaaagaccatctttcttaaaatgtttaataga 
acttgaaccagttttgagaacatttgaagagataacttttcttgaagctgttattcagct 
aaagaaaacaaagttacagagtgtttcaagtgccattcacctatgtgacaagaagaaaat 
ggaattatctctgaacatacctgtaaatcatggtccacaagaggaatcatgtggatcctc 
tcagctccatgaaaatagtggttctcctgaaacttcaaggtccctgccagctcctcaaga 
caatgattttttatctagaaaagctcaagactgttattttatgaagctgcatcactgtcc 
tggaaatcacagttgggatagcaccatttctggatctcaaagggctgcattctgtgatca 
caagaccactccatgctcttcagcaataataaatccactctcaactgcaggaaactcaga 
acgtctgcagcctggtatagcccagcagtggatccagagcaaaagggaagacattgtgaa 
ccaaatgacagaagcctgccttaaccagtcgctagatgcccttctgtccagggacttgat 
catgaaagaggactatgaacttgttagtaccaagcctacaaggacctcaaaagtcagaca 
attactagacactactgacatccaaggagaagaatttgccaaagttatagtacaaaaatt 
gaaagataacaaacaaatgggtcttcagccttacccggaaatacttgtggtttctagatc 
accatctttaaatttacttcaaaataaaagcatgtaagtgactgtttttcaagaagaaat 
gtgtttcataaaaggatatttatatctctgttgctttgactttttttatataaaatccgt 
gagtattaaagctttattgaaggttctttgggtaaatattagtctccctccatgacactg 
cagtattttttttaattaatacaagtaaaaagtttgaattttgctacatagttcaatttt 
tatgtctcttttgttaacagaaaccacttttaaaggatagtaattattcttgtttataac 
agtgccttaaggtatgatgtatttctgatggaagccattttcacattcatgttcttcatg 
gattatttgttacttgtctaagatgcaatttgattttatgaagtatataccctttaccca 
ccagagacagtacagaatccctgccctaaaatcccaggcttaattgccctacaaagggtt 
attaatttaaaactccattattaggattacattttaaagttttatttatgaattcccttt 
aaaaatgatatttcaaaggtaaaacaatacaatataaagaaaaaaataaatatattaata 
ccggcttcctgtccccatttttaacctcagccttccctactgtcaccaacaaccaagcta 
aataaagtcaacagcctgatgtgtatctttctgtccctttcttcctgcttatatttagaa 
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catatgctcatttgagaaaggatctttctgcatattattattataattatacatcatact 
gcaacctgctttttgcatttaatagtacagccttccagatcagatatggctctaacttac 
tctttttacttgctgcctaatacccacagtttttattttcttaacttgtccatatcaaat 
ttagcccaaacaataaaagtacggtgtgagcacttttttggttaataggttatattacta 
taggagaacttacaaggaataatagcccttttctgaaactgcgtagcgaagtcagcagaa 
agtgagatacagaataaatttgaaaagatgaactttcaagaagcaacagacccacaacct 
gaaaagtttgaataaaagaaattagatgccattgaggtaattaaatattaat 

SEQ ID NO: 31. >D61791_T3 # TY Transcript # LN 2150 # Source Gene: D61791 # 
Encoded protein: D61791_P1 

tgcgaagtttggggagggaggagctcgggcggccggagctggtcgctgcgcgcgccgggg 
ggagggccggggctgcgagagaggaagctctttcgcggcgctacggcgttggcaccagtc 
tctagaaaagaagtcagctctggttcggagaagcagcggctggcgtgggccatccgggga 
atgggcgccctcgtgacctagtgttgcggggcaaaaagggtcttgccggcctcgctcgtg 
caggggcgtatctgggcgcctgagcgcggcgtgggagccttgggagccgccgcagcaggg 
ggcacacccggaaccggcctgagcgcccgggaccatgaacggggaggccatctgcagcgc 
cctgcccaccattccctaccacaaactcgccgacctgcgctacctgagccgcggcgcctc 
tggcactgtgtcgtccgcccgccacgcagactggcgcgtccaggtggccgtgaagcacct 
gcacatccacactccgctgctcgacagtgaaagaaaggatgtcttaagagaagctgaaat 
tttacacaaagctagatttagttacattcttccaattttgggaatttgcaatgagcctga 
atttttgggaatagttactgaatacatgccaaatggatcattaaatgaactcctacatag 
gaaaactgaatatcctgatgttgcttggccattgagatttcgcatcctgcatgaaattgc 
ccttggtgtaaattacctgcacaatatgactcctcctttacttcatcatgacttgaagac 
tcagaatatcttattggacaatgaatttcatgttaagattgcagattttggtttatcaaa 
gtggcgcatgatgtccctctcacagtcacgaagtagcaaatctgcaccagaaggagggac 
aattatctatatgccacctgaaaactatgaacctggacaaaaatcaagggccagtatcaa 
gcacgatatatatagctatgcagttatcacatgggaagtgttatccagaaaacagccttt 
tgaagatgtcaccaatcctttgcagataatgtatagtgtgtcacaaggacatcgacctgt 
tattaatgaagaaagtttgccatatgatatacctcaccgagcacgtatgatctctctaat 
agaaagtggatgggcacaaaatccagatgaaagaccatctttcttaaaatgtttaataga 
acttgaaccagttttgagaacatttgaagagataacttttcttgaagctgttattcagct 
aaagaaaacaaagttacagagtgtttcaagtgccattcacctatgtgacaagaagaaaat 
ggaattatctctgaacatacctgtaaatcatggtccacaagaggaatcatgtggatcctc 
tcagctccatgaaaatagtggttctcctgaaacttcaaggtccctgccagctcctcaaga 
caatgattttttatctagaaaagctcaagactgttattttatgaagctgcatcactgtcc 
tggaaatcacagttgggatagcaccatttctggatctcaaagggctgcattctgtgatca 
caagaccactccatgctcttcagcaataataaatccactctcaactgcaggaaactcaga 
acgtctgcagcctggtatagcccagcagtggatccagagcaaaagggaagacattgtgaa 
ccaaatgacagaagcctgccttaaccagtcgctagatgcccttctgtccagggacttgat 
catgaaagaggactatgaacttgttagtaccaagcctacaaggacctcaaaagtcagaca 
attactagacactactgacatccaaggagaagaatttgccaaagttatagtacaaaaatt 
gaaagataacaaacaaatgggtcttcagccttacccggaaatacttgtggtttctagatc 
accatctttaaatttacttcaaaataaaagcatgtaagtgactgtttttcaagaagaaat 
gtgtttcataaaaggatatttatatctctgttgctttgactttttttatataaaatccgt 
gagtattaaagctttattgaaggttctttgggtaaatattagtctccctccatgacactg 
cagtattttttttaattaatacaagtaaaaagtttgaattttgctacata 

SEQ ID NO: 32. >D61791_T4 # TY Transcript # LN 1996 # Source Gene: D61791 # 
Encoded protein: D61791_P2 

tgcgaagtttggggagggaggagctcgggcggccggagctggtcgctgcgcgcgccgggg 
ggagggccggggctgcgagagaggaagctctttcgcggcgctacggcgttggcaccagtc 
tctagaaaagaagtcagctctggttcggagaagcagcggctggcgtgggccatccgggga 
atgggcgccctcgtgacctagtgttgcggggcaaaaagggtcttgccggcctcgctcgtg 
caggggcgtatctgggcgcctgagcgcggcgtgggagccttgggagccgccgcagcaggg 
ggcacacccggaaccggcctgagcgcccgggaccatgaacggggaggccatctgcagcgc 
cctgcccaccattccctaccacaaactcgccgacctgcgctacctgagccgcggcgcctc 
tggcactgtgtcgtccgcccgccacgcagactggcgcgtccaggtggccgtgaagcacct 
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Figure 1 (Cont'd) 



gcacatccacactccgctgctcgacagaaaactgaatatcctgatgttgcttggccattg 
agatttcgcatcctgcatgaaattgcccttggtgtaaattacctgcacaatatgactcct 
cctttacttcatcatgacttgaagactcagaatatcttattggacaatgaatttcatgtt 
aagattgcagattttggtttatcaaagtggcgcatgatgtccctctcacagtcacgaagt 
agcaaatctgcaccagaaggagggacaattatctatatgccacctgaaaactatgaacct 
ggacaaaaatcaagggccagtatcaagcacgatatatatagctatgcagttatcacatgg 
gaagtgttatccagaaaacagccttttgaagatgtcaccaatcctttgcagataatgtat 
agtgtgtcacaaggacatcgacctgttattaatgaagaaagtttgccatatgatatacct 
caccgagcacgtatgatctctctaatagaaagtggatgggcacaaaatccagatgaaaga 
ccatctttcttaaaatgtttaatagaacttgaaccagttttgagaacatttgaagagata 
acttttcttgaagctgttattcagctaaagaaaacaaagttacagagtgtttcaagtgcc 
attcacctatgtgacaagaagaaaatggaattatctctgaacatacctgtaaatcatggt 
ccacaagaggaatcatgtggatcctctcagctccatgaaaatagtggttctcctgaaact 
tcaaggtccctgccagctcctcaagacaatgattttttatctagaaaagctcaagactgt 
tattttatgaagctgcatcactgtcctggaaatcacagttgggatagcaccatttctgga 
tctcaaagggctgcattctgtgatcacaagaccactccatgctcttcagcaataataaat 
ccactctcaactgcaggaaactcagaacgtctgcagcctggtatagcccagcagtggatc 
cagagcaaaagggaagacattgtgaaccaaatgacagaagcctgccttaaccagtcgcta 
gatgcccttctgtccagggacttgatcatgaaagaggactatgaacttgttagtaccaag 
cctacaaggacctcaaaagtcagacaattactagacactactgacatccaaggagaagaa 
tttgccaaagttatagtacaaaaattgaaagataacaaacaaatgggtcttcagccttac 
ccggaaatacttgtggtttctagatcaccatctttaaatttacttcaaaataaaagcatg 
taagtgactgtttttcaagaagaaatgtgtttcataaaaggatatttatatctctgttgc 
tttgactttttttatataaaatccgtgagtattaaagctttattgaaggttctttgggta 
aatattagtctccctccatgacactgcagtattttttttaattaatacaagtaaaaagtt 
tgaattttgctacata 

SEQ ID NO: 33. >D61791_T5 # TY Transcript # LN 2371 # Source Gene: D61791 # 
Encoded protein: D61791_P2 

tgcgaagtttggggagggaggagctcgggcggccggagctggtcgctgcgcgcgccgggg 
ggagggccggggctgcgagagaggaagctctttcgcggcgctacggcgttggcaccagtc 
tctagaaaagaagtcagctctggttcggagaagcagcggctggcgtgggccatccgggga 
atgggcgccctcgtgacctagtgttgcggggcaaaaagggtcttgccggcctcgctcgtg 
caggggcgtatctgggcgcctgagcgcggcgtgggagccttgggagccgccgcagcaggg 
ggcacacccggaaccggcctgagcgcccgggaccatgaacggggaggccatctgcagcgc 
cctgcccaccattccctaccacaaactcgccgacctgcgctacctgagccgcggcgcctc 
tggcactgtgtcgtccgcccgccacgcagactggcgcgtccaggtggccgtgaagcacct 
gcacatccacactccgctgctcgacagaaaactgaatatcctgatgttgcttggccattg 
agatttcgcatcctgcatgaaattgcccttggtgtaaattacctgcacaatatgactcct 
cctttacttcatcatgacttgaagactcagaatatcttattggacaatgaatttcatgtt 
aagattgcagattttggtttatcaaagtggcgcatgatgtccctctcacagtcacgaagt 
agcaaatctgcaccagaaggagggacaattatctatatgccacctgaaaactatgaacct 
ggacaaaaatcaagggccagtatcaagcacgatatatatagctatgcagttatcacatgg 
gaagtgttatccagaaaacagccttttgaagatgtcaccaatcctttgcagataatgtat 
agtgtgtcacaaggacatcgacctgttattaatgaagaaagtttgccatatgatatacct 
caccgagcacgtatgatctctctaatagaaagtggatgggcacaaaatccagatgaaaga 
ccatctttcttaaaatgtttaatagaacttgaaccagttttgagaacatttgaagagata 
acttttcttgaagctgttattcagctaaagaaaacaaagttacagagtgtttcaagtgcc 
attcacctatgtgacaagaagaaaatggaattatctctgaacatacctgtaaatcatggt 
ccacaagaggaatcatgtggatcctctcagctccatgaaaatagtggttctcctgaaact 
tcaaggtccctgccagctcctcaagacaatgattttttatctagaaaagctcaagactgt 
tattttatgaagctgcatcactgtcctggaaatcacagttgggatagcaccatttctgga 
tctcaaagggctgcattctgtgatcacaagaccactccatgctcttcagcaataataaat 
ccactctcaactgcaggaaactcagaacgtctgcagcctggtatagcccagcagtggatc 
cagagcaaaagggaagacattgtgaaccaaatgacagaagcctgccttaaccagtcgcta 
gatgcccttctgtccagggacttgatcatgaaagaggactatgaacttgttagtaccaag 
cctacaaggacctcaaaagtcagacaattactagacactactgacatccaaggagaagaa 
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Figur 1 (Confd) 



tttgccaaagttatagtacaaaaattgaaagataacaaacaaatgggtcttcagccttac 
ccggaaatacttgtggtttctagatcaccatctttaaatttacttcaaaataaaagcatg 
taagtgactgtttttcaagaagaaatgtgtttcataaaaggatatttatatctctgttgc 
tttgactttttttatataaaatccgtgagtattaaagctttattgaaggttctttgggta 
aatattagtctccctccatgacactgcagtattttttttaattaatacaagtaaaaagtt 
tgaattttgctacatagttcaatttttatgtctcttttgttaacagaaaccacttttaaa 
ggatagtaattattcttgtttataacagtgccttaaggtatgatgtatttctgatggaag 
ccattttcacattcatgttcttcatggattatttgttacttgtctaagatgcaatttgat 
tttatgaagtatataccctttacccaccagagacagtacagaatccctgccctaaaatcc 
caggcttaattgccctacaaagggttattaatttaaaactccattattaggattacattt 
taaagttttatttatgaattccctttaaaaatgatatttcaaaggtaaaacaatacaata 
taaagaaaaaaataaatatattaataccggc 

SEQ ID NO: 34 . >D61791_T6 # TY Transcript # LN 1911 # Source Gene: D61791 # 
Encoded protein: D61791_P2 

tgcgaagtttggggagggaggagctcgggcggccggagctggtcgctgcgcgcgccgggg 
ggagggccggggctgcgagagaggaagctctttcgcggcgctacggcgttggcaccagtc 
tctagaaaagaagtcagctctggttcggagaagcagcggctggcgtgggccatccgggga 
atgggcgccctcgtgacctagtgttgcggggcaaaaagggtcttgccggcctcgctcgtg 
caggggcgtatctgggcgcctgagcgcggcgtgggagccttgggagccgccgcagcaggg 
ggcacacccggaaccggcctgagcgcccgggaccatgaacggggaggccatctgcagcgc 
cctgcccaccattccctaccacaaactcgccgacctgcgctacctgagccgcggcgcctc 
tggcactgtgtcgtccgcccgccacgcagactggcgcgtccaggtggccgtgaagcacct 
gcacatccacactccgctgctcgacagaaaactgaatatcctgatgttgcttggccattg 
agatttcgcatcctgcatgaaattgcccttggtgtaaattacctgcacaatatgactcct 
cctttacttcatcatgacttgaagactcagaatatcttattggacaatgaatttcatgtt 
aagattgcagattttggtttatcaaagtggcgcatgatgtccctctcacagtcacgaagt 
agcaaatctgcaccagaaggagggacaattatctatatgccacctgaaaactatgaacct 
ggacaaaaatcaagggccagtatcaagcacgatatatatagctatgcagttatcacatgg 
gaagtgttatccagaaaacagccttttgaagatgtcaccaatcctttgcagataatgtat 
agtgtgtcacaaggacatcgacctgttattaatgaagaaagtttgccatatgatatacct 
caccgagcacgtatgatctctctaatagaaagtggatgggcacaaaatccagatgaaaga 
ccatctttcttaaaatgtttaatagaacttgaaccagttttgagaacatttgaagagata 
acttttcttgaagctgttattcagctaaagaaaacaaagttacagagtgtttcaagtgcc 
attcacctatgtgacaagaagaaaatggaattatctctgaacatacctgtaaatcatggt 
ccacaagaggaatcatgtggatcctctcagctccatgaaaatagtggttctcctgaaact 
tcaaggtccctgccagctcctcaagacaatgattttttatctagaaaagctcaagactgt 
tattttatgaagctgcatcactgtcctggaaatcacagttgggatagcaccatttctgga 
tctcaaagggctgcattctgtgatcacaagaccactccatgctcttcagcaataataaat 
ccactctcaactgcaggaaactcagaacgtctgcagcctggtatagcccagcagtggatc 
cagagcaaaagggaagacattgtgaaccaaatgacagaagcctgccttaaccagtcgcta 
gatgcccttctgtccagggacttgatcatgaaagaggactatgaacttgttagtaccaag 
cctacaaggacctcaaaagtcagacaattactagacactactgacatccaaggagaagaa 
tttgccaaagttatagtacaaaaattgaaagataacaaacaaatgggtcttcagccttac 
ccggaaatacttgtggtttctagatcaccatctttaaatttacttcaaaataaaagcatg 
taagtgactgtttttcaagaagaaatgtgtttcataaaaggatatttatatctctgttgc 
tttgactttttttatataaaatccgtgagtattaaagctttattgaaggtt 

SEQ ID NO: 35. >D61791_T7 # TY Transcript # LN 1863 # Source Gene: D61791 # 
Encoded protein: D61791_P3 

tgcgaagtttggggagggaggagctcgggcggccggagctggtcgctgcgcgcgccgggg 
ggagggccggggctgcgagagaggaagctctttcgcggcgctacggcgttggcaccagtc 
tctagaaaagaagtcagcttgtgtcgtccgcccgccacgcagactggcgcgtccaggtgg 
ccgtgaagcacctgcacatccacactccgctgctcgacagtgaaagaaaggatgtcttaa 
gagaagctgaaattttacacaaagctagatttagttacattcttccaattttgggaattt 
gcaatgagcctgaatttttgggaatagttactgaatacatgccaaatggatcattaaatg 
aactcctacataggaaaactgaatatcctgatgttgcttggccattgagatttcgcatcc 



21/365 



Figur 1 (Cont'd) 



tgcatgaaattgcccttggtgtaaattacctgcacaatatgactcctcctttacttcatc 
atgacttgaagactcagaatatcttattggacaatgaatttcatgttaagattgcagatt 
ttggtt.tatcaaagtggcgcatgatgtccctctcacagtcacgaagtagcaaatctgcac 
cagaaggagggacaattatctatatgccacctgaaaactatgaacctggacaaaaatcaa 
gggccagtatcaagcacgatatatatagctatgcagttatcacatgggaagtgttatcca 
gaaaacagccttttgaagatgtcaccaatcctttgcagataatgtatagtgtgtcacaag 
gacatcgacctgttattaatgaagaaagtttgccatatgatatacctcaccgagcacgta 
tgatctctctaatagaaagtggatgggcacaaaatccagatgaaagaccatctttcttaa 
aatgtttaatagaacttgaaccagttttgagaacatttgaagagataacttttcttgaag 
ctgttattcagctaaagaaaacaaagttacagagtgtttcaagtgccattcacctatgtg 
acaagaagaaaatggaattatctctgaacatacctgtaaatcatggtccacaagaggaat 
catgtggatcctctcagctccatgaaaatagtggttctcctgaaacttcaaggtccctgc 
cagctcctcaagacaatgattttttatctagaaaagctcaagactgttattttatgaagc 
tgcatcactgtcctggaaatcacagttgggatagcaccatttctggatctcaaagggctg 
cattctgtgatcacaagaccactccatgctcttcagcaataataaatccactctcaactg 
caggaaactcagaacgtctgcagcctggtatagcccagcagtggatccagagcaaaaggg 
aagacattgtgaaccaaatgacagaagcctgccttaaccagtcgctagatgcccttctgt 
ccagggacttgatcatgaaagaggactatgaacttgttagtaccaagcctacaaggacct 
caaaagtcagacaattactagacactactgacatccaaggagaagaatttgccaaagtta 
tagtacaaaaattgaaagataacaaacaaatgggtcttcagccttacccggaaatacttg 
tggtttctagatcaccatctttaaatttacttcaaaataaaagcatgtaagtgactgttt 
ttcaagaagaaatgtgtttcataaaaggatatttatatctctgttgctttgacttttttt 
atataaaatccgtgagtattaaagctttattgaaggttctttgggtaaatattagtctcc 
ctccatgacactgcagtattttttttaattaatacaagtaaaaagtttgaattttgctac 
ata 

SEQ ID NO: 36. >D61791_T8 # TY Transcript # LN 1208 # Source Gene: D61791 # 
Encoded protein: D61791_P4 

acctagtttataccagatatttcatatttataagctcaatctctagaaccatcctgcatg 
taggtaattgttagatttcgaggatgaagctctgagacactgagagaaggtaaccaatta 
atatgagccaggattcaaagtctgtgtttctaaagttatttcccgttctacactgtcttt 
tccttcagttatatgttatattctcaactctttatattttctttccatgtattttgtaca 
aacatataaaatgtgttagattgtattttacttctataatttcctaatcatctccagtta 
aagtgtatatatatttatgtattcattacagttacagagtgtttcaagtgccattcacct 
atgtgacaagaagaaaatggaattatctctgaacatacctgtaaatcatggtccacaaga 
ggaatcatgtggatcctctcagctccatgaaaatagtggttctcctgaaacttcaaggtc 
cctgccagctcctcaagacaatgattttttatctagaaaagctcaagactgttattttat 
gaagctgcatcactgtcctggaaatcacagttgggatagcaccatttctggatctcaaag 
ggctgcattctgtgatcacaagaccactccatgctcttcagcaataataaatccactctc 
aactgcaggaaactcagaacgtctgcagcctggtatagcccagcagtggatccagagcaa 
aagggaagacattgtgaaccaaatgacagaagcctgccttaaccagtcgctagatgccct 
tctgtccagggacttgatcatgaaagaggactatgaacttgttagtaccaagcctacaag 
gacctcaaaagtcagacaattactagacactactgacatccaaggagaagaatttgccaa 
agttatagtacaaaaattgaaagataacaaacaaatgggtcttcagccttacccggaaat 
acttgtggtttctagatcaccatctttaaatttacttcaaaataaaagcatgtaagtgac 
tgtttttcaagaagaaatgtgtttcataaaaggatatttatatctctgttgctttgactt 
tttttatataaaatccgtgagtattaaagctttattgaaggttctttgggtaaatattag 
tctccctccatgacactgcagtattttttttaattaatacaagtaaaaagtttgaatttt 
gctacata 

SEQ ID NO: 37. >D61791_P1 # TY Protein # CC #LN 540 # Source Gene: D61791 
# Encoding Transcript: 1 

MNGEAICSALPTIPYHKLADLRYLSRGASGTVSSARHADWRVQVAVKHLHIHTPLLDSER 
KDVLREAEILHKARFSYILPILGICNEPEFLGIVTEYMPNGSLNELLHRKTEYPDVAWPL 
RFRILHEIALGVNYLHNMTPPLLHHDLKTQNILLDNEFHVKIADFGLSKWRMMSLSQSRS 
SKSAPEGGTIIYMPPENYEPGQKSRASIKHDIYSYAVITWEVLSRKQPFEDVTNPLQIMY 
SVSQGHRPVINEESLPYDIPHRARMISLIESGWAQNPDERPSFLKCLIELEPVLRTFEEI 
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Figure 1 (Cont'd) 



TFLEAVIQLKKTKLQSVSSAIHLCDKKKMELSLNIPVNHGPQEESCGSSQLHENSGSPET 
SRSLPAPQDNDFLSRKAQDCYFMKLHHCPGNHSWDSTISGSQRAAFCDHKTTPCSSAIIN 
PLSTAGNSERLQPGIAQQWIQSKREDIVNQMTEACLNQSLDALLSRDLIMKEDYELVSTK 
PTRTSKVRQLLDTTDIQGEEFAKVIVQKLKDNKQMGLQPYPEILWSRSPSLNLLQNKSM 

SEQ ID NO: 38. >D61791_P2 # TY Protein # CC #LN 403 # Source Gene: D61791 

# Encoding Transcript: 4 

MTPPLLHHDLKTQNILLDNEFHVKIADFGLSKWR^5MSLSQSRSSKSAPEGGTIIYMPPEN 
YEPGQKSRASIKHDIYSYAVITWEVLSRKQPFEDVTNPLQIMYSVSQGHRPVINEESLPY 
DIPHRARMISLIESGWAQNPDERPSFLKCLIELEPVLRTFEEITFLEAVIQLKKTKLQSV 
SSAIHLCDECKKMELSLNIPVNHGPQEESCGSSQLHENSGSPETSRSLPAPQDNDFLSRKA 
QDCYFMKLHHCPGNHSWDSTISGSQRAAFCDHKTTPCSSAIINPLSTAGNSERLQPGIAQ 
QWIQSKREDIVNQMTEACLNQSLDALLSRDLIMKEDYELVSTKPTRTSKVRQLLDTTDIQ 
GEEFAKVIVQKLKDNKQMGLQPYPEILWSRSPSLNLLQNKSM 

SEQ ID NO: 39. >D61791_P3 # TY Protein # CC #LN 443 # Source Gene: D61791 

# Encoding Transcript: 7 

MPNGSLNELLHRKTEYPDVAWPLRFRILHEIALGVNYLHNMTPPLLHHDLKTQNILLDNE 
FHVKIADFGLSKWRMMSLSQSRSSKSAPEGGTI I YMPPENYEPGQKSRAS IKHDI YSYAV 
ITWEVLSRKQPFEDVTNPLQIMYSVSQGHRPVINEESLPYDIPHRARMISLIESGWAQNP 
DERPSFLKCLIELEPVLRTFEEITFLEAVIQLKKTKLQSVSSAIHLCDKKKMELSLNIPV 
NHGPQEESCGSSQLHENSGSPETSRSLPAPQDNDFLSRKAQDCYFMKLHHCPGNHSWDST 
ISGSQRAAFCDHKTTPCSSAIINPLSTAGNSERLQPGIAQQWIQSKREDIVNQMTEACLN 
QSLDALLSRDLIMKEDYELVSTKPTRTSKVRQLLDTTDIQGEEFAKVIVQKLKDNKQMGL 
QPYPEILWSRSPSLNLLQNKSM 

SEQ ID NO: 40. >D61791_P4 # TY Protein # CC #LN 232 # Source Gene: D61791 

# Encoding Transcript: 8 

MYSLQLQSVSSAIHLCDKKKMELSLNIPVNHGPQEESCGSSQLHENSGSPETSRSLPAPQ 
DNDFLSRKAQDCYFMKLHHCPGNHSWDSTISGSQRAAFCDHKTTPCSSAIINPLSTAGNS 
ERLQPGIAQQWIQSKREDIVNQMTEACLNQSLDALLSRDLIMKEDYELVSTKPTRTSKVR 
QLLDTTDIQGEE FAKVI VQKLKDNKQMGLQPYPEI LWSRS PSLNLLQNKSM 



SEQ ID NO: 41. >F02366 # TY Consensus # Length 3279 # Number of exons 27 

ggaacgccaaccgcctgggcctagcgcagcttcctccgcccaccacggaagtgaggcggg 

gatactaaagcgacggagcccggtggacggaagtgggtgttggaggctttaaggtagctt 

taaattcgtgttgtcctgggagctcgcccttttcggctggagtcgggctttacggcgccg 

gatggctctggacgtgaagtctcgggcaaagcgttatgagaagctggacttccttgggga 

gggacaggctcgttctcgttgggggaaaccgtccagacgcacttgctgcccattctttac 

atcctgggggtgaatccctgaggggcctctccttgctgaagagtagcctggagctggacg 

gagactgacccgccacgtttccagccgccgcgagtctgctcagaaactctgggctctttg 

cttcgcgaaatgtaaaaatgcaaaaaggcaaacacaaaaactcccataaacttatgttat 

ttctatttttctttctagtttgccaccgtttacaaggccagagataagaataccaaccaa 

attgtcgccattaagaaaggtcccactctgtcgcacaggctggagtgcagtggcgtgatc 

tcggctcactgcaacctccacctccccaggctcaagcagttctcctgccgcagcctctca 

actgctgacctcaactgatctgcccacctcggcctcccaaagtgctgagattacaggctt 

gagccaccacacccagcctgcaacttgttattctgagtcaacattatttccgaaatcctc 

tatatcagacaaggtttcactttgttgtccagtctggtgtcaaactcctggccttaagca 

atcctccagcatcggcctcccaaagagctgggattacaggagtctctgctggtagccact 

acagctgtgaatgtgctgtgtcatacctgaagccaatatgtctcggagtttcacctgagg 

cccatgatcaaacttggacatagatcagaagctaaagatggtataaatagaaccgcctta 

agagagataaaattattacaggagctaagtcatccaaatataattggtctccttgatgct 
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Figure 1 (Cont'd) 



tttggacataaatctaatattagccttgtctttgattttatggaaactgatctagagggg 
cttcctgaggaaacttagataacctcttttgagagtctcctgtattgatacttacatttt 
aagtctgtaaattgttttatgtggtagagtttatttgaaataataaagggtacctgtata 
ttgtatacttgctttacaggttataataaaggataatagtcttgtgctgacaccatcaca 
catcaaagcctacatgttgatgactcttcaaggattagaatatttacatcaacattggat 
cctacatagggatctgaaaccaaacaacttgttgctagatgaaaatggagttctaaaact 
ggcagattttggcctggccaaatcttttgggagccccaatagaggtttggtaaacccaag 
ggttgggtattcgggggccccccccggaggttaacctaattttgggaaggcttaaggaat 
gaaatgggtgtagggtggtgggaccatgtggggcttgttggcctgtaatatttagccaga 
gtttactttctaagggagcttatacacatcaggttgtaaccaggtggtatcgggcccccg 
agttactatttggagctaggatgtatggtgtaggtgtggacatgtgggctgttggctgta 
tattagcagagttacttctaagggttccttttttgccaggagattcagaccttgatcagc 
taacaagaatatttgaaactttgggcacaccaactgaggaacagtggccggacatgtgta 
gtcttccagattatgtgacatttaagagtttccctggaatacctttgcatcacatcttca 
gtgcagcaggagacgacttactagatctcatacaaggcttattcttatttaatccatgtg 
ctcgaattacggccacacaggcactga'aaatgaagtatttcagtaatcggccagggccaa 
cacctggatgtcagctgccaagaccaaactgtccagtggaaaccttaaaggagcaatcaa 
atccagctttggcaataaaaaggaaaagaacagaggccttagaacaaggtaagattccca 
cttttaaaagaaattagaggattgcccaagaaactaattttttaaagagaacactggaca 
acattttactactgagggaaatagccaaaaaggcaaataatggaaaaatagtaaacatta 
agtaaatgctgtagaagtgagtttgtaaatattctacacatgtaaaatatgtaaaactat 
gggttatttttattaaatgtattttaaaataaaaatttaattctggtttttctgattaga 
gtgcaaaagtgagaaaagttcaatactcttgaaatgtagaattgaaaatgcattagggaa 
aacttaataaaaattattaccagttaatctgaaaccaaacaacttgttgctaggtgaata 
ggcctattcacatgagggtggaaccaggtggtatcgtctccccggtttctatttgagcta 
gaattatagggttttaccagcccttggacactatgggaggagggcaagagtacaccaatt 
tgttaaaagcaagaaaccacagtgtctcttcactagtcatttagaacatggttatcatcc 
aagactactctaccctgcaacattgaactcccaagagcaaatccacattcctcttgagtt 
ctgcagcttctgtgtaaatagggcagctgtcgtctatgccgtagaatcacatgatctgag 
gaccactcatggaagctgctaaatagcctagtctggggagtcttccataaagttttgcat 
ggagcaacaaacaggatttaactaggtttggttccttcagccctctaaaagcatagggct 
tagcctgcaggcttccttggctttctctgtggtgtagaaattgatctttttggccggaga 
tcgaactgtcagtacagattttgacttgggccccattggggacgtggcggcatggtgtct 
tccgatttgtgaacttaagttccggaacctgggccccttccggcgggagatcgactcaag 
gggttcttatcggtcgctatgcccggcggagagatttttgtcgggccgttgttgtcgcct 
ctcggcttgcacaccgcgggaaaaaaagcgcggcgtctgcgcggggcgtgcctgtccgct 
ccgtctccgctctcctcttctcttttgcttttggggtgc 

SEQ ID NO: 42. >F02366_T1 # TY Transcript # LN 1383 # Source Gene: F02366 # 
Encoded protein: F02366_P1 

ggaacgccaaccgcctgggcctagcgcagcttcctccgcccaccacggaagtgaggcggg 
gatactaaagcgacggagcccggtggacggaagtgggtgttggaggctttaaggtagctt 
taaattcgtgttgtcctgggagctcgcccttttcggctggagtcgggctttacggcgccg 
gatggctctggacgtgaagtctcgggcaaagcgttatgagaagctggacttccttgggga 
gggacagtttgccaccgtttacaaggccagagataagaataccaaccaaattgtcgccat 
taagaaaatcaaacttggacatagatcagaagctaaagatggtataaatagaaccgcctt 
aagagagataaaattattacaggagctaagtcatccaaatataattggtctccttgatgc 
ttttggacataaatctaatattagccttgtctttgattttatggaaactgatctagaggt 
tataataaaggataatagtcttgtgctgacaccatcacacatcaaagcctacatgttgat 
gactcttcaaggattagaatatttacatcaacattggatcctacatagggatctgaaacc 
aaacaacttgttgctagatgaaaatggagttctaaaactggcagattttggcctggccaa 
atcttttgggagccccaatagagcttatacacatcaggttgtaaccaggtggtatcgggc 
ccccgagttactatttggagctaggatgtatggtgtaggtgtggacatgtgggctgttgg 
ctgtatattagcagagttacttctaagggttccttttttgccaggagattcagaccttga 
tcagctaacaagaatatttgaaactttgggcacaccaactgaggaacagtggccggacat 
gtgtagtcttccagattatgtgacatttaagagtttccctggaatacctttgcatcacat 
cttcagtgcagcaggagacgacttactagatctcatacaaggcttattcttatttaatcc 
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Figur 1 (Confd) 



atgtgctcgaattacggccacacaggcactgaaaatgaagtatttcagtaatcggccagg 
gccaacacctggatgtcagctgccaagaccaaactgtccagtggaaaccttaaaggagca 
atcaaatccagctttggcaataaaaaggaaaagaacagaggccttagaacaaggaggatt 
gcccaagaaactaattttttaaagagaacactggacaacattttactactgagggaaata 
gccaaaaaggcaaataatggaaaaatagtaaacattaagtaaatgctgtagaagtgagtt 
tgtaaatattctacacatgtaaaatatgtaaaactatgggttatttttattaaatgtatt 
tta 

SEQ ID NO: 43. >F02366_T2 # TY Transcript # LN 1503 S Source Gene: F02366 # 
Encoded protein: F02366_P1 

ggaacgccaaccgcctgggcctagcgcagcttcctccgcccaccacggaagtgaggcggg 
gatactaaagcgacggagcccggtggacggaagtgggtgttggaggctttaaggtagctt 
taaattcgtgttgtcctgggagctcgcccttttcggctggagtcgggctttacggcgccg 
gatggctctggacgtgaagtctcgggcaaagcgttatgagaagctggacttccttgggga 
gggacagtttgccaccgtttacaaggccagagataagaataccaaccaaattgtcgccat 
taagaaaatcaaacttggacatagatcagaagctaaagatggtataaatagaaccgcctt 
aagagagataaaattattacaggagctaagtcatccaaatataattggtctccttgatgc 
ttttggacataaatctaatattagccttgtctttgattttatggaaactgatctagaggt 
tataataaaggataatagtcttgtgctgacaccatcacacatcaaagcctacatgttgat 
gactcttcaaggattagaatatttacatcaacattggatcctacatagggatctgaaacc 
aaacaacttgttgctagatgaaaatggagttctaaaactggcagattttggcctggccaa 
atcttttgggagccccaatagagcttatacacatcaggttgtaaccaggtggtatcgggc 
ccccgagttactatttggagctaggatgtatggtgtaggtgtggacatgtgggctgttgg 
ctgtatattagcagagttacttctaagggttccttttttgccaggagattcagaccttga 
tcagctaacaagaatatttgaaactttgggcacaccaactgaggaacagtggccggacat 
gtgtagtcttccagattatgtgacatttaagagtttccctggaatacctttgcatcacat 
cttcagtgcagcaggagacgacttactagatctcatacaaggcttattcttatttaatcc 
atgtgctcgaattacggccacacaggcactgaaaatgaagtatttcagtaatcggccagg 
gccaacacctggatgtcagctgccaagaccaaactgtccagtggaaaccttaaaggagca 
atcaaatccagctttggcaataaaaaggaaaagaacagaggccttagaacaaggaggatt 
gcccaagaaactaattttttaaagagaacactggacaacattttactactgagggaaata 
gccaaaaaggcaaataatggaaaaatagtaaacattaagtaaatgctgtagaagtgagtt 
tgtaaatattctacacatgtaaaatatgtaaaactatgggttatttttattaaatgtatt 
ttaaaataaaaatttaattctggtttttctgattagagtgcaaaagtgagaaaagttcaa 
tactcttgaaatgtagaattgaaaatgcattagggaaaacttaataaaaattattaccag 
tta 

SEQ ID NO:44. >F02366_T3 # TY Transcript S LN 1221 # Source Gene: F02366 # 
Encoded protein: F02366_P2 

ggaacgccaaccgcctgggcctagcgcagcttcctccgcccaccacggaagtgaggcggg 
gatactaaagcgacggagcccggtggacggaagtgggtgttggaggctttaaggtagctt 
taaattcgtgttgtcctgggagctcgcccttttcggctggagtcgggctttacggcgccg 
gatggctctggacgtgaagtctcgggcaaagcgttatgagaagctggacttccttgggga 
gggacagtttgccaccgtttacaaggccagagataagaataccaaccaaattgtcgccat 
taagaaaatcaaacttggacatagatcagaagctaaagatggtataaatagaaccgcctt 
aagagagataaaattattacaggagctaagtcatccaaatataattggtctccttgatgc 
ttttggacataaatctaatattagccttgtctttgattttatggaaactgatctagaggt 
tataataaaggataatagtcttgtgctgacaccatcacacatcaaagcctacatgttgat 
gactcttcaaggattagaatatttacatcaacattggatcctacatagggatctgaaacc 
aaacaacttgttgctagatgaaaatggagttctaaaactggcagattttggcctggccaa 
atcttttgggagccccaatagagcttatacacatcaggttgtaaccaggtggtatcgggc 
ccccgagttactatttggagctaggatgtatggtgtaggtgtggacatgtgggctgttgg 
ctgtatattagcagagttacttctaagggttccttttttgccaggagattcagaccttga 
tcagctaacaagaatatttgaaactttgggcacaccaactgaggaacagtggccggacat 
gtgtagtcttccagattatgtgacatttaagagtttccctggaatacctttgcatcacat 
cttcagtgcagcaggagacgacttactagatctcatacaaggcttattcttatttaatcc 
atgtgctcgaattacggccacacaggcactgaaaatgaagtatttcagtaatcggccagg 
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Figure 1 (Cont'd) 



gccaacacctggatgtcagctgccaagaccaaactgtccagtggaaaccttaaaggagca 
atcaaatccagctttggcaataaaaaggaaaagaacagaggccttagaacaaggtaagat 
tcccacttttaaaagaaatta 

SEQ ID NO: 45. >F02366_T4 # TY Transcript # LN 1361 # Source Gene: F02366 # 
Encoded protein: F02366_P3 

ggaacgccaaccgcctgggcctagcgcagcttcctccgcccaccacggaagtgaggcggg 
gatactaaagcgacggagcccggtggacggaagtgggtgttggagtttgccaccgtttac 
aaggccagagataagaataccaaccaaattgtcgccattaagaaaatcaaacttggacat 
agatcagaagctaaagatggtataaatagaaccgccttaagagagataaaattattacag 
gagctaagtcatccaaatataattggtctccttgatgcttttggacataaatctaatatt 
agccttgtctttgattttatggaaactgatctagaggttataataaaggataatagtctt 
gtgctgacaccatcacacatcaaagcctacatgttgatgactcttcaaggattagaatat 
ttacatcaacattggatcctacatagggatctgaaaccaaacaacttgttgctagatgaa 
aatggagttctaaaactggcagattttggcctggccaaatcttttgggagccccaataga 
gcttatacacatcaggttgtaaccaggtggtatcgggcccccgagttactatttggagct 
aggatgtatggtgtaggtgtggacatgtgggctgttggctgtatattagcagagttactt 
ctaagggttccttttttgccaggagattcagaccttgatcagctaacaagaatatttgaa 
actttgggcacaccaactgaggaacagtggccggacatgtgtagtcttccagattatgtg 
acatttaagagtttccctggaatacctttgcatcacatcttcagtgcagcaggagacgac 
ttactagatctcatacaaggcttattcttatttaatccatgtgctcgaattacggccaca 
caggcactgaaaatgaagtatttcagtaatcggccagggccaacacctggatgtcagctg 
ccaagaccaaactgtccagtggaaaccttaaaggagcaatcaaatccagctttggcaata 
aaaaggaaaagaacagaggccttagaacaaggaggattgcccaagaaactaattttttaa 
agagaacactggacaacattttactactgagggaaatagccaaaaaggcaaataatggaa 
aaatagtaaacattaagtaaatgctgtagaagtgagtttgtaaatattctacacatgtaa 
aatatgtaaaactatgggttatttttattaaatgtattttaaaataaaaatttaattctg 
gtttttctgattagagtgcaaaagtgagaaaagttcaatactcttgaaatgtagaattga 
aaatgcattagggaaaacttaataaaaattattaccagtta 

SEQ ID NO: 46. >F02366_T5 # TY Transcript # LN 1507 # Source Gene: F02366 # 
Encoded protein: F02366_P3 

gctcgttctcgttgggggaaaccgtccagacgcacttgctgcccattctttacatcctgg 
gggtgaatccctgaggggcctctccttgctgaagagtagcctggagctggacggagactg 
acccgccacgtttccagccgccgcgagtctgctcagaaactctgggctctttgcttcgcg 
aaatgtaaaaatgcaaaaaggcaaacacaaaaactcccataaacttatgttatttctatt 
tttctttctagtttgccaccgtttacaaggccagagataagaataccaaccaaattgtcg 
ccattaagaaaatcaaacttggacatagatcagaagctaaagatggtataaatagaaccg 
ccttaagagagataaaattattacaggagctaagtcatccaaatataattggtctccttg 
atgcttttggacataaatctaatattagccttgtctttgattttatggaaactgatctag 
aggttataataaaggataatagtcttgtgctgacaccatcacacatcaaagcctacatgt 
tgatgactcttcaaggattagaatatttacatcaacattggatcctacatagggatctga 
aaccaaacaacttgttgctagatgaaaatggagttctaaaactggcagattttggcctgg 
ccaaatcttttgggagccccaatagagcttatacacatcaggttgtaaccaggtggtatc 
gggcccccgagttactatttggagctaggatgtatggtgtaggtgtggacatgtgggctg 
ttggctgtatattagcagagttacttctaagggttccttttttgccaggagattcagacc 
ttgatcagctaacaagaatatttgaaactttgggcacaccaactgaggaacagtggccgg 
acatgtgtagtcttccagattatgtgacatttaagagtttccctggaatacctttgcatc 
acatcttcagtgcagcaggagacgacttactagatctcatacaaggcttattcttattta 
atccatgtgctcgaattacggccacacaggcactgaaaatgaagtatttcagtaatcggc 
cagggccaacacctggatgtcagctgccaagaccaaactgtccagtggaaaccttaaagg 
agcaatcaaatccagctttggcaataaaaaggaaaagaacagaggccttagaacaaggag 
gattgcccaagaaactaattttttaaagagaacactggacaacattttactactgaggga 
aatagccaaaaaggcaaataatggaaaaatagtaaacattaagtaaatgctgtagaagtg 
agtttgtaaatattctacacatgtaaaatatgtaaaactatgggttatttttattaaatg 
tattttaaaataaaaatttaattctggtttttctgattagagtgcaaaagtgagaaaagt 
tcaatactcttgaaatgtagaattgaaaatgcattagggaaaacttaataaaaattatta 
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Figur 1 (Cont'd) 



ccagtta 

SEQ ID NO: 47. >F02366_T6 # TY Transcript # LN 1590 # Source Gene: F02366 # 
Encoded protein: F02366_P4 

ggaacgccaaccgcctgggcctagcgcagcttcctccgcccaccacggaagtgaggcggg 
gatactaaagcgacggagcccggtggacggaagtgggtgttggaggctttaaggtagctt 
taaattcgtgttgtcctgggagctcgcccttttcggctggagtcgggctttacggcgccg 
gatggctctggacgtgaagtctcgggcaaagcgttatgagaagctggacttccttgggga 
gggacagtttgccaccgtttacaaggccagagataagaataccaaccaaattgtcgccat 
taagaaagagtctctgctggtagccactacagctgtgaatgtgctgtgtcatacctgaag 
ccaatatgtctcggagtttcacctgaggcccatgatcaaacttggacatagatcagaagc 
taaagatggtataaatagaaccgccttaagagagataaaattattacaggagctaagtca 
tccaaatataattggtctccttgatgcttttggacataaatctaatattagccttgtctt 
tgattttatggaaactgatctagaggttataataaaggataatagtcttgtgctgacacc 
atcacacatcaaagcctacatgttgatgactcttcaaggattagaatatttacatcaaca 
ttggatcctacatagggatctgaaaccaaacaacttgttgctagatgaaaatggagttct 
aaaactggcagattttggcctggccaaatcttttgggagccccaatagagcttatacaca 
tcaggttgtaaccaggtggtatcgggcccccgagttactatttggagctaggatgtatgg 
tgtaggtgtggacatgtgggctgttggctgtatattagcagagttacttctaagggttcc 
ttttttgccaggagattcagaccttgatcagctaacaagaatatttgaaactttgggcac 
accaactgaggaacagtggccggacatgtgtagtcttccagattatgtgacatttaagag 
tttccctggaatacctttgcatcacatcttcagtgcagcaggagacgacttactagatct 
catacaaggcttattcttatttaatccatgtgctcgaattacggccacacaggcactgaa 
aatgaagtatttcagtaatcggccagggccaacacctggatgtcagctgccaagaccaaa 
ctgtccagtggaaaccttaaaggagcaatcaaatccagctttggcaataaaaaggaaaag 
aacagaggccttagaacaaggaggattgcccaagaaactaattttttaaagagaacactg 
gacaacattttactactgagggaaatagccaaaaaggcaaataatggaaaaatagtaaac 
attaagtaaatgctgtagaagtgagtttgtaaatattctacacatgtaaaatatgtaaaa 
ctatgggttatttttattaaatgtattttaaaataaaaatttaattctggtttttctgat 
tagagtgcaaaagtgagaaaagttcaatactcttgaaatgtagaattgaaaatgcattag 
ggaaaacttaataaaaattattaccagtta 

SEQ ID NO: 48. >F02366_T7 # TY Transcript # LN 1477 # Source Gene: F02366 # 
Encoded protein: F02366_P5 

ggaacgccaaccgcctgggcctagcgcagcttcctccgcccaccacggaagtgaggcggg 
gatactaaagcgacggagcccggtggacggaagtgggtgttggaggctttaaggtagctt 
taaattcgtgttgtcctgggagctcgcccttttcggctggagtcgggctttacggcgccg 
gatggctctggacgtgaagtctcgggcaaagcgttatgagaagctggacttccttgggga 
gggacagtttgccaccgtttacaaggccagagataagaataatcaaacttggacatagat 
cagaagctaaagatggtataaatagaaccgccttaagagagataaaattattacaggagc 
taagtcatccaaatataattggtctccttgatgcttttggacataaatctaatattagcc 
ttgtctttgattttatggaaactgatctagaggttataataaaggataatagtcttgtgc 
tgacaccatcacacatcaaagcctacatgttgatgactcttcaaggattagaatatttac 
atcaacattggatcctacatagggatctgaaaccaaacaacttgttgctagatgaaaatg 
gagttctaaaactggcagattttggcctggccaaatcttttgggagccccaatagagctt 
atacacatcaggttgtaaccaggtggtatcgggcccccgagttactatttggagctagga 
tgtatggtgtaggtgtggacatgtgggctgttggctgtatattagcagagttacttctaa 
gggttccttttttgccaggagattcagaccttgatcagctaacaagaatatttgaaactt 
tgggcacaccaactgaggaacagtggccggacatgtgtagtcttccagattatgtgacat 
ttaagagtttccctggaatacctttgcatcacatcttcagtgcagcaggagacgacttac 
tagatctcatacaaggcttattcttatttaatccatgtgctcgaattacggccacacagg 
cactgaaaatgaagtatttcagtaatcggccagggccaacacctggatgtcagctgccaa 
gaccaaactgtccagtggaaaccttaaaggagcaatcaaatccagctttggcaataaaaa 
ggaaaagaacagaggccttagaacaaggaggattgcccaagaaactaattttttaaagag 
aacactggacaacattttactactgagggaaatagccaaaaaggcaaataatggaaaaat 
agtaaacattaagtaaatgctgtagaagtgagtttgtaaatattctacacatgtaaaata 
tgtaaaactatgggttatttttattaaatgtattttaaaataaaaatttaattctggttt 
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Figur 1 (Cont'd) 



ttctgattagagtgcaaaagtgagaaaagttcaatactcttgaaatgtagaattgaaaat 
gcattagggaaaacttaataaaaattattaccagtta 

SEQ ID NO:49. >F02366_T8 # TY Transcript # LN 1911 # Source Gene: F02366 # 
Encoded protein: F02366_P4 

ggaacgccaaccgcctgggcctagcgcagcttcctccgcccaccacggaagtgaggcggg 
gatactaaagcgacggagcccggtggacggaagtgggtgttggaggctttaaggtagctt 
taaattcgtgttgtcctgggagctcgcccttttcggctggagtcgggctttacggcgccg 
gatggctctggacgtgaagtctcgggcaaagcgttatgagaagctggacttccttgggga 
gggacagtttgccaccgtttacaaggccagagataagaataccaaccaaattgtcgccat 
taagaaaggtcccactctgtcgcacaggctggagtgcagtggcgtgatctcggctcactg 
caacctccacctccccaggctcaagcagttctcctgccgcagcctctcaactgctgacct 
caactgatctgcccacctcggcctcccaaagtgctgagattacaggcttgagccaccaca 
cccagcctgcaacttgttattctgagtcaacattatttccgaaatcctctatatcagaca 
aggtttcactttgttgtccagtctggtgtcaaactcctggccttaagcaatcctccagca 
tcggcctcccaaagagctgggattacaggagtctctgctggtagccactacagctgtgaa 
tgtgctgtgtcatacctgaagccaatatgtctcggagtttcacctgaggcccatgatcaa 
acttggacatagatcagaagctaaagatggtataaatagaaccgccttaagagagataaa 
attattacaggagctaagtcatccaaatataattggtctccttgatgcttttggacataa 
atctaatattagccttgtctttgattttatggaaactgatctagaggttataataaagga 
taatagtcttgtgctgacaccatcacacatcaaagcctacatgttgatgactcttcaagg 
attagaatatttacatcaacattggatcctacatagggatctgaaaccaaacaacttgtt 
gctagatgaaaatggagttctaaaactggcagattttggcctggccaaatcttttgggag 
ccccaatagagcttatacacatcaggttgtaaccaggtggtatcgggcccccgagttact 
atttggagctaggatgtatggtgtaggtgtggacatgtgggctgttggctgtatattagc 
agagttacttctaagggttccttttttgccaggagattcagaccttgatcagctaacaag 
aatatttgaaactttgggcacaccaactgaggaacagtggccggacatgtgtagtcttcc 
agattatgtgacatttaagagtttccctggaatacctttgcatcacatcttcagtgcagc 
aggagacgacttactagatctcatacaaggcttattcttatttaatccatgtgctcgaat 
tacggccacacaggcactgaaaatgaagtatttcagtaatcggccagggccaacacctgg 
atgtcagctgccaagaccaaactgtccagtggaaaccttaaaggagcaatcaaatccagc 
tttggcaataaaaaggaaaagaacagaggccttagaacaaggaggattgcccaagaaact 
aattttttaaagagaacactggacaacattttactactgagggaaatagccaaaaaggca 
aataatggaaaaatagtaaacattaagtaaatgctgtagaagtgagtttgtaaatattct 
acacatgtaaaatatgtaaaactatgggttatttttattaaatgtattttaaaataaaaa 
tttaattctggtttttctgattagagtgcaaaagtgagaaaagttcaatactcttgaaat 
gtagaattgaaaatgcattagggaaaacttaataaaaattattaccagtta 

SEQ ID NO: 50. >F02366_T9 # TY Transcript # LN 1435 # Source Gene: F02366 # 
Encoded protein: F02366_P6 

ggaacgccaaccgcctgggcctagcgcagcttcctccgcccaccacggaagtgaggcggg 
gatactaaagcgacggagcccggtggacggaagtgggtgttggaggctttaaggtagctt 
taaattcgtgttgtcctgggagctcgcccttttcggctggagtcgggctttacggcgccg 
gatggctctggacgtgaagtctcgggcaaagcgttatgagaagctggacttccttgggga 
gggacagtttgccaccgtttacaaggccagagataagaataccaaccaaattgtcgccat 
taagaaaatcaaacttggacatagatcagaagctaaagatgctccttgatgcttttggac 
ataaatctaatattagccttgtctttgattttatggaaactgatctagaggttataataa 
aggataatagtcttgtgctgacaccatcacacatcaaagcctacatgttgatgactcttc 
aaggattagaatatttacatcaacattggatcctacatagggatctgaaaccaaacaact 
tgttgctagatgaaaatggagttctaaaactggcagattttggcctggccaaatcttttg 
ggagccccaatagagcttatacacatcaggttgtaaccaggtggtatcgggcccccgagt 
tactatttggagctaggatgtatggtgtaggtgtggacatgtgggctgttggctgtatat 
tagcagagttacttctaagggttccttttttgccaggagattcagaccttgatcagctaa 
caagaatatttgaaactttgggcacaccaactgaggaacagtggccggacatgtgtagtc 
ttccagattatgtgacatttaagagtttccctggaatacctttgcatcacatcttcagtg 
cagcaggagacgacttactagatctcatacaaggcttattcttatttaatccatgtgctc 
gaattacggccacacaggcactgaaaatgaagtatttcagtaatcggccagggccaacac 
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ctggatgtcagctgccaagaccaaactgtccagtggaaaccttaaaggagcaatcaaatc 
cagctttggcaataaaaaggaaaagaacagaggccttagaacaaggaggattgcccaaga 
aactaattttttaaagagaacactggacaacattttactactgagggaaatagccaaaaa 
ggcaaataatggaaaaatagtaaacattaagtaaatgctgtagaagtgagtttgtaaata 
ttctacacatgtaaaatatgtaaaactatgggttatttttattaaatgtattttaaaata 
aaaatttaattctggtttttctgattagagtgcaaaagtgagaaaagttcaatactcttg 
aaatgtagaattgaaaatgcattagggaaaacttaataaaaattattaccagtta 

SEQ ID NO: 51. >F02366_T10 # TY Transcript # LN 1167 # Source Gene: F02366 

# Encoded protein: F02366_P7 

gggcttcctgaggaaacttagataacctcttttgagagtctcctgtattgatacttacat 
tttaagtctgtaaattgttttatgtggtagagtttatttgaaataataaagggtacctgt 
atattgtatacttgctttacaggttataataaaggataatagtcttgtgctgacaccatc 
acacatcaaagcctacatgttgatgactcttcaaggattagaatatttacatcaacattg 
gatcctacatagggatctgaaaccaaacaacttgttgctagatgaaaatggagttctaaa 
actggcagattttggcctggccaaatcttttgggagccccaatagagcttatacacatca 
ggttgtaaccaggtggtatcgggcccccgagttactatttggagctaggatgtatggtgt 
aggtgtggacatgtgggctgttggctgtatattagcagagttacttctaagggttccttt 
tttgccaggagattcagaccttgatcagctaacaagaatatttgaaactttgggcacacc 
aactgaggaacagtggccggacatgtgtagtcttccagattatgtgacatttaagagttt 
ccctggaatacctttgcatcacatcttcagtgcagcaggagacgacttactagatctcat 
acaaggcttattcttatttaatccatgtgctcgaattacggccacacaggcactgaaaat 
gaagtatttcagtaatcggccagggccaacacctggatgtcagctgccaagaccaaactg 
tccagtggaaaccttaaaggagcaatcaaatccagctttggcaataaaaaggaaaagaac 
agaggccttagaacaaggaggattgcccaagaaactaattttttaaagagaacactggac 
aacattttactactgagggaaatagccaaaaaggcaaataatggaaaaatagtaaacatt 
aagtaaatgctgtagaagtgagtttgtaaatattctacacatgtaaaatatgtaaaacta 
tgggttatttttattaaatgtattttaaaataaaaatttaattctggtttttctgattag 
agtgcaaaagtgagaaaagttcaatactcttgaaatgtagaattgaaaatgcattaggga 
aaacttaataaaaattattaccagtta 

SEQ ID NO: 52. >F02366JT11 # TY Transcript # LN 1103 # Source Gene: F02366 

# Encoded protein: F02366_P8 

ggaacgccaaccgcctgggcctagcgcagcttcctccgcccaccacggaagtgaggcggg 
gatactaaagcgacggagcccggtggacggaagtgggtgttggaggctttaaggtagctt 
taaattcgtgttgtcctgggagctcgcccttttcggctggagtcgggctttacggcgccg 
gatggctctggacgtgaagtctcgggcaaagcgttatgagaagctggacttccttgggga 
gggacagtttgccaccgtttacaaggccagagataagaataccaaccaaattgtcgccat 
taagaaaatcaaacttggacatagatcagaagctaaagatggtataaatagaaccgcctt 
aagagagataaaattattacaggagctaagtcatccaaatataattggtctccttgatgc 
ttttggacataaatctaatattagccttgtctttgattttatggaaactgatctagaggt 
tataataaaggataatagtcttgtgctgacaccatcacacatcaaagcctacatgttgat 
gactcttcaaggattagaatatttacatcaacattggatcctacatagggatctgaaacc 
aaacaacttgttgctagatgaaaatggagttctaaaactggcagattttggcctggccaa 
atcttttgggagccccaatagagcttatacacatcaggttgtaaccaggtggtatcgggc 
ccccgagttactatttggagctaggatgtatggtgtaggtgtggacatgtgggctgttgg 
ctgtatattagcagagttacttctaaggtctttttggccggagatcgaactgtcagtaca 
gattttgacttgggccccattggggacgtggcggcatggtgtcttccgatttgtgaactt 
aagttccggaacctgggccccttccggcgggagatcgactcaaggggttcttatcggtcg 
ctatgcccggcggagagatttttgtcgggccgttgttgtcgcctctcggcttgcacaccg 
cgggaaaaaaagcgcggcgtctgcgcggggcgtgcctgtccgctccgtctccgctctcct 
cttctcttttgcttttggggtgc 

SEQ ID NO: 53. >F02366_T12 # TY Transcript # LN 1388 # Source Gene: F02366 

# Encoded protein: F02366_P9 

gctcgttctcgttgggggaaaccgtccagacgcacttgctgcccattctttacatcctgg 
gggtgaatccctgaggggcctctccttgctgaagagtagcctggagctggacggagactg 
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acccgccacgtttccagccgccgcgagtctgctcagaaactctgggctctttgcttcgcg 
aaatgtaaaaatgcaaaaaggcaaacacaaaaactcccataaacttatgttatttctatt 
tttctttctagtttgccaccgtttacaaggccagagataagaataccaaccaaattgtcg 
ccattaagaaaatcaaacttggacatagatcagaagctaaagatggtataaatagaaccg 
ccttaagagagataaaattattacaggagctaagtcatccaaatataattggtctccttg 
atgcttttggacataaatctaatattagccttgtctttgattttatggaaactgatctag 
aggttataataaaggataatagtcttgtgctgacaccatcacacatcaaagcctacatgt 
tgatgactcttcaaggattagaatatttacatcaacattggatcctacatagggtggtat 
cgggcccccgagttactatttggagctaggatgtatggtgtaggtgtggacatgtgggct 
gttggctgtatattagcagagttacttctaagggttccttttttgccaggagattcagac 
cttgatcagctaacaagaatatttgaaactttgggcacaccaactgaggaacagtggccg 
gacatgtgtagtcttccagattatgtgacatttaagagtttccctggaatacctttgcat 
cacatcttcagtgcagcaggagacgacttactagatctcatacaaggcttattcttattt 
aatccatgtgctcgaattacggccacacaggcactgaaaatgaagtatttcagtaatcgg 
ccagggccaacacctggatgtcagctgccaagaccaaactgtccagtggaaaccttaaag 
gagcaatcaaatccagctttggcaataaaaaggaaaagaacagaggccttagaacaagga 
ggattgcccaagaaactaattttttaaagagaacactggacaacattttactactgaggg 
aaatagccaaaaaggcaaataatggaaaaatagtaaacattaagtaaatgctgtagaagt 
gagtttgtaaatattctacacatgtaaaatatgtaaaactatgggttatttttattaaat 
gtattttaaaataaaaatttaattctggtttttctgattagagtgcaaaagtgagaaaag 
ttcaatactcttgaaatgtagaattgaaaatgcattagggaaaacttaataaaaattatt 
accagtta 

SEQ ID NO:54. >F02366_T13 # TY Transcript # LN 729 # Source Gene: F02366 # 
Encoded protein: F02366_P10 

aggtttggtaaacccaagggttgggtattcgggggccccccccggaggttaacctaattt 
tgggaaggcttaaggaatgaaatgggtgtagggtggtgggaccatgtggggcttgttggc 
ctgtaatatttagccagagtttactttctaaggggttccttttttgccaggagattcaga 
ccttgatcagctaacaagaatatttgaaactttgggcacaccaactgaggaacagtggcc 
ggacatgtgtagtcttccagattatgtgacatttaagagtttccctggaatacctttgca 
tcacatcttcagtgcagcaggagacgacttactagatctcatacaaggcttattcttatt 
taatccatgtgctcgaattacggccacacaggcactgaaaatgaagtatttcagtaatcg 
gccagggccaacacctggatgtcagctgccaagaccaaactgtccagtggaaaccttaaa 
ggagcaatcaaatccagctttggcaataaaaaggaaaagaacagaggccttagaacaagg 
aggattgcccaagaaactaattttttaaagagaacactggacaacattttactactgagg 
gaaatagccaaaaaggcaaataatggaaaaatagtaaacattaagtaaatgctgtagaag 
tgagtttgtaaatattctacacatgtaaaatatgtaaaactatgggttatttttattaaa 
tgtatttta 

SEQ ID NO:55. >F02366_T14 # TY Transcript # LN 1075 # Source Gene: F02366 
# Encoded protein: F02366_P11 

ggaacgccaaccgcctgggcctagcgcagcttcctccgcccaccacggaagtgaggcggg 
gatactaaagcgacggagcccggtggacggaagtgggtgttggaggctttaaggtagctt 
taaattcgtgttgtcctgggagctcgcccttttcggctggagtcgggctttacggcgccg 
gatggctctggacgtgaagtctcgggcaaagcgttatgagaagctggacttccttgggga 
gggacagtttgccaccgtttacaaggccagagataagaataccaaccaaattgtcgccat 
taagaaaatcaaacttggacatagatcagaagctaaagatggtataaatagaaccgcctt 
aagagagataaaattattacaggagctaagtcatccaaatataattggtctccttgatgc 
ttttggacataaatctaatattagccttgtctttgattttatggaaactgatctagaggt 
tataataaaggataatagtcttgtgctgacaccatcacacatcaaagcctacatgttgat 
gactcttcaaggattagaatatttacatcaacattggatcctacatagggatctgaaacc 
aaacaacttgttgctagatgaaaatggagttctaaaactggcagattttggcctggccaa 
atcttttgggagccccaataggttttaccagcccttggacactatgggaggagggcaaga 
gtacaccaatttgttaaaagcaagaaaccacagtgtctcttcactagtcatttagaacat 
ggttatcatccaagactactctaccctgcaacattgaactcccaagagcaaatccacatt 
cctcttgagttctgcagcttctgtgtaaatagggcagctgtcgtctatgccgtagaatca 
catgatctgaggaccactcatggaagctgctaaatagcctagtctggggagtcttccata 
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aagttttgcatggagcaacaaacaggatttaactaggtttggttccttcagccctctaaa 
agcatagggcttagcctgcaggcttccttggctttctctgtggtgtagaaattga 

SEQ ID NO: 56, >F02366_T15 # TY Transcript # LN 693 # Source Gene: F02366 # 
Encoded protein: F02366JP12 

ggaacgccaaccgcctgggcctagcgcagcttcctccgcccaccacggaagtgaggcggg 
gatactaaagcgacggagcccggtggacggaagtgggtgttggaggctttaaggtagctt 
taaattcgtgttgtcctgggagctcgcccttttcggctggagtcgggctttacggcgccg 
gatggctctggacgtgaagtctcgggcaaagcgttatgagaagctggacttccttgggga 
gggacagtttgccaccgtttacaaggccagagataagaataccaaccaaattgtcgccat 
taagaaaatcaaacttggacatagatcagaagctaaagatggtataaatagaaccgcctt 
aagagagataaaattattacaggagctaagtcatccaaatataattggtctccttgatgc 
ttttggacataaatctaatattagccttgtctttgattttatggaaactgatctagaggt 
tataataaaggataatagtcttgtgctgacaccatcacacatcaaagcctacatgttgat 
gactcttcaaggattagaatatttacatcaacattggatcctacataggatctgaaacca 
aacaacttgttgctaggtgaataggcctattcacatgagggtggaaccaggtggtatcgt 
ctccccggtttctatttgagctagaattatagg 

SEQ ID NO: 57, >F02366_P1 # TY Protein # CC #LN 346 # Source Gene: F02366 

# Encoding Transcript: 1 

MALDVKSRAKRYEKLDFLGEGQFATVYKARDKNTNQIVAIKKIKLGHRSEAKDGINRTAL 
REIKLLQELSHPNIIGLLDAFGHKSNISLVFDFMETDLEVIIKDNSLVLTPSHIKAYMLM 
TLQGLE YLHQHWI LH RDLKPNNLLL DENGVLKLADFGLAKS FGS PNRAYTHQWT RWYRA 
PELLFGARMYGVGVDMWAVGCILAELLLRVPFLPGDSDLDQLTRIFETLGTPTEEQWPDM 
CSLPDYVTFKSFPGIPLHHIFSAAGDDLLDLIQGLFLFNPCARITATQALKMKYFSNRPG 
PTPGCQLPRPNCPVETLKEQSNPALAIKRKRTEALEQGGLPKKLIF 

SEQ ID NO: 58. >F02366_P2 # TY Protein # CC #LN 346 # Source Gene: F02366 

# Encoding Transcript: 3 

MALDVKSRAKRYEKLDFLGEGQFATVYKARDKNTNQIVAIKKIKLGHRSEAKDGINRTAL 
REIKLLQELSHPNIIGLLDAFGHKSNISLVFDFMETDLEVIIKDNSLVLTPSHIKAYMLM 
TLQGLE YLHQHWI LHRDLKPNNLLLDENGVLKLADFGLAKS FGS PNRAYTHQWTRWYRA 
PELLFGARMYGVGVDMWAVGCILAELLLRVPFLPGDSDLDQLTRIFETLGTPTEEQWPDM 
CSLPDYVTFKSFPGIPLHHIFSAAGDDLLDLIQGLFLFNPCARITATQALKMKYFSNRPG 
PTPGCQLPRPNCPVETLKEQSNPALAIKRKRTEALEQGKIPTFKRN 

SEQ ID NO: 59. >F02366_P3 # TY Protein # CC #LN 253 # Source Gene: F02366 

# Encoding Transcript: 4 

METDLEVIIKDNSLVLTPSHIKAYMLMTLQGLEYLHQHWILHRDLKPNNLLLDENGVLKL 
ADFGLAKSFGS PNRAYTHQWTRWYRAPELLFGARMYGVGVDMWAVGC ILAELLLRVPFL 
PGDSDLDQLTRIFETLGTPTEEQWPDMCSLPDYVTFKSFPGIPLHHIFSAAGDDLLDLIQ 
GLFLFNPCARITATQALKMKYFSNRPGPTPGCQLPRPNCPVETLKEQSNPALAIKRKRTE 
ALEQGGL PKKLI F 

SEQ ID NO: 60. >F02366_P4 # TY Protein # CC #LN 305 # Source Gene: F02366 

# Encoding Transcript: 6 

MIKLGHRSEAKDGINRTALREIKLLQELSHPNIIGLLDAFGHKSNISLVFDFMETDLEVI 
I KDNSLVLTPSH I KAYMLMTLQGLE YLHQHWI LHRDLKPNNLLLDENGVLKLADFGLAKS 
FGS PNRAYTHQWTRWY RAPELLFGARMYGVGV DMWAVGC I LAELLLRVP FL PGDSDLDQ 
LTRIFETLGTPTEEQWPDMCSLPDYVTFKSFPGIPLHHIFSAAGDDLLDLIQGLFLFNPC 
ARITATQALKMKYFSNRPGPTPGCQLPRPNCPVETLKEQSNPALAIKRKRTEALEQGGLP 
KKLIF 

SEQ ID NO: 61. >F02366_P5 # TY Protein # CC #LN 326 # Source Gene: F02366 

# Encoding Transcript: 7 

MRSWTSLGRDSLPPFTRPEIRIIKLGHRSEAKDGINRTALREIKLLQELSHPNIIGLLDA 
FGHKSN I SLVFDFMETDLEVI I KDNS LVLTPS H I KAYMLMTLQGLE YLHQHWI LHRDLKP 
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NNLLLDENGVLKLADFGLAKSFGSPNRAYTHQWTRWYRAPELLFGARMYGVGVDMWAVG 
CILAELLLRVPFLPGDSDLDQLTRIFETLGTPTEEQWPDMCSLPDYVTFKSFPGIPLHHI 
FSAAGDDLLDLIQGLFLFNPCARITATQALKMKYFSNRPGPTPGCQLPRPNCPVETLKEQ 
SNPALAIKRKRTEALEQGGLPKKLIF 

SEQ ID NO: 62. >F02366_P6 # TY Protein # CC #LN 312 < Source Gene: F02366 

# Encoding Transcript: 9 

MRSWTSLGRDSLPPFTRPEIRIPTKLSPLRKSNLDIDQKLKMLLDAFGHKSNISLVFDFM 
ETDLEVIIKDNSLVLTPSHIKAYMLMTLQGLEYLHQHWILHRDLKPNNLLLDENGVLKLA 
DFGLAKSFGSPNRAYTHQWTRWYRAPELLFGARMYGVGVDMWAVGCILAELLLRVPFLP 
GDSDLDQLTRIFETLGTPTEEQWPDMCSLPDYVTFKSFPGIPLHHIFSAAGDDLLDLIQG 
LFLFNPCARITATQALKMKYFSNRPGPTPGCQLPRPNCPVETLKEQSNPALAIKRKRTEA 
LEQGGLPKKLIF 

SEQ ID NO: 63. >F02366_P7 # TY Protein # CC #LN 229 # Source Gene: F02366 

# Encoding Transcript: 10 

MLMTLQGLE YLHQHWI LHRDLKPNNLLLDENGVLKLADFGLAKS FGSPNRAYTHQWTRW 
YRAPELLFGARMYGVGVDMWAVGCI LAELLLRVPFLPGDSDLDQLTRI FETLGTPTEEQW 
PDMCSLPDYVTFKSFPGIPLHHIFSAAGDDLLDLIQGLFLFNPCARITATQALKMKYFSN 
RPGPTPGCQLPRPNCPVETLKEQSNPALAIKRKRTEALEQGGLPKKLIF 

SEQ ID NO: 64. >F02366_P8 # TY Protein # CC #LN 307 # Source Gene: F02366 

# Encoding Transcript: 11 

MALDVKSRAKRYEKLDFLGEGQFATVYKARDKNTNQIVAIKKIKLGHRSEAKDGINRTAL 
REIKLLQELSHPNIIGLLDAFGHKSNISLVFDFMETDLEVIIKDNSLVLTPSHIKAYMLM 
TLQGLEYLHQHWI LHRDLKPNNLLLDENGVLKLiADFGLAKS FGSPNRAYTHQVVTRWYRA 
PELLFGARMYGVGVDMWAVGC I LAELLLRFLLAGDP PVS KE FDLG PIGDVAAWCL PI CEL 
KFRNLGPFRREITQGVLIGRYARRRDFCRAVWASRLAHRGKKSAASARGVPVRSVSALL 
FSFAFGV 

SEQ ID NO: 65. >F02366_P9 # TY Protein # CC #LN 158 # Source Gene: F02366 

# Encoding Transcript: 12 

MYGVGVDMWAVGCILAELLLRVPFLPGDSDLDQLTRIFETLGTPTEEQWPDMCSLPDYVT 
FKSFPGIPLHHIFSAAGDDLLDLIQGLFLFNPCARITATQALKMKYFSNRPGPTPGCQLP 
RPNCPVETLKEQSNPALAIKRKRTEALEQGGLPKKLIF 

SEQ ID NO: 66 . >F02366_P10 # TY Protein # CC #LN 107 # Source Gene: F02366 

# Encoding Transcript: 13 

MCSLPDYVTFKSFPGIPLHHIFSAAGDDLLDLIQGLFLFNPCARITATQALKMKYFSNRP 
GPTPGCQLPRPNC PVETLKEQSN PALAI KRKRTEALEQGGL PKKL I F 

SEQ ID NO: 67. >F02366_P11 # TY Protein 8 CC #LN 197 # Source Gene: F02366 

# Encoding Transcript: 14 

MALDVKSRAKRYEKLDFLGEGQFATVYKARDKNTNQIVAIKKIKLGHRSEAKDGINRTAL 
REIKLLQELSHPNIIGLLDAFGHKSNISLVFDFMETDLEVIIKDNSLVLTPSHIKAYMLM 
TLQGLEYLHQHWI LHRDLKPNNLLLDENGVLKLADFGLAKSFGSPNRFYQPLDTMGGGQE 
YTNLLKARNHSVSSLVI 

SEQ ID NO: 68. >F02366_P12 # TY Protein # CC #LN 137 # Source Gene: F02366 

# Encoding Transcript: 15 

MALDVKSRAKRYEKLDFLGEGQFATVYKARDKNTNQIVAIKKIKLGHRSEAKDGINRTAL 
REIKLLQELSHPNIIGLLDAFGHKSNISLVFDFMETDLEVIIKDNSLVLTPSHIKAYMLM 
T LQGLE YLHQHW I LH R I 
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SEQ ID NO: 69. >H61320 # TY Consensus # Length 5458 # Number of exons 11 

acgtcagtccccgcgcttttcggaggctgccagcgtcccacaccagccgcaggtgaaaac 

cggcagaaagacattaagagattttcctgcagtcactgctggcagatgatagagccagga 

tttgaaagcaggcagcctggctccagaccctgtgctcttaactcccgttttgcatcaaga 

acagaatcctatgaaaggcttgtacagtgcttggtactgagtaggcactccgtcagtatt 

ttagtagctattattccaaaggacactgtattctaggtactaagacaatgtgaatggaat 

ggaaaacaattcctccctgcagaagccaactgtctaggaagacagataatgacattatta 

aaataatcagaacattggcaaaataaagaagattgaaatgcagttcatatgacagtgcta 

attaatcatactttttagatctaggaaaggccataaaggttaaattgacttggaaatggt 

aaacaagacaatggttgtgtcactgatcggtgatggccagtttttggttatccttatttg 

ccatttaaatgattgaattgaaaggaagaagtaagacttggaagatgaatattgaaggaa 

aggtgaagatggaaaagagaagggggtttattctgagaacagtgactagaactctgaggt 

atggtagaaatttcaggataacagatgataaaggggaatgaagccatggagaggaagaga 

atgataatattaattgtgaaagtttgtgtcaagtgttttctgagtcacctttctttcagt 

ttctcatttctagtattgtataaatgagtttaacatcaaaatggtatagtgaaaaaaagg 

atgaactgcttgacagaagaggtcagatctagttttgactctagcttatttagctttaca 

gctttaaataaattacagtgtctttaaagctctggatcttaagttttctaattgtttttt 

aaacaaagatttgtgaacacctacttacaatgtgacaaatgtctttgtaaggcattatgg 

gagataaaagctgcattaaacctggctgttgtacttacaagtaatttggaaggcaaaatt 

tcaaatacttgaacctaaagtagggtttcttggttttcatttaattgaaatactgttagg 

aaacaaaaatttttcataaatttttcttgtggatatgtaagtaaattctatatatctagc 

aatatgatacatctatagttctgtttaaaggtaacaatgtgaggaagcttatgcgaataa 

cttgacagaagaaatctaaagtaaaagcattttcctagtaggctagcttgttctcttatt 

tcatttattgtaacagtttcactttttaataattatgtgaaattaggaaacagatattct 

agggcattctgtttagggtgatgaacttattcatgaatgttattaaaaccatttaaatat 

gccatagtttaaatttgtgatccatgctgttcaatctgtgacctaggatagcagcatcaa 

ggagcattgtgtacatgcagaagtgcacagtacctggagtgaaactgcttgtgttcgatt 

tctgataccattcataactggctgtgtgatctcaaataagaaaactgaagaaccagctgg 

gcgcggtagctcacgcctgtaatcccagcactttgggaggctgaggcgggtggattacct 

gagaacctctaaaatgcagacctccagctctagatctgtgcacctgagtgaatggcagaa 

gaattacttcgcaattacatctggcatatgtaccggaccgaaggcagatgcataccgtgc 

acagatattacgcattcagtatgcatgggcaaactctgagatttcccaggtctgtgctac 

caaactgttcaaaaaatatgcagagaaatattctgcaattattgattctgacaatgttga 

atctgggttgaataattatgcagaaaacattttaactttggcaggatctcaacaaacaga 

tagtgacaagtggcagtctggattgtcaataaataatgttttcaaaatgagtagtgtaca 

gaagatgatgcaagctggcaaaaaattcaaagactctctgttggaacctgctcttgcatc 

agtggtaatccataaggaggccactgtctttgatcttcctaaatttagtgtttgtggtag 

ttctcaagagagtgactcattacctaactcagctcatgatcgagaccggacccaagactt 

cccggagagcaatcgtttgaaactccttcagaatgcccagccacctatggtgactaacac 

tgctaggacttgtcctacattctcagcacctgtaggtgagtcagctactgcaaaattcca 

tgtcacaccattgtttggaaatgtcaaaaaggaaaatcacagctctgcaaaagaaaacat 

aggacttaatgtgttcttatctaaccagtcttgttttcctgctgcctgtgaaaatccaca 

gaggaagtctttttatggttctggcaccattgatgcactttccaatccaatactgaataa 

ggcttgtagtaaaacagaagataatggcccaaaggaggatagcagcctgcctacatttaa 

aactgcaaaagaacaattatgggtagatcagcaaaaaaagtaccaccaacctcagcgtgc 

atcagggtcttcatatggtggtgtaaaaaagtctctaggagctagtagatcccgagggat 

acttggaaagtttgttcctcctatacccaagcaagatgggggagagcagaatggaggaat 

gcaatgtaagccttatggggcaggacctacagaaccagcacatccagttgatgagcgtct 

gaagaacttggagccaaagatgattgaacttattatgaatgagattatggatcatggacc 

tccagtaaattgggaagatattgcaggagtagaatttgctaaagccaccataaaggaaat 

agttgtgtggcccatgttgaggccagacatctttactggtttaaggggaccccctaaagg 

aattttgctctttggtcctcctgggactggtaaaactctaattggcaagtgcattgctag 

tcagtctggggcaacattctttagcatctctgcttcatccttaacttctaaatgggtagg 

tgagggggagaaaatggtccgtgcattgtttgctgttgcaaggtgtcagcaaccagctgt 

gatatttattgacgaaattgattccttgttatctcaacggggagatggtgagcatgaatc 
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ttctagaaggataaaaacagaatttttagttcaattagatggagcaacaacatcttctga 
agatcgtatcctagtggtgggagcaacaaatcggccacaagaaattgatgaggctgcccg 
gagaagattggtgaaaaggctttatattcccctcccagaagcttcagccaggaaacagat 
agtaattaatctaatgtccaaagagcagtgttgcctcagtgaagaagaaattgaacagat 
tgtacagcagtctgatgcgttttcaggagcagacatgacacagctttgcagggaggcttc 
tcttggtcctattcgcagtttacaaactgctgacattgctaccataacaccggatcaagt 
tcgacccatagcttacattgattttgaaaatgcttttagaactgtgcgacctagtgtttc 
tccaaaagatttagagctttatgaaaactggaacaaaacttttggttgtggaaagtaagt 
gggatacttggaatcaaggcatctctgtattacagtcttctttattttttagcatagaaa 
gttggggatgtgttaattgtatttttaagaatatattctaaattctgtacttcaaataat 
agcacagattttacatctgattgacatagtgtatgttaatgtaagttttgctttccagtg 
attacctgatacgtaagcctatttgaacaaagtgagaatgaacttttgtttctaagaagt 
ctttatcttgaagctatataacatgaaaagtgagctcaaattttttttagttgaagatta 
catataaagttgtgtctgattaatattcatcttttattgaagaaagtgccttctgatggc 
cacataattcttaatgtcagctagtataatggtttacatttggacaaagtattgcttagt 
gttatttaagtagatttaagatctcaaagctaaagtgccaatttttactttcttcagcca 
atttgttacctcttttatggtttttaatttttatcaggactaacattttcagaaatagca 
aggtgtgatctagtattagattcacagaactgaaaggttaattaagaatgttggttattt 
tcacgggagcgtgtgatatttaacattaatattttatttgacactacagcctgtaatact 
gctctctttcaaaagcaagttttcagatttttataacagacccattttgttttatgaaac 
atgttcattacagaaacatgagaaaatagagataagtaaaaagaataaaagtcactatta 
tcctaccactttgcgggacacactattaatatttaagcatagattcttccagatgttttg 
ttttatacagcagtaggattgtgtaatgtattctctcttgtagcctattcttcaaagcgt 
atcttcactttccatgttaacataaatgtccttttatgattgtctaggattccatggatg 
gatggaatataattgagtcatctggtagagcgcatataggacaggacgctaggttgttta 
tcttttgtttattgtaaaggaactatgtgggatgcatttggtgatctttttaaaaaatgt 
ataaatttcttgagtaattcctggattaaagaatatgcactttttaaagcttaaaaataa 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaacaaaactaaacaggaac 
caaaacaaaaacgacagccacagtcagcaaccaactgaccaaacaagattacaaccacca 
gaaaacaaagagagggggcgccgccctaaacacaatccactccggcgggcgcccacaatg 
ccacaccacacccaacccattttataaaggaggcgccccccaaatgagagcctataacaa 
acaacggccgccacgggcgtcaaaaccacgttgatgggaaaacaacagcactgtcagtat 
cgtacgagaaacaccccactcggggcgggacacactggccgacacccacccccaagacaa 
tgccgcgcccgagacaaacaccatcgggtcagcgtggacaccccgccttcctgccgcaga 
catggtctcgtgtattctcaaaatcacggcccgcaatcatagtcaaagcgacagacgagc 
tcgcccagacacagaccagaccggcgtacaagctgactgaacacaccagacagccacggc 
ggaccaccaactacgacgacagcgacgacccgaaccagcccccaacctctactcgttg 

SEQ ID NO: 70. >H61320_T1 # TY Transcript # LN 3497 # Source Gene: H61320 # 
Encoded protein: H61320_P3 

acgtcagtccccgcgcttttcggaggctgccagcgtcccacaccagccgcaggtgaaaac 
cggcagaaagacattaagagattttcctgcagtcactgctggcagatgatagagccagga 
tttgaaagcaggcagcctggctccagaccctgtgctcttaactcccgttttgcatcaaga 
acagaatcctatgaaaggcttgtacagtgcttggatagcagcatcaaggagcattgtgta 
catgcagaagtgcacagtacctggagtgaaactgcttgtgttcgatttctgataccattc 
ataactggctgtgtgatctcaaaacctctaaaatgcagacctccagctctagatctgtgc 
acctgagtgaatggcagaagaattacttcgcaattacatctggcatatgtaccggaccga 
aggcagatgcataccgtgcacagatattacgcattcagtatgcatgggcaaactctgaga 
tttcccaggtctgtgctaccaaactgttcaaaaaatatgcagagaaatattctgcaatta 
ttgattctgacaatgttgaatctgggttgaataattatgcagaaaacattttaactttgg 
caggatctcaacaaacagatagtgacaagtggcagtctggattgtcaataaataatgttt 
tcaaaatgagtagtgtacagaagatgatgcaagctggcaaaaaattcaaagactctctgt 
tggaacctgctcttgcatcagtggtaatccataaggaggccactgtctttgatcttccta 
aatttagtgtttgtggtagttctcaagagagtgactcattacctaactcagctcatgatc 
gagaccggacccaagacttcccggagagcaatcgtttgaaactccttcagaatgcccagc 
cacctatggtgactaacactgctaggacttgtcctacattctcagcacctgtaggtgagt 
cagctactgcaaaattccatgtcacaccattgtttggaaatgtcaaaaaggaaaatcaca 
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Figure 1 (Cont'd) 



gctctgcaaaagaaaacataggacttaatgtgttcttatctaaccagtcttgttttcctg 
ctgcctgtgaaaatccacagaggaagtctttttatggttctggcaccattgatgcacttt 
ccaatccaatactgaataaggcttgtagtaaaacagaagataatggcccaaaggaggata 
gcagcctgcctacatttaaaactgcaaaagaacaattatgggtagatcagcaaaaaaagt 
accaccaacctcagcgtgcatcagggtcttcatatggtggtgtaaaaaagtctctaggag 
ctagtagatcccgagggatacttggaaagtttgttcctcctatacccaagcaagatgggg 
gagagcagaatggaggaatgcaatgtaagccttatggggcaggacctacagaaccagcac 
atccagttgatgagcgtctgaagaacttggagccaaagatgattgaacttattatgaatg 
agattatggatcatggacctccagtaaattgggaagatattgcaggagtagaatttgcta 
aagccaccataaaggaaatagttgtgtggcccatgttgaggccagacatctttactggtt 
taaggggaccccctaaaggaattttgctctttggtcctcctgggactggtaaaactctaa 
ttggcaagtgcattgctagtcagtctggggcaacattctttagcatctctgcttcatcct 
taacttctaaatgggtaggtgagggggagaaaatggtccgtgcattgtttgctgttgcaa 
ggtgtcagcaaccagctgtgatatttattgacgaaattgattccttgttatctcaacggg 
gagatggtgagcatgaatcttctagaaggataaaaacagaatttttagttcaattagatg 
gagcaacaacatcttctgaagatcgtatcctagtggtgggagcaacaaatcggccacaag 
aaattgatgaggctgcccggagaagattggtgaaaaggctttatattcccctcccagaag 
cttcagccaggaaacagatagtaattaatctaatgtccaaagagcagtgttgcctcagtg 
aagaagaaattgaacagattgtacagcagtctgatgcgttttcaggagcagacatgacac 
agctttgcagggaggcttctcttggtcctattcgcagtttacaaactgctgacattgcta 
ccataacaccggatcaagttcgacccatagcttacattgattttgaaaatgcttttagaa 
ctgtgcgacctagtgtttctccaaaagatttagagctttatgaaaactggaacaaaactt 
ttggttgtggaaagtaagtgggatacttggaatcaaggcatctctgtattacagtcttct 
ttattttttagcatagaaagttggggatgtgttaattgtatttttaagaatatattctaa 
attctgtacttcaaataatagcacagattttacatctgattgacatagtgtatgttaatg 
taagttttgctttccagtgattacctgatacgtaagcctatttgaacaaagtgagaatga 
acttttgtttctaagaagtctttatcttgaagctatataacatgaaaagtgagctcaaat 
tttttttagttgaagattacatataaagttgtgtctgattaatattcatcttttattgaa 
gaaagtgccttctgatggccacataattcttaatgtcagctagtataatggtttacattt 
ggacaaagtattgcttagtgttatttaagtagatttaagatctcaaagctaaagtgccaa 
tttttactttcttcagccaatttgttacctcttttatggtttttaatttttatcaggact 
aacattttcagaaatagcaaggtgtgatctagtattagattcacagaactgaaaggttaa 
ttaagaatgttggttattttcacgggagcgtgtgatatttaacattaatattttatttga 
cactacagcctgtaatactgctctctttcaaaagcaagttttcagatttttataacagac 
ccattttgttttatgaaacatgttcattacagaaacatgagaaaatagagataagtaaaa 
agaataaaagtcactattatcctaccactttgcgggacacactattaatatttaagcata 
gattcttccagatgttttgttttatacagcagtaggattgtgtaatgtattctctcttgt 
agcctattcttcaaagcgtatcttcactttccatgttaacataaatgtccttttatgatt 
gtctaggattccatggatggatggaatataattgagtcatctggtagagcgcatatagga 
caggacgctaggttgtttatcttttgtttattgtaaaggaactatgtgggatgcatttgg 
tgatctttttaaaaaatgtataaatttcttgagtaattcctggattaaagaatatgcact 
ttttaaagcttaaaaat 

SEQ ID NO: 71. >H61320_T2 # TY Transcript < LN 3942 # Source Gene: H61320 # 
Encoded protein: H61320_J?1 

acgtcagtccccgcgcttttcggaggctgccagcgtcccacaccagccgcaggtgaaaac 
cggcagaaagacattaagagattttcctgcagtcactgctggcagatgatagagccagga 
tttgaaagcaggcagcctggctccagaccctgtgctcttaactcccgttttgcatcaaga 
acagaatcctatgaaaggcttgtacagtgcttggtactgagtaggcactccgtcagtatt 
ttagtagctattattccaaaggacactgtattctaggtactaagacaatgtgaatggaat 
ggaaaacaattcctccctgcagaagccaactgtctaggaagacagataatgacattatta 
aaataatcagaacattggcaaaataaagaagattgaaatgcagttcatatgacagtgcta 
attaatcatactttttagatctaggaaaggccataaaggttaaattgacttggaaatggt 
aaacaagacaatggttgtgtcactgatcggtgatggccagtttttggttatccttatttg 
ccatttaaatgattgaattgaaaggaagaagtaagacttggaagatgaatattgaaggaa 
aggtgaagatggaaaagagaagggggtttattctgagaacagtgactagaactctgagga 
tagcagcatcaaggagcattgtgtacatgcagaagtgcacagtacctggagtgaaactgc 
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Figur 1 (Cont'd) 



ttgtgttcgatttctgataccattcataactggctgtgtgatctcaaaacctctaaaatg 
cagacctccagctctagatctgtgcacctgagtgaatggcagaagaattacttcgcaatt 
acatctggcatatgtaccggaccgaaggcagatgcataccgtgcacagatattacgcatt 
cagtatgcatgggcaaactctgagatttcccaggtctgtgctaccaaactgttcaaaaaa 
tatgcagagaaatattctgcaattattgattctgacaatgttgaatctgggttgaataat 
tatgcagaaaacattttaactttggcaggatctcaacaaacagatagtgacaagtggcag 
tctggattgtcaataaataatgttttcaaaatgagtagtgtacagaagatgatgcaagct 
ggcaaaaaattcaaagactctctgttggaacctgctcttgcatcagtggtaatccataag 
gaggccactgtctttgatcttcctaaatttagtgtttgtggtagttctcaagagagtgac 
tcattacctaactcagctcatgatcgagaccggacccaagacttcccggagagcaatcgt 
ttgaaactccttcagaatgcccagccacctatggtgactaacactgctaggacttgtcct 
acattctcagcacctgtaggtgagtcagctactgcaaaattccatgtcacaccattgttt 
ggaaatgtcaaaaaggaaaatcacagctctgcaaaagaaaacataggacttaatgtgttc 
ttatctaaccagtcttgttttcctgctgcctgtgaaaatccacagaggaagtctttttat 
ggttctggcaccattgatgcactttccaatccaatactgaataaggcttgtagtaaaaca 
gaagataatggcccaaaggaggatagcagcctgcctacatttaaaactgcaaaagaacaa 
ttatgggtagatcagcaaaaaaagtaccaccaacctcagcgtgcatcagggtcttcatat 
ggtggtgtaaaaaagtctctaggagctagtagatcccgagggatacttggaaagtttgtt 
cctcctatacccaagcaagatgggggagagcagaatggaggaatgcaatgtaagccttat 
ggggcaggacctacagaaccagcacatccagttgatgagcgtctgaagaacttggagcca 
aagatgattgaacttattatgaatgagattatggatcatggacctccagtaaattgggaa 
gatattgcaggagtagaatttgctaaagccaccataaaggaaatagttgtgtggcccatg 
ttgaggccagacatctttactggtttaaggggaccccctaaaggaattttgctctttggt 
cctcctgggactggtaaaactctaattggcaagtgcattgctagtcagtctggggcaaca 
ttctttagcatctctgcttcatccttaacttctaaatgggtaggtgagggggagaaaatg 
gtccgtgcattgtttgctgttgcaaggtgtcagcaaccagctgtgatatttattgacgaa 
attgattccttgttatctcaacggggagatggtgagcatgaatcttctagaaggataaaa 
acagaatttttagttcaattagatggagcaacaacatcttctgaagatcgtatcctagtg 
gtgggagcaacaaatcggccacaagaaattgatgaggctgcccggagaagattggtgaaa 
aggctttatattcccctcccagaagcttcagccaggaaacagatagtaattaatctaatg 
tccaaagagcagtgttgcctcagtgaagaagaaattgaacagattgtacagcagtctgat 
gcgttttcaggagcagacatgacacagctttgcagggaggcttctcttggtcctattcgc 
agtttacaaactgctgacattgctaccataacaccggatcaagttcgacccatagcttac 
attgattttgaaaatgcttttagaactgtgcgacctagtgtttctccaaaagatttagag 
ctttatgaaaactggaacaaaacttttggttgtggaaagtaagtgggatacttggaatca 
aggcatctctgtattacagtcttctttattttttagcatagaaagttggggatgtgttaa 
ttgtatttttaagaatatattctaaattctgtacttcaaataatagcacagattttacat 
ctgattgacatagtgtatgttaatgtaagttttgctttccagtgattacctgatacgtaa 
gcctatttgaacaaagtgagaatgaacttttgtttctaagaagtctttatcttgaagcta 
tataacatgaaaagtgagctcaaattttttttagttgaagattacatataaagttgtgtc 
tgattaatattcatcttttattgaagaaagtgccttctgatggccacataattcttaatg 
tcagctagtataatggtttacatttggacaaagtattgcttagtgttatttaagtagatt 
taagatctcaaagctaaagtgccaatttttactttcttcagccaatttgttacctctttt 
atggtttttaatttttatcaggactaacattttcagaaatagcaaggtgtgatctagtat 
tagattcacagaactgaaaggttaattaagaatgttggttattttcacgggagcgtgtga 
tatttaacattaatattttatttgacactacagcctgtaatactgctctctttcaaaagc 
aagttttcagatttttataacagacccattttgttttatgaaacatgttcattacagaaa 
catgagaaaatagagataagtaaaaagaataaaagtcactattatcctaccactttgcgg 
gacacactattaatatttaagcatagattcttccagatgttttgttttatacagcagtag 
gattgtgtaatgtattctctcttgtagcctattcttcaaagcgtatcttcactttccatg 
ttaacataaatgtccttttatgattgtctaggattccatggatggatggaatataattga 
gtcatctggtagagcgcatataggacaggacgctaggttgtttatcttttgtttattgta 
aaggaactatgtgggatgcatttggtgatctttttaaaaaatgtataaatttcttgagta 
attcctggattaaagaatatgcactttttaaagcttaaaaat 
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Figur 1 (Cont'd) 



SEQ ID NO: 72. >H61320_T3 # TY Transcript # LN 4743 # Source Gene: H61320 # 
Encoded protein: H61320_P2 

acgtcagtccccgcgcttttcggaggctgccagcgtcccacaccagccgcaggtgaaaac 
cggcagaaagacattaagagattttcctgcagtcactgctggcagatgatagagccagga 
tttgaaagcaggcagcctggctccagaccctgtgctcttaactcccgttttgcatcaaga 
acagaatcctatgaaaggcttgtacagtgcttggtactgagtaggcactccgtcagtatt 
ttagtagctattattccaaaggacactgtattctaggtactaagacaatgtgaatggaat 
ggaaaacaattcctccctgcagaagccaactgtctaggaagacagataatgacattatta 
aaataatcagaacattggcaaaataaagaagattgaaatgcagttcatatgacagtgcta 
attaatcatactttttagatctaggaaaggccataaaggttaaattgacttggaaatggt 
aaacaagacaatggttgtgtcactgatcggtgatggccagtttttggttatccttatttg 
ccatttaaatgattgaattgaaaggaagaagtaagacttggaagatgaatattgaaggaa 
aggtgaagatggaaaagagaagggggtttattctgagaacagtgactagaactctgaggt 
atggtagaaatttcaggataacagatgataaaggggaatgaagccatggagaggaagaga 
atgataatattaattgtgaaagtttgtgtcaagtgttttctgagtcacctttctttcagt 
ttctcatttctagtattgtataaatgagtttaacatcaaaatggtatagtgaaaaaaagg 
atgaactgcttgacagaagaggtcagatctagttttgactctagcttatttagctttaca 
gctttaaataaattacagtgtctttaaagctctggatcttaagttttctaattgtttttt 
aaacaaagatttgtgaacacctacttacaatgtgacaaatgtctttgtaaggcattatgg 
gagataaaagctgcattaaacctggctgttgtacttacaagtaatttggaaggcaaaatt 
tcaaatacttgaacctaaagtagggtttcttggttttcatttaattgaaatactgttagg 
aaacaaaaatttttcataaatttttcttgtggatatgtaagtaaattctatatatctagc 
aatatgatacatctatagttctgtttaaaggtaacaatgtgaggaagcttatgcgaataa 
cttgacagaagaaatctaaagtaaaagcattttcctagtaggctagcttgttctcttatt 
tcatttattgtaacagtttcactttttaataattatgtgaaattaggaaacagatattct 
agggcattctgtttagggtgatgaacttattcatgaatgttattaaaaccatttaaatat 
gccatagtttaaatttgtgatccatgctgttcaatctgtgacctaggatagcagcatcaa 
ggagcattgtgtacatgcagaagtgcacagtacctggagtgaaactgcttgtgttcgatt 
tctgataccattcataactggctgtgtgatctcaaataagaaaactgaagaaccagctgg 
gcgcggtagctcacgcctgtaatcccagcactttgggaggctgaggcgggtggattacct 
gagaacctctaaaatgcagacctccagctctagatctgtgcacctgagtgaatggcagaa 
gaattacttcgcaattacatctggcatatgtaccggaccgaaggcagatgcataccgtgc 
acagatattacgcattcagtatgcatgggcaaactctgagatttcccaggatctcaacaa 
acagatagtgacaagtggcagtctggattgtcaataaataatgttttcaaaatgagtagt 
gtacagaagatgatgcaagctggcaaaaaattcaaagactctctgttggaacctgctctt 
gcatcagtggtaatccataaggaggccactgtctttgatcttcctaaatttagtgtttgt 
ggtagttctcaagagagtgactcattacctaactcagctcatgatcgagaccggacccaa 
gacttcccggagagcaatcgtttgaaactccttcagaatgcccagccacctatggtgact 
aacactgctaggacttgtcctacattctcagcacctgtaggtgagtcagctactgcaaaa 
ttccatgtcacaccattgtttggaaatgtcaaaaaggaaaatcacagctctgcaaaagaa 
aacataggacttaatgtgttcttatctaaccagtcttgttttcctgctgcctgtgaaaat 
ccacagaggaagtctttttatggttctggcaccattgatgcactttccaatccaatactg 
aataaggcttgtagtaaaacagaagataatggcccaaaggaggatagcagcctgcctaca 
tttaaaactgcaaaagaacaattatgggtagatcagcaaaaaaagtaccaccaacctcag 
cgtgcatcagggtcttcatatggtggtgtaaaaaagtctctaggagctagtagatcccga 
gggatacttggaaagtttgttcctcctatacccaagcaagatgggggagagcagaatgga 
ggaatgcaatgtaagccttatggggcaggacctacagaaccagcacatccagttgatgag 
cgtctgaagaacttggagccaaagatgattgaacttattatgaatgagattatggatcat 
ggacctccagtaaattgggaagatattgcaggagtagaatttgctaaagccaccataaag 
gaaatagttgtgtggcccatgttgaggccagacatctttactggtttaaggggaccccct 
aaaggaattttgctctttggtcctcctgggactggtaaaactctaattggcaagtgcatt 
gctagtcagtctggggcaacattctttagcatctctgcttcatccttaacttctaaatgg 
gtaggtgagggggagaaaatggtccgtgcattgtttgctgttgcaaggtgtcagcaacca 
gctgtgatatttattgacgaaattgattccttgttatctcaacggggagatggtgagcat 
gaatcttctagaaggataaaaacagaatttttagttcaattagatggagcaacaacatct 
tctgaagatcgtatcctagtggtgggagcaacaaatcggccacaagaaattgatgaggct 
gcccggagaagattggtgaaaaggctttatattcccctcccagaagcttcagccaggaaa 
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Figure 1 (Cont'd) 



cagatagtaattaatctaatgtccaaagagcagtgttgcctcagtgaagaagaaattgaa 
cagattgtacagcagtctgatgcgttttcaggagcagacatgacacagctttgcagggag 
gcttctcttggtcctattcgcagtttacaaactgctgacattgctaccataacaccggat 
caagttcgacccatagcttacattgattttgaaaatgcttttagaactgtgcgacctagt 
gtttctccaaaagatttagagctttatgaaaactggaacaaaacttttggttgtggaaag 
taagtgggatacttggaatcaaggcatctctgtattacagtcttctttattttttagcat 
agaaagttggggatgtgttaattgtatttttaagaatatattctaaattctgtacttcaa 
ataatagcacagattttacatctgattgacatagtgtatgttaatgtaagttttgctttc 
cagtgattacctgatacgtaagcctatttgaacaaagtgagaatgaacttttgtttctaa 
gaagtctttatcttgaagctatataacatgaaaagtgagctcaaattttttttagttgaa 
gattacatataaagttgtgtctgattaatattcatcttttattgaagaaagtgccttctg 
atggccacataattcttaatgtcagctagtataatggtttacatttggacaaagtattgc 
ttagtgttatttaagtagatttaagatctcaaagctaaagtgccaatttttactttcttc 
agccaatttgttacctcttttatggtttttaatttttatcaggactaacattttcagaaa 
tagcaaggtgtgatctagtattagattcacagaactgaaaggttaattaagaatgttggt 
tattttcacgggagcgtgtgatatttaacattaatattttatttgacactacagcctgta 
atactgctctctttcaaaagcaagttttcagatttttataacagacccattttgttttat 
gaaacatgttcattacagaaacatgagaaaatagagataagtaaaaagaataaaagtcac 
tattatcctaccactttgcgggacacactattaatatttaagcatagattcttccagatg 
ttttgttttatacagcagtaggattgtgtaatgtattctctcttgtagcctattcttcaa 
agcgtatcttcactttccatgttaacataaatgtccttttatgattgtctaggattccat 
ggatggatggaatataattgagtcatctggtagagcgcatataggacaggacgctaggtt 
gtttatcttttgtttattgtaaaggaactatgtgggatgcatttggtgatctttttaaaa 
aatgtataaatttcttgagtaattcctggattaaagaatatgcactttttaaagcttaaa 
aat 

SEQ ID NO: 73. >H61320_T4 # TY Transcript # LN 3744 # Source Gene: H61320 # 
Encoded protein: H61320_P3 

acgtcagtccccgcgcttttcggaggctgccagcgtcccacaccagccgcaggtgaaaac 
cggcagaaagacattaagagattttcctgcagtcactgctggcagatgatagagccagga 
tttgaaagcaggcagcctggctccagaccctgtgctcttaactcccgttttgcatcaaga 
acagaatcctatgaaaggcttgtacagtgcttggatagcagcatcaaggagcattgtgta 
catgcagaagtgcacagtacctggagtgaaactgcttgtgttcgatttctgataccattc 
ataactggctgtgtgatctcaaaacctctaaaatgcagacctccagctctagatctgtgc 
acctgagtgaatggcagaagaattacttcgcaattacatctggcatatgtaccggaccga 
aggcagatgcataccgtgcacagatattacgcattcagtatgcatgggcaaactctgaga 
tttcccaggtctgtgctaccaaactgttcaaaaaatatgcagagaaatattctgcaatta 
ttgattctgacaatgttgaatctgggttgaataattatgcagaaaacattttaactttgg 
caggatctcaacaaacagatagtgacaagtggcagtctggattgtcaataaataatgttt 
tcaaaatgagtagtgtacagaagatgatgcaagctggcaaaaaattcaaagactctctgt 
tggaacctgctcttgcatcagtggtaatccataaggaggccactgtctttgatcttccta 
aatttagtgtttgtggtagttctcaagagagtgactcattacctaactcagctcatgatc 
gagaccggacccaagacttcccggagagcaatcgtttgaaactccttcagaatgcccagc 
cacctatggtgactaacactgctaggacttgtcctacattctcagcacctgtaggtgagt 
cagctactgcaaaattccatgtcacaccattgtttggaaatgtcaaaaaggaaaatcaca 
gctctgcaaaagaaaacataggacttaatgtgttcttatctaaccagtcttgttttcctg 
ctgcctgtgaaaatccacagaggaagtctttttatggttctggcaccattgatgcacttt 
ccaatccaatactgaataaggcttgtagtaaaacagaagataatggcccaaaggaggata 
gcagcctgcctacatttaaaactgcaaaagaacaattatgggtagatcagcaaaaaaagt 
accaccaacctcagcgtgcatcagggtcttcatatggtggtgtaaaaaagtctctaggag 
ctagtagatcccgagggatacttggaaagtttgttcctcctatacccaagcaagatgggg 
gagagcagaatggaggaatgcaatgtaagccttatggggcaggacctacagaaccagcac 
atccagttgatgagcgtctgaagaacttggagccaaagatgattgaacttattatgaatg 
agattatggatcatggacctccagtaaattgggaagatattgcaggagtagaatttgcta 
aagccaccataaaggaaatagttgtgtggcccatgttgaggccagacatctttactggtt 
taaggggaccccctaaaggaattttgctctttggtcctcctgggactggtaaaactctaa 
ttggcaagtgcattgctagtcagtctggggcaacattctttagcatctctgcttcatcct 
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Figure 1 (Cont'd) 



taacttctaaatgggtaggtgagggggagaaaatggtccgtgcattgtttgctgttgcaa 
ggtgtcagcaaccagctgtgatatttattgacgaaattgattccttgttatctcaacggg 
gagatggtgagcatgaatcttctagaaggataaaaacagaatttttagttcaattagatg 
gagcaacaacatcttctgaagatcgtatcctagtggtgggagcaacaaatcggccacaag 
aaattgatgaggctgcccggagaagattggtgaaaaggctttatattcccctcccagaag 
cttcagccaggaaacagatagtaattaatctaatgtccaaagagcagtgttgcctcagtg 
aagaagaaattgaacagattgtacagcagtctgatgcgttttcaggagcagacatgacac 
agctttgcagggaggcttctcttggtcctattcgcagtttacaaactgctgacattgcta 
ccataacaccggatcaagttcgacccatagcttacattgattttgaaaatgcttttagaa 
ctgtgcgacctagtgtttctccaaaagatttagagctttatgaaaactggaacaaaactt 
ttggttgtggaaagtaagtgggatacttggaatcaaggcatctctgtattacagtcttct 
ttattttttagcatagaaagttggggatgtgttaattgtatttttaagaatatattctaa 
attctgtacttcaaataatagcacagattttacatctgattgacatagtgtatgttaatg 
taagttttgctttccagtgattacctgatacgtaagcctatttgaacaaagtgagaatga 
acttttgtttctaagaagtctttatcttgaagctatataacatgaaaagtgagctcaaat 
tttttttagttgaagattacatataaagttgtgtctgattaatattcatcttttattgaa 
gaaagtgccttctgatggccacataattcttaatgtcagctagtataatggtttacattt 
ggacaaagtattgcttagtgttatttaagtagatttaagatctcaaagctaaagtgccaa 
tttttactttcttcagccaatttgttacctcttttatggtttttaatttttatcaggact 
aacattttcagaaatagcaaggtgtgatctagtattagattcacagaactgaaaggttaa 
ttaagaatgttggttattttcacgggagcgtgtgatatttaacattaatattttatttga 
cactacagcctgtaatactgctctctttcaaaagcaagttttcagatttttataacagac 
ccattttgttttatgaaacatgttcattacagaaacatgagaaaatagagataagtaaaa 
agaataaaagtcactattatcctaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 
aaaaaaaaacaaaactaaacaggaaccaaaacaaaaacgacagccacagtcagcaaccaa 
ctgaccaaacaagattacaaccaccagaaaacaaagagagggggcgccgccctaaacaca 
atccactccggcgggcgcccacaatgccacaccacacccaacccattttataaaggaggc 
gccccccaaatgagagcctataacaaacaacggccgccacgggcgtcaaaaccacgttga 
tgggaaaacaacagcactgtcagtatcgtacgagaaacaccccactcggggcgggacaca 
ctggccgacacccacccccaagacaatgccgcgcccgagacaaacaccatcgggtcagcg 
tggacaccccgccttcctgccgcagacatggtctcgtgtattctcaaaatcacggcccgc 
aatcatagtcaaagcgacagacgagctcgcccagacacagaccagaccggcgtacaagct 
gactgaacacaccagacagccacggcggaccaccaactacgacgacagcgacgacccgaa 
ccagcccccaacctctactcgttg 

SEQ ID NO:74. >H61320_T5 # TY Transcript # LN 4858 # Source Gene: H61320 # 
Encoded protein: H61320_P1 

acgtcagtccccgcgcttttcggaggctgccagcgtcccacaccagccgcaggtgaaaac 
cggcagaaagacattaagagattttcctgcagtcactgctggcagatgatagagccagga 
tttgaaagcaggcagcctggctccagaccctgtgctcttaactcccgttttgcatcaaga 
acagaatcctatgaaaggcttgtacagtgcttggtactgagtaggcactccgtcagtatt 
ttagtagctattattccaaaggacactgtattctaggtactaagacaatgtgaatggaat 
ggaaaacaattcctccctgcagaagccaactgtctaggaagacagataatgacattatta 
aaataatcagaacattggcaaaataaagaagattgaaatgcagttcatatgacagtgcta 
attaatcatactttttagatctaggaaaggccataaaggttaaattgacttggaaatggt 
aaacaagacaatggttgtgtcactgatcggtgatggccagtttttggttatccttatttg 
ccatttaaatgattgaattgaaaggaagaagtaagacttggaagatgaatattgaaggaa 
aggtgaagatggaaaagagaagggggtttattctgagaacagtgactagaactctgaggt 
atggtagaaatttcaggataacagatgataaaggggaatgaagccatggagaggaagaga 
atgataatattaattgtgaaagtttgtgtcaagtgttttctgagtcacctttctttcagt 
ttctcatttctagtattgtataaatgagtttaacatcaaaatggtatagtgaaaaaaagg 
atgaactgcttgacagaagaggtcagatctagttttgactctagcttatttagctttaca 
gctttaaataaattacagtgtctttaaagctctggatcttaagttttctaattgtttttt 
aaacaaagatttgtgaacacctacttacaatgtgacaaatgtctttgtaaggcattatgg 
gagataaaagctgcattaaacctggctgttgtacttacaagtaatttggaaggcaaaatt 
tcaaatacttgaacctaaagtagggtttcttggttttcatttaattgaaatactgttagg 
aaacaaaaatttttcataaatttttcttgtggatatgtaagtaaattctatatatctagc 
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Figur 1 (Cont'd) 



aatatgatacatctatagttctgtttaaaggtaacaatgtgaggaagcttatgcgaataa 
cttgacagaagaaatctaaagtaaaagcattttcctagtaggctagcttgttctcttatt 
tcatttattgtaacagtttcactttttaataattatgtgaaattaggaaacagatattct 
agggcattctgtttagggtgatgaacttattcatgaatgttattaaaaccatttaaatat 
gccatagtttaaatttgtgatccatgctgttcaatctgtgacctaggatagcagcatcaa 
ggagcattgtgtacatgcagaagtgcacagtacctggagtgaaactgcttgtgttcgatt 
tctgataccattcataactggctgtgtgatctcaaataagaaaactgaagaaccagctgg 
gcgcggtagctcacgcctgtaatcccagcactttgggaggctgaggcgggtggattacct 
gagaacctctaaaatgcagacctccagctctagatctgtgcacctgagtgaatggcagaa 
gaattacttcgcaattacatctggcatatgtaccggaccgaaggcagatgcataccgtgc 
acagatattacgcattcagtatgcatgggcaaactctgagatttcccaggtctgtgctac 
caaactgttcaaaaaatatgcagagaaatattctgcaattattgattctgacaatgttga 
atctgggttgaataattatgcagaaaacattttaactttggcaggatctcaacaaacaga 
tagtgacaagtggcagtctggattgtcaataaataatgttttcaaaatgagtagtgtaca 
gaagatgatgcaagctggcaaaaaattcaaagactctctgttggaacctgctcttgcatc 
agtggtaatccataaggaggccactgtctttgatcttcctaaatttagtgtttgtggtag 
ttctcaagagagtgactcattacctaactcagctcatgatcgagaccggacccaagactt 
cccggagagcaatcgtttgaaactccttcagaatgcccagccacctatggtgactaacac 
tgctaggacttgtcctacattctcagcacctgtaggtgagtcagctactgcaaaattcca 
tgtcacaccattgtttggaaatgtcaaaaaggaaaatcacagctctgcaaaagaaaacat 
aggacttaatgtgttcttatctaaccagtcttgttttcctgctgcctgtgaaaatccaca 
gaggaagtctttttatggttctggcaccattgatgcactttccaatccaatactgaataa 
ggcttgtagtaaaacagaagataatggcccaaaggaggatagcagcctgcctacatttaa 
aactgcaaaagaacaattatgggtagatcagcaaaaaaagtaccaccaacctcagcgtgc 
atcagggtcttcatatggtggtgtaaaaaagtctctaggagctagtagatcccgagggat 
acttggaaagtttgttcctcctatacccaagcaagatgggggagagcagaatggaggaat 
gcaatgtaagccttatggggcaggacctacagaaccagcacatccagttgatgagcgtct 
gaagaacttggagccaaagatgattgaacttattatgaatgagattatggatcatggacc 
tccagtaaattgggaagatattgcaggagtagaatttgctaaagccaccataaaggaaat 
agttgtgtggcccatgttgaggccagacatctttactggtttaaggggaccccctaaagg 
aattttgctctttggtcctcctgggactggtaaaactctaattggcaagtgcattgctag 
tcagtctggggcaacattctttagcatctctgcttcatccttaacttctaaatgggtagg 
tgagggggagaaaatggtccgtgcattgtttgctgttgcaaggtgtcagcaaccagctgt 
gatatttattgacgaaattgattccttgttatctcaacggggagatggtgagcatgaatc 
ttctagaaggataaaaacagaatttttagttcaattagatggagcaacaacatcttctga 
agatcgtatcctagtggtgggagcaacaaatcggccacaagaaattgatgaggctgcccg 
gagaagattggtgaaaaggctttatattcccctcccagaagcttcagccaggaaacagat 
agtaattaatctaatgtccaaagagcagtgttgcctcagtgaagaagaaattgaacagat 
tgtacagcagtctgatgcgttttcaggagcagacatgacacagctttgcagggaggcttc 
tcttggtcctattcgcagtttacaaactgctgacattgctaccataacaccggatcaagt 
tcgacccatagcttacattgattttgaaaatgcttttagaactgtgcgacctagtgtttc 
tccaaaagatttagagctttatgaaaactggaacaaaacttttggttgtggaaagtaagt 
gggatacttggaatcaaggcatctctgtattacagtcttctttattttttagcatagaaa 
gttggggatgtgttaattgtatttttaagaatatattctaaattctgtacttcaaataat 
agcacagattttacatctgattgacatagtgtatgttaatgtaagttttgctttccagtg 
attacctgatacgtaagcctatttgaacaaagtgagaatgaacttttgtttctaagaagt 
ctttatcttgaagctatataacatgaaaagtgagctcaaattttttttagttgaagatta 
catataaagttgtgtctgattaatattcatcttttattgaagaaagtgccttctgatggc 
cacataattcttaatgtcagctagtataatggtttacatttggacaaagtattgcttagt 
gttatttaagtagatttaagatctcaaagctaaagtgccaatttttactttcttcagcca 
atttgttacctcttttatggtttttaatttttatcaggactaacattttcagaaatagca 
aggtgtgatctagtattagattcacagaactgaaaggttaattaagaatgttggttattt 
tcacgggagcgtgtgatatttaacattaatattttatttgacactacagcctgtaatact 
gctctctttcaaaagcaagttttcagatttttataacagacccattttgttttatgaaac 
atgttcattacagaaacatgagaaaatagagataagtaaaaagaataaaagtcactatta 
tcctaccactttgcgggacacactattaatatttaagcatagattcttccagatgttttg 
ttttatacagcagtaggattgtgtaatgtattctctcttgtagcctattcttcaaagcgt 
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Figure 1 (Conf d) 



atcttcactttccatgttaacataaatgtccttttatgattgtctaggattccatggatg 
gatggaatataattgagtcatctggtagagcgcatataggacaggacgctaggttgttta 
tcttttgtttattgtaaaggaactatgtgggatgcatttggtgatctttttaaaaaatgt 
ataaatttcttgagtaattcctggattaaagaatatgcactttttaaagcttaaaaat 

SEQ ID NO: 75. >H61320_T6 # TY Transcript 8 LN 3745 # Source Gene: H61320 # 
Encoded protein: H61320_P1 

acgtcagtccccgcgcttttcggaggctgccagcgtcccacaccagccgcaggtgaaaac 
cggcagaaagacattaagagattttcctgcagtcactgctggcagatgatagagccagga 
tttgaaagcaggcagcctggctccagaccctgtgctcttaactcccgttttgcatcaaga 
acagaatcctatgaaaggcttgtacagtgcttggtactgagtaggcactccgtcagtatt 
ttagtagctattattccaaaggacactgtattctaggtactaagacaatgtgaatggaat 
ggaaaacaattcctccctgcagaagccaactgtctaggaagacagataatgacattatta 
aaataatcagaacattggcaaaataaagaagattgaaatgcagttcatatgacagtgcta 
attaatcatactttttagatctaggaaaggccataaaggttaaattgacttggaaatggt 
aaacaagacaatggttgtgtcactgatcggtgatggccagtttttggttatccttatttg 
ccatttaaatgattgaattgaaaggaagaagtaagacttggaagatgaatattgaaggaa 
aggtgaagatggaaaagagaagggggtttattctgagaacagtgactagaactctgaggt 
atggtagaaatttcaggataacagatgataaaggggaatgaagccatggagaggaagaga 
atgataatattaattgtgaaagtttgtgtcaagtgttttctgagtcacctttctttcagt 
ttctcatttctagtattgtataaatgagtttaacatcaaaatggtatagtgaaaaaaagg 
atgaactgcttgacagaagaggtcagatctagttttgactctagcttatttagctttaca 
gctttaaataaattacagtgtctttaaagctctggatcttaagttttctaattgtttttt 
aaacaaagatttgtgaacacctacttacaatgtgacaaatgtctttgtaaggcattatgg 
gagataaaagctgcattaaacctggctgttgtacttacaagtaatttggaaggcaaaatt 
tcaaatacttgaacctaaagtagggtttcttggttttcatttaattgaaatactgttagg 
aaacaaaaatttttcataaatttttcttgtggatatgtaagtaaattctatatatctagc 
aatatgatacatctatagttctgtttaaaggtaacaatgtgaggaagcttatgcgaataa 
cttgacagaagaaatctaaagtaaaagcattttcctagtaggctagcttgttctcttatt 
tcatttattgtaacagtttcactttttaataattatgtgaaattaggaaacagatattct 
agggcattctgtttagggtgatgaacttattcatgaatgttattaaaaccatttaaatat 
gccatagtttaaatttgtgatccatgctgttcaatctgtgacctaggatagcagcatcaa 
ggagcattgtgtacatgcagaagtgcacagtacctggagtgaaactgcttgtgttcgatt 
tctgataccattcataactggctgtgtgatctcaaaacctctaaaatgcagacctccagc 
tctagatctgtgcacctgagtgaatggcagaagaattacttcgcaattacatctggcata 
tgtaccggaccgaaggcagatgcataccgtgcacagatattacgcattcagtatgcatgg 
gcaaactctgagatttcccaggtctgtgctaccaaactgttcaaaaaatatgcagagaaa 
tattctgcaattattgattctgacaatgttgaatctgggttgaataattatgcagaaaac 
attttaactttggcaggatctcaacaaacagatagtgacaagtggcagtctggattgtca 
ataaataatgttttcaaaatgagtagtgtacagaagatgatgcaagctggcaaaaaattc 
aaagactctctgttggaacctgctcttgcatcagtggtaatccataaggaggccactgtc 
tttgatcttcctaaatttagtgtttgtggtagttctcaagagagtgactcattacctaac 
tcagctcatgatcgagaccggacccaagacttcccggagagcaatcgtttgaaactcctt 
cagaatgcccagccacctatggtgactaacactgctaggacttgtcctacattctcagca 
cctgtaggtgagtcagctactgcaaaattccatgtcacaccattgtttggaaatgtcaaa 
aaggaaaatcacagctctgcaaaagaaaacataggacttaatgtgttcttatctaaccag 
tcttgttttcctgctgcctgtgaaaatccacagaggaagtctttttatggttctggcacc 
attgatgcactttccaatccaatactgaataaggcttgtagtaaaacagaagataatggc 
ccaaaggaggatagcagcctgcctacatttaaaactgcaaaagaacaattatgggtagat 
cagcaaaaaaagtaccaccaacctcagcgtgcatcagggtcttcatatggtggtgtaaaa 
aagtctctaggagctagtagatcccgagggatacttggaaagtttgttcctcctataccc 
aagcaagatgggggagagcagaatggaggaatgcaatgtaagccttatggggcaggacct 
acagaaccagcacatccagttgatgagcgtctgaagaacttggagccaaagatgattgaa 
cttattatgaatgagattatggatcatggacctccagtaaattgggaagatattgcagga 
gtagaatttgctaaagccaccataaaggaaatagttgtgtggcccatgttgaggccagac 
atctttactggtttaaggggaccccctaaaggaattttgctctttggtcctcctgggact 
ggtaaaactctaattggcaagtgcattgctagtcagtctggggcaacattctttagcatc 
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Figure 1 (Cont'd) 



tctgcttcatccttaacttctaaatgggtaggtgagggggagaaaatggtccgtgcattg 
tttgctgttgcaaggtgtcagcaaccagctgtgatatttattgacgaaattgattccttg 
ttatctcaacggggagatggtgagcatgaatcttctagaaggataaaaacagaattttta 
gttcaattagatggagcaacaacatcttctgaagatcgtatcctagtggtgggagcaaca 
aatcggccacaagaaattgatgaggctgcccggagaagattggtgaaaaggctttatatt 
cccctcccagaagcttcagccaggaaacagatagtaattaatctaatgtccaaagagcag 
tgttgcctcagtgaagaagaaattgaacagattgtacagcagtctgatgcgttttcagga 
gcagacatgacacagctttgcagggaggcttctcttggtcctattcgcagtttacaaact 
gctgacattgctaccataacaccggatcaagttcgacccatagcttacattgattttgaa 
aatgcttttagaactgtgcgacctagtgtttctccaaaagatttagagctttatgaaaac 
tggaacaaaacttttggttgtggaaagtaagtgggatacttggaatcaaggcatctctgt 
attacagtcttctttattttttagcatagaaagttggggatgtgttaattgtatttttaa 
gaatatattctaaattctgtacttc 

SEQ ID NO:76. >H61320_P3 # TY Protein # CC #LN 674 # Source Gene: H61320 

# Encoding Transcript: 1 

MQTSSSRSVHLSEWQKNYFAITSGICTGPKADAYRAQILRIQYAWANSEISQVCATKLFK 
KYAEKYSAIIDSDNVESGLNNYAENILTLAGSQQTDSDKWQSGLSINNVFECMSSVQECMMQ 
AGKKFKDSLLEPALASWIHKEATVFDLPKFSVCGSSQESDSLPNSAHDRDRTQDFPESN 
RLKLLQNAQPPMVTNTARTCPTFSAPVGESATAKFHVTPLFGNVKKENHSSAKENIGLNV 
FLSNQSCFPAACENPQRKSFYGSGTIDALSNPILNKACSKTEDNGPKEDSSLPTFKTAKE 
QLWVDQQKKYHQPQRASGSSYGGVKKSLGASRSRGILGKFVPPIPKQDGGEQNGGMQCKP 
YGAGPTEPAHPVDERLKNLEPKMIELIMNEIMDHGPPVNWEDIAGVEFAKATIKEIWWP 
MLRPDIFTGLRGPPKGILLFGPPGTGKTLIGKCIASQSGATFFSISASSLTSKWVGEGEK 
MVRALFAVARCQQPAVIFIDEIDSLLSQRGDGEHESSRRIKTEFLVQLDGATTSSEDRIL 
WGATNRPQEIDEAARRRLVKRLYIPLPEASARKQIVINLMSKEQCCLSEEEIEQIVQQS 
DAFSGADMTQLCRGASLGPIRSLQTADIATITPDQVRPIAYIDFENAFRTVRPSVSPKDL 
ELYENWNKT FGCGK 

SEQ ID NO: 77. >H61320_P1 # TY Protein # CC #LN 674 # Source Gene: H61320 

# Encoding Transcript: 2 

MQTSSSRSVHLSEWQKNYFAITSGICTGPKADAYRAQILRIQYAWANSEISQVCATKLFK 
KYAEKYSAIIDSDNVESGLNNYAENILTLAGSQQTDSDKWQSGLSINNVFKMSSVQKMMQ 
AGKKFKDSLLEPALASWIHKEATVFDLPKFSVCGSSQESDSLPNSAHDRDRTQDFPESN 
RLKLLQNAQPPMVTNTARTCPTFSAPVGESATAKFHVTPLFGNVKKENHSSAKENIGLNV 
FLSNQSCFPAACENPQRKSFYGSGTIDALSNPILNKACSKTEDNGPKEDSSLPTFKTAKE 
QLWVDQQKKYHQPQRASGSSYGGVKKSLGASRSRGILGKFVPPIPKQDGGEQNGGMQCKP 
YGAGPTEPAHPVDERLKNLEPKMIELIMNEIMDHGPPVNWEDIAGVEFAKATIKEIVVWP 
MLRPDIFTGLRGPPKGILLFGPPGTGKTLIGKCIASQSGATFFSISASSLTSKWVGEGEK 
MVRALFAVARCQQPAVIFIDEIDSLLSQRGDGEHESSRRIKTEFLVQLDGATTSSEDRIL 
WGATNRPQEIDEAARRRLVECRLYIPLPEASARKQIVINLMSKEQCCLSEEEIEQIVQQS 
DAFSGADMTQLCREASLGPI RSLQTADI AT I T PDQVRP I AYI DFENAFRTVRPSVS PKDL 
ELYENWNKT FGCGK 

SEQ ID NO: 78. >H61320_P2 # TY Protein # CC #LN 563 # Source Gene: H61320 

# Encoding Transcript : 3 

MSSVQKMMQAGKKFKDSLLEPALASWIHKEATVFDLPKFSVCGSSQESDSLPNSAHDRD 
RTQDFPESNRLKLLQNAQPPMVTNTARTCPTFSAPVGESATAKFHVTPLFGNVKKENHSS 
AKENIGLNVFLSNQSCFPAACENPQRKSFYGSGTIDALSNPILNKACSKTEDNGPKEDSS 
LPTFKTAKEQLWVDQQKKYHQPQRASGSSYGGVKKSLGASRSRGILGKFVPPIPKQDGGE 
QNGGMQCKPYGAGPTEPAHPVDERLKNLEPKMIELIMNEIMDHGPPVNWEDIAGVEFAKA 
TIKEIWWPMLRPDIFTGLRGPPKGILLFGPPGTGKTLIGKCIASQSGATFFSISASSLT 
SKWVGEGEKMVRALFAVARCQQPAVIFIDEIDSLLSQRGDGEHESSRRIKTEFLVQLDGA 
TTSSEDRILWGATNRPQEIDEAARRRLVKRLYIPLPEASARKQIVINLMSKEQCCLSEE 
E I EQIVQQS DAFSGADMTQLCREASLGPI RSLQTADI AT IT PDQVRP I AY I DFENAFRTV 
RPSVSPKDLELYENWNKT FGCGK 
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Figur 1 (Cont'd) 



SEQ ID NO: 79. >HUMAAPA # TY Consensus # Length 6173 # Number of exons 28 

gaattcccgcgcagaggccgtgcgcaggcgcgcggggcaagaggctggcagtgcgcctgc 

gccgcgtcggcgtgcggaacgccgcggtgtctcggcgcctctgcgcgcgggaagatggcg 

gaacaggctaccaagtccgtgctgtttgtgtgtctgggtaagattttttactgctgtgga 

aactacagtctctgtgggaaaaaagttgtatttcattttaaaataagctggttgcaaccc 

acttcattttcgtgattcactcctggatcattagtagcaactctttaaacaataattttt 

gttgaaattgtttttctgatatctttgtcatctgtacttggagtacttggctggttggtg 

tcaaatcactgaacttgagcctgataggctgttctccagcagggtctaccctcactggtt 

ggacaaaggcgtcaaaggatagtgttggcctttgtttttcctgagggatagcacagcccc 

cgtgtgtcaggtgtgtaaagaaggggagctggcatgtgcccttccatccaacctaaccct 

gtttccccacccctccctttttttaaaggaattcccccgcgcgcagagaggccgcgagac 

gtaacatttgtcgatcacccattgcagaagcagttttcaggaaacttgtaaccgatcaaa 

acatctcagagaatgtaagtaccattcattatcttaaagaggccaacctgaactcctctg 

ggcaggaaattgctggagggtagacagcgcggcaacttccgggtatgagatagggaaccc 

ccctgactaccgagggcagagctgcatgaagaggcacggcattcccatgagccacgttgc 

ccggcagtgggtcattgacagcggtgctgtttctgactggaacgtgggccggtccccaga 

cccaagagctgtgagctgcctaagaaatcatggcattcacacagcccataaagcaagaca 

ggtagacaagctcttgttcaatttctaatatatagagtccagtaacttgagaagtagcga 

aaggattaaccagacttgtatattaatgaatgtgtttatttagggtgagcttaaccagct 

atggtgtgtccattttgtttcacttctggttgcacggtgttgaaagacttgcctgacttt 

ggaatttacttattaaaatgcacataaaagctaggtaatttataatgagagagcctgact 

gtgagctggggctgagcggtgctctgtcttctgttccttcctgcataatttttattaaac 

atttaggccatagtaatcatcctgctgatattgcaagtttgttgctagaatgaggttata 

taatatatacaaaaacattttttcaactgtaaagtgccttagtaatatagggtaatacca 

gcaacattatggatatataattatagtctattgggccacacttaagtttggagtctaata 

aagtcacaatcaaattctgcaatttcaattgaagataaccttgtctttatattatgaatt 

agaagctaaagttgatttttctaagagttctttatttaaatgaagtactctgggactgac 

cttttcggaaatggaatcttcattggtcaggtgattcaacatttttatacaatttatcca 

tcctcatctcttcaggatttgcataccttgccagtttctactggccattgttgaaaatac 

atttatttggagaagtccaaagccaaggggctcatggggctgtgaggtccttcttgctgc 

atcgtcctgtggtagaaggtggaggagtcaagagagtgccccagagtgagtgagagcgag 

aactagaaaaacgggaagagggaaacagaggagagagagagagaggacccatcagtgtca 

ggagcccactcccaagatagtggcattaataaagatcctgcgtctcactattgttgcatt 

ggggatgaagtttccaacaaaggaactttgggggacacatccaaaccataacataggatt 

taaataattttacagagttcaagagttctgctactgaaccgtttgagatccctgttctga 

ggtctcatcactttccagttttagcaggaagagaagtggcaagtggcaggagtctgcaga 

ttggggcctgcacctttttttgaggcaccttttttatgaacaataatgaaggggattaaa 

tatgtgacattttagtatgttgactgtattatacactgctactaaggaattgaagccgat 

gtataaacattgtgtctacacattatactattttatgattattgtatggaactttataat 

acaaaatttctttgcagtacaattttgagaaataggttaactctattttaacttaaaagt 

accctaaaaattatttaattgtttattagtgagtaacaggctcaaatacagcagtttttt 

ttttcttttagtattgctttgcatcctctaggcttgaatggtataaacactgtgttttga 

cttcttattcaattttagattaccaaagaagattttgccacatttgattatatactatgt 

atggatgaaagcaatctgaggtaatcctgtttttgaagaatatttctgttcaactctcag 

ttcagcagtgggccaagtaatttgttgtccagatttactttttctattttaaaggtttta 

atagtcagtgatggtcaccatatgtagaattattttatttagaacagcaaaaaacttagt 

aatctagaattgtctcttaggtatttacaaggaaatacaccttagaaaagggaaagtcta 

gttgttaatagcatgatttaattggaaaactagcttggctgttatagttttgtatcaagc 

atattgttctgcatgagaacgattttgatatttttgctgtaatgattcctgggcagggca 

gtgttttattgattctcaggtcaaagctctgcaaccataggttcctattattatcccatc 

ttgtaaaagaaggaagtgagaaacggtaatgttattttttaatttttatttttcagtcct 

tattctgttctcattgtttttgctgacaccatagcccataggcatcgtaaaaacaccttt 
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ttaggagaccccatttgctttgactggcactgagcaccgtgtttctttgcgtgtggggca 
agtggagttcctgcagcctcacatttgcacgtgctgttcctctgcctggaagctcctcct 
tgcttcttgggtcattcggctcctatttgcccctcgtgggtcagcttaaattcatttctt 
gatagaggtattctgctgccctgtcattaggccagattgtgcattagacttaggcctgcc 
tgcagttctcattcctgcacctaatcctgagctgtaaaacttcgtgcatgggagctcttg 
gctgttccgttcccacataaagcacgtacctgactcatcaactcatctgtcgcaggtgct 
cagtcgaaatttgttaaataaatggaggcgcttagttaagtttgcctttttcttttgtag 
caatgtgaaagctaagggtggaagtctagagtgaagctgagttttcagcttgggcagagg 
cttaaggaataaaagatggaagagtaattaaggagtgggtcgtgctgtgggaagtgctgt 
gtcaagacacctgaggatggaattggtagccctctgttttgggactggttcagatggtgt 
tttgggttcctttctatgacctttatctctgaaacttgattgtcttaaaatgtattttgt 
gggagaaaatataattattgtatattttgtgtaacagaatcagtgagaataagctgctgt 
cgcaaactgtcttgcctagagagagggcagtggcatacagctgcagaaggggccacacgg 
gcctgctgagtgttccgtttcatttcaaattaggtcattctgtcttgattttgatatgga 
tgtttcagaagaccctagcagatgtccctgtttaacttgaaaccatagatcagaaaacta 
agttcatatttcaattttacagagatttgaatagaaaaagtaatcaagttaaaacctgca 
aagctaaaattgaactacttgggagctatgatccacaaaaacaacttattattgaagatc 
cctattatgtaagtacagttcacgttttagggctaatatgaagacccaacacatttgtat 
cctgccatattaaataacagatgagattgtgttaaggatgtttttgttatgcaggttttg 
ccattttcttctttttcctgtccatttaggggaatgactctgactttgagacggtgtacc 
agcagtgtgtcaggtgctgcagagcgttcttggagaaggcccactgaggcaggttcgtgc 
cctgctgcggccagcctgactagaccccaccctgaggtcctgcatttctcagtcggtgtg 
taatcacgttccagggcccaaagcccagctctttgttcagttgacttactgtttcttacc 
ttaaaaagtaattgtagatggaaatcagttgtgtttggcaggagaatcaataaaaatctt 
tgattcagacagcttatggggtattttaagcattcttagactagttgaacatctcacttt 
gccccagttacaaaaatagtagaacaagcaacataaaacaatgaaggaaaacctcacttg 
aaggcccaggtcaacatctaagcctgttgagacttagataatcgagtctacctcttcagt 
aggtttgtgtggatggcctggagggcaggtgccctctgctccccagtgctacctctctct 
tccctagggccttttgtggattgacagtagtcccctccgtaggagctcacagtctagatt 
agaagtgttttaatttctacacacccatagtgcacacttgtatattgaaaagatagggaa 
gagagaaacatttatggaatcagtcgttggcaccttcaatacttcatgatttttgtcgag 
tttacttcatgaggaggtcagcccattggctcccatctgaaccactttgcctctgaaact 
taattacatccagaaagaaggacacttgtatgctagtctatggtcagttgaggaatatga 
ctgtttttatatgcacatgtaacccaaatgtccaatataaattggcttattttttaaaat 
aattttaaaagttgggaaaagtgttattatttggcatgcttaaatattgaataagtattc 
ttcatcagcatttaataaatgtataggcagatgtaaggtaatttctgtgtattttgagat 
aatgtcaaaatcatgaatatttcaaaataaactggggagttataaaaatacaactagaga 
tataaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaccaa 
cccaaaaaaaaaaaaaaaaaaaaaaaannnnnttgggggcgccccccccgccccccaaaa 
aaacattttttaaacaacccttgttggcgcgccggccccccccaaaaaggaaaaaaagaa 
acaggtttccttttttctttccacaaaaaaacacccgcggacaaaaaaaactcggtttgt 
tctcccccccccaaacaaaaaaaaaccccccggggggggcccaaccattcttttataaac 
aaacaacccgggggggggccccctcgaaggagtctttattggcgggcttctcacccgagg 
ggagcccccgcacccgccttttctatttatatatattatcacccccccccccccccctgt 
cttcttttcgagaggggggggggggtgcattttaaataaaaacctcctccgccggagaag 
ggggttgaacccccccctatacaacgggggtggagaggagaataaacgcaccccctcccc 
cggcgcgaggacaccccccccccctcggcgccccccgcggaaaaaaaacacaaccccccc 
ccgcgggaaacacaaaaggagctcaaaattgtgacagcaacttaacaggaggaaacccct 
tgaggcaggtaactcaagcgtgaataaaaacagtaccctcagaggtgggagccgagggga 
gcctgggtccaggacactcttccaggcttgaggaaacccggggcagataaggtttccacc 
cgccttgagagaagatcgccgccactggataagcgcgccccaaaagggtgact 

SEQ ID NO: 80. >HUMAAPA_T1 # TY Transcript # LN 1550 # Source Gene: HUMAAPA 
# Encoded protein: H(JMAAPA_P1 

tgcgcaggcgcgcggggcaagaggctggcagtgcgcctgcgccgcgtcggcgtgcggaac 
gccgcggtgtctcggcgcctctgcgcgcgggaagatggcggaacaggctaccaagtccgt 
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gctgtttgtgtgtctgggtaacatttgtcgatcacccattgcagaagcagttttcaggaa 
acttgtaaccgatcaaaacatctcagagaattggagggtagacagcgcggcaacttccgg 
gtatgagatagggaacccccctgactaccgagggcagagctgcatgaagaggcacggcat 
tcccatgagccacgttgcccggcagattaccaaagaagattttgccacatttgattatat 
actatgtatggatgaaagcaatctgagagatttgaatagaaaaagtaatcaagttaaaac 
ctgcaaagctaaaattgaactacttgggagctatgatccacaaaaacaacttattattga 
agatccctattatgggaatgactctgactttgagacggtgtaccagcagtgtgtcaggtg 
ctgcagagcgttcttggagaaggcccactgaggcaggttcgtgccctgctgcggccagcc 
tgactagaccccaccctgaggtcctgcatttctcagtcggtgtgtaatcacgttccaggg 
cccaaagcccagctctttgttcagttgacttactgtttcttaccttaaaaagtaattgta 
gatggaaatcagttgtgtttggcaggagaatcaataaaaatctttgattcagacagctta 
tggggtattttaagcattcttagactagttgaacatctcactttgccccagttacaaaaa 
tagtagaacaagcaacataaaacaatgaaggaaaacctcacttgaaggcccaggtcaaca 
tctaagcctgttgagacttagataatcgagtctacctcttcagtaggtttgtgtggatgg 
cctggagggcaggtgccctctgctccccagtgctacctctctcttccctagggccttttg 
tggattgacagtagtcccctccgtaggagctcacagtctagattagaagtgttttaattt 
ctacacacccatagtgcacacttgtatattgaaaagatagggaagagagaaacatttatg 
gaatcagtcgttggcaccttcaatacttcatgatttttgtcgagtttacttcatgaggag 
gtcagcccattggctcccatctgaaccactttgcctctgaaacttaattacatccagaaa 
gaaggacacttgtatgctagtctatggtcagttgaggaatatgactgtttttatatgcac 
atgtaacccaaatgtccaatataaattggcttattttttaaaataattttaaaagttggg 
aaaagtgttattatttggcatgcttaaatattgaataagtattcttcatcagcatttaat 
aaatgtataggcagatgtaaggtaatttctgtgtattttgagataatgtcaaaatcatga 
atatttcaaaataaactggggagttataaaaatacaactagagatataaa 

SEQ ID NO: 81. >HUMAAPA_T2 # TY Transcript # LN 1550 # Source Gene: HUMAAPA 

# Encoded protein: HUMAAPA JP2 

tgcgcaggcgcgcggggcaagaggctggcagtgcgcctgcgccgcgtcggcgtgcggaac 
gccgcggtgtctcggcgcctctgcgcgcgggaagatggcggaacaggctaccaagtccgt 
gctgtttgtgtgtctgggtaacatttgtcgatcacccattgcagaagcagttttcaggaa 
acttgtaaccgatcaaaacatctcagagaattgggtcattgacagcggtgctgtttctga 
ctggaacgtgggccggtccccagacccaagagctgtgagctgcctaagaaatcatggcat 
tcacacagcccataaagcaagacagattaccaaagaagattttgccacatttgattatat 
actatgtatggatgaaagcaatctgagagatttgaatagaaaaagtaatcaagttaaaac 
ctgcaaagctaaaattgaactacttgggagctatgatccacaaaaacaacttattattga 
agatccctattatgggaatgactctgactttgagacggtgtaccagcagtgtgtcaggtg 
ctgcagagcgttcttggagaaggcccactgaggcaggttcgtgccctgctgcggccagcc 
tgactagaccccaccctgaggtcctgcatttctcagtcggtgtgtaatcacgttccaggg 
cccaaagcccagctctttgttcagttgacttactgtttcttaccttaaaaagtaattgta 
gatggaaatcagttgtgtttggcaggagaatcaataaaaatctttgattcagacagctta 
tggggtattttaagcattcttagactagttgaacatctcactttgccccagttacaaaaa 
tagtagaacaagcaacataaaacaatgaaggaaaacctcacttgaaggcccaggtcaaca 
tctaagcctgttgagacttagataatcgagtctacctcttcagtaggtttgtgtggatgg 
cctggagggcaggtgccctctgctccccagtgctacctctctcttccctagggccttttg 
tggattgacagtagtcccctccgtaggagctcacagtctagattagaagtgttttaattt 
ctacacacccatagtgcacacttgtatattgaaaagatagggaagagagaaacatttatg 
gaatcagtcgttggcaccttcaatacttcatgatttttgtcgagtttacttcatgaggag 
gtcagcccattggctcccatctgaaccactttgcctctgaaacttaattacatccagaaa 
gaaggacacttgtatgctagtctatggtcagttgaggaatatgactgtttttatatgcac 
atgtaacccaaatgtccaatataaattggcttattttttaaaataattttaaaagttggg 
aaaagtgttattatttggcatgcttaaatattgaataagtattcttcatcagcatttaat 
aaatgtataggcagatgtaaggtaatttctgtgtattttgagataatgtcaaaatcatga 
atatttcaaaataaactggggagttataaaaatacaactagagatataaa 

SEQ ID NO: 82. >HUMAAPA_T3 # TY Transcript # LN 1459 # Source Gene: HUMAAPA 

# Encoded protein: HUMAAPA_P3 

gaattcccgcgcagaggccgcaagtccgtgctgtttgtgtgtctgggtaacatttgtcga 
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tcacccattgcagaagcagttttcaggaaacttgtaaccgatcaaaacatctcagagaat 
tgggtcattgacagcggtgctgtttctgactggaacgtgggccggtccccagacccaaga 
gctgtgagctgcctaagaaatcatggcattcacacagcccataaagcaagacagattacc 
aaagaagattttgccacatttgattatatactatgtatggatgaaagcaatctgagagat 
ttgaatagaaaaagtaatcaagttaaaacctgcaaagctaaaattgaactacttgggagc 
tatgatccacaaaaacaacttattattgaagatccctattatgggaatgactctgacttt 
gagacggtgtaccagcagtgtgtcaggtgctgcagagcgttcttggagaaggcccactga 
ggcaggttcgtgccctgctgcggccagcctgactagaccccaccctgaggtcctgcattt 
ctcagtcggtgtgtaatcacgttccagggcccaaagcccagctctttgttcagttgactt 
actgtttcttaccttaaaaagtaattgtagatggaaatcagttgtgtttggcaggagaat 
caataaaaatctttgattcagacagcttatggggtattttaagcattcttagactagttg 
aacatctcactttgccccagttacaaaaatagtagaacaagcaacataaaacaatgaagg 
aaaacctcacttgaaggcccaggtcaacatctaagcctgttgagacttagataatcgagt 
ctacctcttcagtaggtttgtgtggatggcctggagggcaggtgccctctgctccccagt 
gctacctctctcttccctagggccttttgtggattgacagtagtcccctccgtaggagct 
cacagtctagattagaagtgttttaatttctacacacccatagtgcacacttgtatattg 
aaaagatagggaagagagaaacatttatggaatcagtcgttggcaccttcaatacttcat 
gatttttgtcgagtttacttcatgaggaggtcagcccattggctcccatctgaaccactt 
tgcctctgaaacttaattacatccagaaagaaggacacttgtatgctagtctatggtcag 
ttgaggaatatgactgtttttatatgcacatgtaacccaaatgtccaatataaattggct 
tattttttaaaataattttaaaagttgggaaaagtgttattatttggcatgcttaaatat 
tgaataagtattcttcatcagcatttaataaatgtataggcagatgtaaggtaatttctg 
tgtattttgagataatgtcaaaatcatgaatatttcaaaataaactggggagttataaaa 
atacaactagagatataaa 

SEQ ID NO: 83. >HUMAAPA_T4 # TY Transcript # LN 1427 # Source Gene: HUMAAPA 

# Encoded protein: HUMAAPA_P4 

gaattcccccgcgcgcagagaggccgcgagacccattgcagaagcagttttcaggaaact 
tgtaaccgatcaaaacatctcagagaattggagggtagacagcgcggcaacttccgggta 
tgagatagggaacccccctgactaccgagggcagagctgcatgaagaggcacggcattcc 
catgagccacgttgcccggcagattaccaaagaagattttgccacatttgattatatact 
atgtatggatgaaagcaatctgagagatttgaatagaaaaagtaatcaagttaaaacctg 
caaagctaaaattgaactacttgggagctatgatccacaaaaacaacttattattgaaga 
tccctattatgggaatgactctgactttgagacggtgtaccagcagtgtgtcaggtgctg 
cagagcgttcttggagaaggcccactgaggcaggttcgtgccctgctgcggccagcctga 
ctagaccccaccctgaggtcctgcatttctcagtcggtgtgtaatcacgttccagggccc 
aaagcccagctctttgttcagttgacttactgtttcttaccttaaaaagtaattgtagat 
ggaaatcagttgtgtttggcaggagaatcaataaaaatctttgattcagacagcttatgg 
ggtattttaagcattcttagactagttgaacatctcactttgccccagttacaaaaatag 
tagaacaagcaacataaaacaatgaaggaaaacctcacttgaaggcccaggtcaacatct 
aagcctgttgagacttagataatcgagtctacctcttcagtaggtttgtgtggatggcct 
ggagggcaggtgccctctgctccccagtgctacctctctcttccctagggccttttgtgg 
attgacagtagtcccctccgtaggagctcacagtctagattagaagtgttttaatttcta 
cacacccatagtgcacacttgtatattgaaaagatagggaagagagaaacatttatggaa 
tcagtcgttggcaccttcaatacttcatgatttttgtcgagtttacttcatgaggaggtc 
agcccattggctcccatctgaaccactttgcctctgaaacttaattacatccagaaagaa 
ggacacttgtatgctagtctatggtcagttgaggaatatgactgtttttatatgcacatg 
taacccaaatgtccaatataaattggcttattttttaaaataattttaaaagttgggaaa 
agtgttattatttggcatgcttaaatattgaataagtattcttcatcagcatttaataaa 
tgtataggcagatgtaaggtaatttctgtgtattttgagataatgtcaaaatcatgaata 
tttcaaaataaactggggagttataaaaatacaactagagatataaa 

SEQ ID NO: 84. >HUMAAPA_T5 # TY Transcript # LN 1579 # Source Gene: HUMAAPA 

# Encoded protein: HUMAAPA_P11 

tgcgcaggcgcgcggggcaagaggctggcagtgcgcctgcgccgcgtcggcgtgcggaac 
gccgcggtgtctcggcgcctctgcgcgcgggaagatggcggaacaggctaccaagtccgt 
gctgtttgtgtgtctgggtaacatttgtcgatcacccattgcagaagcagttttcaggaa 
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acttgtaaccgatcaaaacatctcagagaattggagggtagacagcgcggcaacttccgg 
tgggtcattgacagcggtgctgtttctgactggaacgtgggccggtccccagacccaaga 
gctgtgagctgcctaagaaatcatggcattcacacagcccataaagcaagacagattacc 
aaagaagattttgccacatttgattatatactatgtatggatgaaagcaatctgagagat 
ttgaatagaaaaagtaatcaagttaaaacctgcaaagctaaaattgaactacttgggagc 
tatgatccacaaaaacaacttattattgaagatccctattatgggaatgactctgacttt 
gagacggtgtaccagcagtgtgtcaggtgctgcagagcgttcttggagaaggcccactga 
ggcaggttcgtgccctgctgcggccagcctgactagaccccaccctgaggtcctgcattt 
ctcagtcggtgtgtaatcacgttccagggcccaaagcccagctctttgttcagttgactt 
actgtttcttaccttaaaaagtaattgtagatggaaatcagttgtgtttggcaggagaat 
caataaaaatctttgattcagacagcttatggggtattttaagcattcttagactagttg 
aacatctcactttgccccagttacaaaaatagtagaacaagcaacataaaacaatgaagg 
aaaacctcacttgaaggcccaggtcaacatctaagcctgttgagacttagataatcgagt 
ctacctcttcagtaggtttgtgtggatggcctggagggcaggtgccctctgctccccagt 
gctacctctctcttccctagggccttttgtggattgacagtagtcccctccgtaggagct 
cacagtctagattagaagtgttttaatttctacacacccatagtgcacacttgtatattg 
aaaagatagggaagagagaaacatttatggaatcagtcgttggcaccttcaatacttcat 
gatttttgtcgagtttacttcatgaggaggtcagcccattggctcccatctgaaccactt 
tgcctctgaaacttaattacatccagaaagaaggacacttgtatgctagtctatggtcag 
ttgaggaatatgactgtttttatatgcacatgtaacccaaatgtccaatataaattggct 
tattttttaaaataattttaaaagttgggaaaagtgttattatttggcatgcttaaatat 
tgaataagtattcttcatcagcatttaataaatgtataggcagatgtaaggtaatttctg 
tgtattttgagataatgtcaaaatcatgaatatttcaaaataaactggggagttataaaa 
atacaactagagatataaa 

SEQ ID NO: 85. >HUMAAPA_T6 # TY Transcript # LN 1597 # Source Gene: HUMAAPA 

# Encoded protein: HUMAAPA_P6 

tgcgcaggcgcgcggggcaagaggctggcagtgcgcctgcgccgcgtcggcgtgcggaac 
gccgcggtgtctcggcgcctctgcgcgcgggaagatggcggaacaggctaccaagtccgt 
gctgtttgtgtgtctgggtaagattttttactgctgtggaaactacagtctctgtgggaa 
aaaagtaacatttgtcgatcacccattgcagaagcagttttcaggaaacttgtaaccgat 
caaaacatctcagagaattggagggtagacagcgcggcaacttccgggtatgagataggg 
aacccccctgactaccgagggcagagctgcatgaagaggcacggcattcccatgagccac 
gttgcccggcagattaccaaagaagattttgccacatttgattatatactatgtatggat 
gaaagcaatctgagagatttgaatagaaaaagtaatcaagttaaaacctgcaaagctaaa 
attgaactacttgggagctatgatccacaaaaacaacttattattgaagatccctattat 
gggaatgactctgactttgagacggtgtaccagcagtgtgtcaggtgctgcagagcgttc 
ttggagaaggcccactgaggcaggttcgtgccctgctgcggccagcctgactagacccca 
ccctgaggtcctgcatttctcagtcggtgtgtaatcacgttccagggcccaaagcccagc 
tctttgttcagttgacttactgtttcttaccttaaaaagtaattgtagatggaaatcagt 
tgtgtttggcaggagaatcaataaaaatctttgattcagacagcttatggggtattttaa 
gcattcttagactagttgaacatctcactttgccccagttacaaaaatagtagaacaagc 
aacataaaacaatgaaggaaaacctcacttgaaggcccaggtcaacatctaagcctgttg 
agacttagataatcgagtctacctcttcagtaggtttgtgtggatggcctggagggcagg 
tgccctctgctccccagtgctacctctctcttccctagggccttttgtggattgacagta 
gtcccctccgtaggagctcacagtctagattagaagtgttttaatttctacacacccata 
gtgcacacttgtatattgaaaagatagggaagagagaaacatttatggaatcagtcgttg 
gcaccttcaatacttcatgatttttgtcgagtttacttcatgaggaggtcagcccattgg 
ctcccatctgaaccactttgcctctgaaacttaattacatccagaaagaaggacacttgt 
atgctagtctatggtcagttgaggaatatgactgtttttatatgcacatgtaacccaaat 
gtccaatataaattggcttattttttaaaataattttaaaagttgggaaaagtgttatta 
tttggcatgcttaaatattgaataagtattcttcatcagcatttaataaatgtataggca 
gatgtaaggtaatttctgtgtattttgagataatgtcaaaatcatgaatatttcaaaata 
aactggggagttataaaaatacaactagagatataaa 

SEQ ID NO: 86 . >HUMAAPAJT7 # TY Transcript # LN 1562 # Source Gene: HUMAAPA 

# Encoded protein: 
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aataatgaaggggattaaatatgtgacattttagtatgttgactgtattatacactgcta 
ctaaggaattgaagccgatgtataaacattgtgtctacacattatactattttatgatta 
ttgtatggaactttataatacaaaatttctttgcagtacaattttgagaaataggttaac 
tctattttaacttaaaagtaccctaaaaattatttaattgtttattagtgagtaacaggc 
tcaaatacagcagttttttttttcttttagtattgctttgcatcctctaggcttgaatgg 
tataaacactgtgttttgacttcttattcaattttagattaccaaagaagattttgccac 
atttgattatatactatgtatggatgaaagcaatctgagagatttgaatagaaaaagtaa 
tcaagttaaaacctgcaaagctaaaattgaactacttgggagctatgatccacaaaaaca 
acttattattgaagatccctattatgggaatgactctgactttgagacggtgtaccagca 
gtgtgtcaggtgctgcagagcgttcttggagaaggcccactgaggcaggttcgtgccctg 
ctgcggccagcctgactagaccccaccctgaggtcctgcatttctcagtcggtgtgtaat 
cacgttccagggcccaaagcccagctctttgttcagttgacttactgtttcttaccttaa 
aaagtaattgtagatggaaatcagttgtgtttggcaggagaatcaataaaaatctttgat 
tcagacagcttatggggtattttaagcattcttagactagttgaacatctcactttgccc 
cagttacaaaaatagtagaacaagcaacataaaacaatgaaggaaaacctcacttgaagg 
cccaggtcaacatctaagcctgttgagacttagataatcgagtctacctcttcagtaggt 
ttgtgtggatggcctggagggcaggtgccctctgctccccagtgctacctctctcttccc 
tagggccttttgtggattgacagtagtcccctccgtaggagctcacagtctagattagaa 
gtgttttaatttctacacacccatagtgcacacttgtatattgaaaagatagggaagaga 
gaaacatttatggaatcagtcgttggcaccttcaatacttcatgatttttgtcgagttta 
cttcatgaggaggtcagcccattggctcccatctgaaccactttgcctctgaaacttaat 
tacatccagaaagaaggacacttgtatgctagtctatggtcagttgaggaatatgactgt 
ttttatatgcacatgtaacccaaatgtccaatataaattggcttattttttaaaataatt 
ttaaaagttgggaaaagtgttattatttggcatgcttaaatattgaataagtattcttca 
tcagcatttaataaatgtataggcagatgtaaggtaatttctgtgtattttgagataatg 
tcaaaatcatgaatatttcaaaataaactggggagttataaaaatacaactagagatata 
aa 

SEQ ID NO: 87. >HUMAAPA_T8 # TY Transcript # LN 2172 # Source Gene: HUMAAPA 
# Encoded protein: 

attttttaatttttatttttcagtccttattctgttctcattgtttttgctgacaccata 
gcccataggcatcgtaaaaacacctttttaggagaccccatttgctttgactggcactga 
gcaccgtgtttctttgcgtgtggggcaagtggagttcctgcagcctcacatttgcacgtg 
ctgttcctctgcctggaagctcctccttgcttcttgggtcattcggctcctatttgcccc 
tcgtgggtcagcttaaattcatttcttgatagaggtattctgctgccctgtcattaggcc 
agattgtgcattagacttaggcctgcctgcagttctcattcctgcacctaatcctgagct 
gtaaaacttcgtgcatgggagctcttggctgttccgttcccacataaagcacgtacctga 
ctcatcaactcatctgtcgcaggtgctcagtcgaaatttgttaaataaatggaggcgctt 
agttaagtttgcctttttcttttgtagcaatgtgaaagctaagggtggaagtctagagtg 
aagctgagttttcagcttgggcagaggcttaaggaataaaagatggaagagtaattaagg 
agtgggtcgtgctgtgggaagtgctgtgtcaagacacctgaggatggaattggtagccct 
ctgttttgggactggttcagatggtgttttgggttcctttctatgacctttatctctgaa 
acttgattgtcttaaaatgtattttgtgggagaaaatataattattgtatattttgtgta 
acagaatcagtgagaataagctgctgtcgcaaactgtcttgcctagagagagggcagtgg 
catacagctgcagaaggggccacacgggcctgctgagtgttccgtttcatttcaaattag 
gtcattctgtcttgattttgatatggatgtttcagaagaccctagcagatgtccctgttt 
aacttgaaaccatagatcagaaaactaagttcatatttcaattttacagagatttgaata 
gaaaaagtaatcaagttaaaacctgcaaagctaaaattgaactacttgggagctatgatc 
cacaaaaacaacttattattgaagatccctattatgggaatgactctgactttgagacgg 
tgtaccagcagtgtgtcaggtgctgcagagcgttcttggagaaggcccactgaggcaggt 
tcgtgccctgctgcggccagcctgactagaccccaccctgaggtcctgcatttctcagtc 
ggtgtgtaatcacgttccagggcccaaagcccagctctttgttcagttgacttactgttt 
cttaccttaaaaagtaattgtagatggaaatcagttgtgtttggcaggagaatcaataaa 
aatctttgattcagacagcttatggggtattttaagcattcttagactagttgaacatct 
cactttgccccagttacaaaaatagtagaacaagcaacataaaacaatgaaggaaaacct 
cacttgaaggcccaggtcaacatctaagcctgttgagacttagataatcgagtctacctc 
ttcagtaggtttgtgtggatggcctggagggcaggtgccctctgctccccagtgctacct 
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ctctcttccctagggccttttgtggattgacagtagtcccctccgtaggagctcacagtc 
tagattagaagtgttttaatttctacacacccatagtgcacacttgtatattgaaaagat 
agggaagagagaaacatttatggaatcagtcgttggcaccttcaatacttcatgattttt 
gtcgagtttacttcatgaggaggtcagcccattggctcccatctgaaccactttgcctct 
gaaacttaattacatccagaaagaaggacacttgtatgctagtctatggtcagttgagga 
atatgactgtttttatatgcacatgtaacccaaatgtccaatataaattggcttattttt 
taaaataattttaaaagttgggaaaagtgttattatttggcatgcttaaatattgaataa 
gtattcttcatcagcatttaataaatgtataggcagatgtaaggtaatttctgtgtattt 
tgagataatgtcaaaatcatgaatatttcaaaataaactggggagttataaaaatacaac 
tagagatataaa 

SEQ ID NO: 88. >HUMAAPA_T9 # TY Transcript # LN 1488 # Source Gene: HUMAAPA 
# Encoded protein: HUMAAPA_P7 

tgcgcaggcgcgcggggcaagaggctggcagtgcgcctgcgccgcgtcggcgtgcggaac 
gccgcggtgtctcggcgcctctgcgcgcgggaagatggcggaacaggctaccaagtccgt 
gctgtttgtgtgtctgggtaacatttgtcgatcacccattgcagaagcagttttcaggaa 
acttgtaaccgatcaaaacatctcagagaattggagggtagacagcgcggcaacttccgg 
gtatgagatagggaacccccctgactaccgagggcagagctgcatgaagaggcacggcat 
tcccatgagccacgttgcccggcagagatttgaatagaaaaagtaatcaagttaaaacct 
gcaaagctaaaattgaactacttgggagctatgatccacaaaaacaacttattattgaag 
atccctattatgggaatgactctgactttgagacggtgtaccagcagtgtgtcaggtgct 
gcagagcgttcttggagaaggcccactgaggcaggttcgtgccctgctgcggccagcctg 
actagaccccaccctgaggtcctgcatttctcagtcggtgtgtaatcacgttccagggcc 
caaagcccagctctttgttcagttgacttactgtttcttaccttaaaaagtaattgtaga 
tggaaatcagttgtgtttggcaggagaatcaataaaaatctttgattcagacagcttatg 
gggtattttaagcattcttagactagttgaacatctcactttgccccagttacaaaaata 
gtagaacaagcaacataaaacaatgaaggaaaacctcacttgaaggcccaggtcaacatc 
taagcctgttgagacttagataatcgagtctacctcttcagtaggtttgtgtggatggcc 
tggagggcaggtgccctctgctccccagtgctacctctctcttccctagggccttttgtg 
gattgacagtagtcccctccgtaggagctcacagtctagattagaagtgttttaatttct 
acacacccatagtgcacacttgtatattgaaaagatagggaagagagaaacatttatgga 
atcagtcgttggcaccttcaatacttcatgatttttgtcgagtttacttcatgaggaggt 
cagcccattggctcccatctgaaccactttgcctctgaaacttaattacatccagaaaga 
aggacacttgtatgctagtctatggtcagttgaggaatatgactgtttttatatgcacat 
gtaacccaaatgtccaatataaattggcttattttttaaaataattttaaaagttgggaa 
aagtgttattatttggcatgcttaaatattgaataagtattcttcatcagcatttaataa 
atgtataggcagatgtaaggtaatttctgtgtattttgagataatgtcaaaatcatgaat 
atttcaaaataaactggggagttataaaaatacaactagagatataaa 

SEQ ID NO: 89. >HUMAAPAJT10 # TY Transcript # LN 1691 # Source Gene: 

HUMAAPA # Encoded protein: HUMAAPA_P8 

tgcgcaggcgcgcggggcaagaggctggcagtgcgcctgcgccgcgtcggcgtgcggaac 
gccgcggtgtctcggcgcctctgcgcgcgggaagatggcggaacaggctaccaagtccgt 
gctgtttgtgtgtctgggtaacatttgtcgatcacccattgcagaagcagttttcaggaa 
acttgtaaccgatcaaaacatctcagagaattggagggtagacagcgcggcaacttccgg 
gtatgagatagggaacccccctgactaccgagggcagagctgcatgaagaggcacggcat 
tcccatgagccacgttgcccggcagattaccaaagaagattttgccacatttgattatat 
actatgtatggatgaaagcaatctgagagatttgaatagaaaaagtaatcaagttaaaac 
ctgcaaagctaaaattgaactacttgggagctatgatccacaaaaacaacttattattga 
agatccctattatgtaagtacagttcacgttttagggctaatatgaagacccaacacatt 
tgtatcctgccatattaaataacagatgagattgtgttaaggatgtttttgttatgcagg 
ttttgccattttcttctttttcctgtccatttaggggaatgactctgactttgagacggt 
gtaccagcagtgtgtcaggtgctgcagagcgttcttggagaaggcccactgaggcaggtt 
cgtgccctgctgcggccagcctgactagaccccaccctgaggtcctgcatttctcagtcg 
gtgtgtaatcacgttccagggcccaaagcccagctctttgttcagttgacttactgtttc 
ttaccttaaaaagtaattgtagatggaaatcagttgtgtttggcaggagaatcaataaaa 
atctttgattcagacagcttatggggtattttaagcattcttagactagttgaacatctc 
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actttgccccagttacaaaaatagtagaacaagcaacataaaacaatgaaggaaaacctc 
acttgaaggcccaggtcaacatctaagcctgttgagacttagataatcgagtctacctct 
tcagtaggtttgtgtggatggcctggagggcaggtgccctctgctccccagtgctacctc 
tctcttccctagggccttttgtggattgacagtagtcccctccgtaggagctcacagtct 
agattagaagtgttttaatttctacacacccatagtgcacacttgtatattgaaaagata 
gggaagagagaaacatttatggaatcagtcgttggcaccttcaatacttcatgatttttg 
tcgagtttacttcatgaggaggtcagcccattggctcccatctgaaccactttgcctctg 
aaacttaattacatccagaaagaaggacacttgtatgctagtctatggtcagttgaggaa 
tatgactgtttttatatgcacatgtaacccaaatgtccaatataaattggcttatttttt 
aaaataattttaaaagttgggaaaagtgttattatttggcatgcttaaatattgaataag 
tattcttcatcagcatttaataaatgtataggcagatgtaaggtaatttctgtgtatttt 
gagataatgtcaaaatcatgaatatttcaaaataaactggggagttataaaaatacaact 
agagatataaa 

SEQ ID NO: 90, >HUMAAPAJT11 # TY Transcript # LN 2223 # Source Gene: 

HUMAAPA # Encoded protein: HUMAAPA_P1 

tgcgcaggcgcgcggggcaagaggctggcagtgcgcctgcgccgcgtcggcgtgcggaac 
gccgcggtgtctcggcgcctctgcgcgcgggaagatggcggaacaggctaccaagtccgt 
gctgtttgtgtgtctgggtaacatttgtcgatcacccattgcagaagcagttttcaggaa 
acttgtaaccgatcaaaacatctcagagaattggagggtagacagcgcggcaacttccgg 
gtatgagatagggaacccccctgactaccgagggcagagctgcatgaagaggcacggcat 
tcccatgagccacgttgcccggcagattaccaaagaagattttgccacatttgattatat 
actatgtatggatgaaagcaatctgagagatttgaatagaaaaagtaatcaagttaaaac 
ctgcaaagctaaaattgaactacttgggagctatgatccacaaaaacaacttattattga 
agatccctattatgggaatgactctgactttgagacggtgtaccagcagtgtgtcaggtg 
ctgcagagcgttcttggagaaggcccactgaggcaggttcgtgccctgctgcggccagcc 
tgactagaccccaccctgaggtcctgcatttctcagtcggtgtgtaatcacgttccaggg 
cccaaagcccagctctttgttcagttgacttactgtttcttaccttaaaaagtaattgta 
gatggaaatcagttgtgtttggcaggagaatcaataaaaatctttgattcagacagctta 
tggggtattttaagcattcttagactagttgaacatctcactttgccccagttacaaaaa 
tagtagaacaagcaacataaaacaatgaaggaaaacctcacttgaaggcccaggtcaaca 
tctaagcctgttgagacttagataatcgagtctacctcttcagtaggtttgtgtggatgg 
cctggagggcaggtgccctctgctccccagtgctacctctctcttccctagggccttttg 
tggattgacagtagtcccctccgtaggagctcacagtctagattagaagtgttttaattt 
ctacacacccatagtgcacacttgtatattgaaaagatagggaagagagaaacatttatg 
gaatcagtcgttggcaccttcaatacttcatgatttttgtcgagtttacttcatgaggag 
gtcagcccattggctcccatctgaaccactttgcctctgaaacttaattacatccagaaa 
gaaggacacttgtatgctagtctatggtcagttgaggaatatgactgtttttatatgcac 
atgtaacccaaatgtccaatataaattggcttattttttaaaataattttaaaagttggg 
aaaagtgttattatttggcatgcttaaatattgaataagtattcttcatcagcatttaat 
aaatgtataggcagatgtaaggtaatttctgtgtattttgagataatgtcaaaatcatga 
atatttcaaaataaactggggagttataaaaatacaactagagatataaaaaaaaaaaaa 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaccaacccaaaaaaaaaaaaa 
aaaaaaaaaaannnnnttgggggcgccccccccgccccccaaaaaaacattttttaaaca 
acccttgttggcgcgccggccccccccaaaaaggaaaaaaagaaacaggtttcctttttt 
ctttccacaaaaaaacacccgcggacaaaaaaaactcggtttgttctcccccccccaaac 
aaaaaaaaaccccccggggggggcccaaccattcttttataaacaaacaacccggggggg 
ggccccctcgaaggagtctttattggcgggcttctcacccgaggggagcccccgcacccg 
ccttttctatttatatatattatcacccccccccccccccctgtcttcttttcgagaggg 
gggggggggtgcattttaaataaaaacctcctccgccggagaagggggttgaaccccccc 
ctatacaacgggggtggagaggagaataaacgcaccccctcccccggcgcgaggacaccc 
cccccccctcggcgccccccgcggaaaaaaaacacaacccccccccgcgggaaacacaaa 
agg 

SEQ ID NO: 91. >HUMAAPA_T12 # TY Transcript # LN 2313 # Source Gene: 

HUMAAPA # Encoded protein: 
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attttttaatttttatttttcagtccttattctgttctcattgtttttgctgacaccata 
gcccataggcatcgtaaaaacacctttttaggagaccccatttgctttgactggcactga 
gcaccgtgtttctttgcgtgtggggcaagtggagttcctgcagcctcacatttgcacgtg 
ctgttcctctgcctggaagctcctccttgcttcttgggtcattcggctcctatttgcccc 
tcgtgggtcagcttaaattcatttcttgatagaggtattctgctgccctgtcattaggcc 
agattgtgcattagacttaggcctgcctgcagttctcattcctgcacctaatcctgagct 
gtaaaacttcgtgcatgggagctcttggctgttccgttcccacataaagcacgtacctga 
ctcatcaactcatctgtcgcaggtgctcagtcgaaatttgttaaataaatggaggcgctt 
agttaagtttgcctttttcttttgtagcaatgtgaaagctaagggtggaagtctagagtg 
aagctgagttttcagcttgggcagaggcttaaggaataaaagatggaagagtaattaagg 
agtgggtcgtgctgtgggaagtgctgtgtcaagacacctgaggatggaattggtagccct 
ctgttttgggactggttcagatggtgttttgggttcctttctatgacctttatctctgaa 
acttgattgtcttaaaatgtattttgtgggagaaaatataattattgtatattttgtgta 
acagaatcagtgagaataagctgctgtcgcaaactgtcttgcctagagagagggcagtgg 
catacagctgcagaaggggccacacgggcctgctgagtgttccgtttcatttcaaattag 
gtcattctgtcttgattttgatatggatgtttcagaagaccctagcagatgtccctgttt 
aacttgaaaccatagatcagaaaactaagttcatatttcaattttacagagatttgaata 
gaaaaagtaatcaagttaaaacctgcaaagctaaaattgaactacttgggagctatgatc 
cacaaaaacaacttattattgaagatccctattatgtaagtacagttcacgttttagggc 
taatatgaagacccaacacatttgtatcctgccatattaaataacagatgagattgtgtt 
aaggatgtttttgttatgcaggttttgccattttcttctttttcctgtccatttagggga 
atgactctgactttgagacggtgtaccagcagtgtgtcaggtgctgcagagcgttcttgg 
agaaggcccactgaggcaggttcgtgccctgctgcggccagcctgactagaccccaccct 
gaggtcctgcatttctcagtcggtgtgtaatcacgttccagggcccaaagcccagctctt 
tgttcagttgacttactgtttcttaccttaaaaagtaattgtagatggaaatcagttgtg 
tttggcaggagaatcaataaaaatctttgattcagacagcttatggggtattttaagcat 
tcttagactagttgaacatctcactttgccccagttacaaaaatagtagaacaagcaaca 
taaaacaatgaaggaaaacctcacttgaaggcccaggtcaacatctaagcctgttgagac 
ttagataatcgagtctacctcttcagtaggtttgtgtggatggcctggagggcaggtgcc 
ctctgctccccagtgctacctctctcttccctagggccttttgtggattgacagtagtcc 
cctccgtaggagctcacagtctagattagaagtgttttaatttctacacacccatagtgc 
acacttgtatattgaaaagatagggaagagagaaacatttatggaatcagtcgttggcac 
cttcaatacttcatgatttttgtcgagtttacttcatgaggaggtcagcccattggctcc 
catctgaaccactttgcctctgaaacttaattacatccagaaagaaggacacttgtatgc 
tagtctatggtcagttgaggaatatgactgtttttatatgcacatgtaacccaaatgtcc 
aatataaattggcttattttttaaaataattttaaaagttgggaaaagtgttattatttg 
gcatgcttaaatattgaataagtattcttcatcagcatttaataaatgtataggcagatg 
taaggtaatttctgtgtattttgagataatgtcaaaatcatgaatatttcaaaataaact 
ggggagttataaaaatacaactagagatataaa 

SEQ ID NO: 92. >HUMAAPA_T13 # TY Transcript # LN 1458 # Source Gene: 

HUMAAPA # Encoded protein: HUMAAPA_P1 

tgcgcaggcgcgcggggcaagaggctggcagtgcgcctgcgccgcgtcggcgtgcggaac 
gccgcggtgtctcggcgcctctgcgcgcgggaagatggcggaacaggctaccaagtccgt 
gctgtttgtgtgtctgggtaacatttgtcgatcacccattgcagaagcagttttcaggaa 
acttgtaaccgatcaaaacatctcagagaattggagggtagacagcgcggcaacttccgg 
gtatgagatagggaacccccctgactaccgagggcagagctgcatgaagaggcacggcat 
tcccatgagccacgttgcccggcagattaccaaagaagattttgccacatttgattatat 
actatgtatggatgaaagcaatctgagagatttgaatagaaaaagtaatcaagttaaaac 
ctgcaaagctaaaattgaactacttgggagctatgatccacaaaaacaacttattattga 
agatccctattatgggaatgactctgactttgagacggtgtaccagcagtgtgtcaggtg 
ctgcagagcgttcttggagaaggcccactgaggcaggttcgtgccctgctgcggccagcc 
tgactagaccccaccctgaggtcctgcatttctcagtcggtgtgtaatcacgttccaggg 
cccaaagcccagctctttgttcagttgacttactgtttcttaccttaaaaagtaattgta 
gatggaaatcagttgtgtttggcaggagaatcaataaaaatctttgattcagacagctta 
tggggtattttaagcattcttagactagttgaacatctcactttgccccagttacaaaaa 
tagtagaacaagcaacataaaacaatgaaggaaaacctcacttgaaggcccaggtcaaca 
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tctaagcctgttgagacttagataatcgagtctacctcttcagtaggtttgtgtggatgg 
cctggagggcaggtgccctctgctccccagtgctacctctctcttccctagggccttttg 
tggattgacagtagtcccctccgtaggagctcacagtctagattagaagtgttttaattt 
ctacacacccatagtgcacacttgtatattgaaaagatagggaagagagaaacatttatg 
gaatcagtcgttggcaccttcaatacttcatgatttttgtcgagtttacttcatgaggag 
gtcagcccattggctcccatctgaaccactttgcctctgaaacttaattacatccagaaa 
gaaggacacttgtatgctagtctatggtcagttgaggaatatgactgtttttatatgcac 
atgtaacccaaatgtccaatataaattggcttattttttaaaataattttaaaagttggg 
aaaagtgttattatttggcatgcttaaatattgaataagtattcttcatcagcatttaat 
aaatgtataggcagatgt 

SEQ ID NO: 93. >HUMAAPA_T14 # TY Transcript # LN 775 # Source Gene: HUMAAPA 

# Encoded protein: HUMAAPA_P1 

tgcgcaggcgcgcggggcaagaggctggcagtgcgcctgcgccgcgtcggcgtgcggaac 
gccgcggtgtctcggcgcctctgcgcgcgggaagatggcggaacaggctaccaagtccgt 
gctgtttgtgtgtctgggtaacatttgtcgatcacccattgcagaagcagttttcaggaa 
acttgtaaccgatcaaaacatctcagagaattggagggtagacagcgcggcaacttccgg 
gtatgagatagggaacccccctgactaccgagggcagagctgcatgaagaggcacggcat 
tcccatgagccacgttgcccggcagattaccaaagaagattttgccacatttgattatat 
actatgtatggatgaaagcaatctgagagatttgaatagaaaaagtaatcaagttaaaac 
ctgcaaagctaaaattgaactacttgggagctatgatccacaaaaacaacttattattga 
agatccctattatgggaatgactctgactttgagacggtgtaccagcagtgtgtcaggtg 
ctgcagagcgttcttggagaaggcccactgaggcaggttcgtgccctgctgcggccagcc 
tgactagaccccaccctgaggtcctgcatttctcagtcggtgtgtaatcacgttccaggg 
cccaaagcccagctctttgttcagttgacttactgtttcttaccttaaaaagtaattgta 
gatggaaatcagttgtgtttggcaggagaatcaataaaaatctttgattcagaca 

SEQ ID NO: 94. >HUMAAPA_T15 # TY Transcript # LN 1458 # Source Gene: 

HUMAAPA 8 Encoded protein: HUMAAPA_P2 

tgcgcaggcgcgcggggcaagaggctggcagtgcgcctgcgccgcgtcggcgtgcggaac 
gccgcggtgtctcggcgcctctgcgcgcgggaagatggcggaacaggctaccaagtccgt 
gctgtttgtgtgtctgggtaacatttgtcgatcacccattgcagaagcagttttcaggaa 
acttgtaaccgatcaaaacatctcagagaattgggtcattgacagcggtgctgtttctga 
ctggaacgtgggccggtccccagacccaagagctgtgagctgcctaagaaatcatggcat 
tcacacagcccataaagcaagacagattaccaaagaagattttgccacatttgattatat 
actatgtatggatgaaagcaatctgagagatttgaatagaaaaagtaatcaagttaaaac 
ctgcaaagctaaaattgaactacttgggagctatgatccacaaaaacaacttattattga 
agatccctattatgggaatgactctgactttgagacggtgtaccagcagtgtgtcaggtg 
ctgcagagcgttcttggagaaggcccactgaggcaggttcgtgccctgctgcggccagcc 
tgactagaccccaccctgaggtcctgcatttctcagtcggtgtgtaatcacgttccaggg 
cccaaagcccagctctttgttcagttgacttactgtttcttaccttaaaaagtaattgta 
gatggaaatcagttgtgtttggcaggagaatcaataaaaatctttgattcagacagctta 
tggggtattttaagcattcttagactagttgaacatctcactttgccccagttacaaaaa 
tagtagaacaagcaacataaaacaatgaaggaaaacctcacttgaaggcccaggtcaaca 
tctaagcctgttgagacttagataatcgagtctacctcttcagtaggtttgtgtggatgg 
cctggagggcaggtgccctctgctccccagtgctacctctctcttccctagggccttttg 
tggattgacagtagtcccctccgtaggagctcacagtctagattagaagtgttttaattt 
ctacacacccatagtgcacacttgtatattgaaaagatagggaagagagaaacatttatg 
gaatcagtcgttggcaccttcaatacttcatgatttttgtcgagtttacttcatgaggag 
gtcagcccattggctcccatctgaaccactttgcctctgaaacttaattacatccagaaa 
gaaggacacttgtatgctagtctatggtcagttgaggaatatgactgtttttatatgcac 
atgtaacccaaatgtccaatataaattggcttattttttaaaataattttaaaagttggg 
aaaagtgttattatttggcatgcttaaatattgaataagtattcttcatcagcatttaat 
aaatgtataggcagatgt 

SEQ ID NO: 95. >HUMAAPAJT16 # TY Transcript # LN 775 # Source Gene: HUMAAPA 

# Encoded protein: HUMAAPA_P2 
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tgcgcaggcgcgcggggcaagaggctggcagtgcgcctgcgccgcgtcggcgtgcggaac 
gccgcggtgtctcggcgcctctgcgcgcgggaagatggcggaacaggctaccaagtccgt 
gctgtttgtgtgtctgggtaacatttgtcgatcacccattgcagaagcagttttcaggaa 
acttgtaaccgatcaaaacatctcagagaattgggtcattgacagcggtgctgtttctga 
ctggaacgtgggccggtccccagacccaagagctgtgagctgcctaagaaatcatggcat 
tcacacagcccataaagcaagacagattaccaaagaagattttgccacatttgattatat 
actatgtatggatgaaagcaatctgagagatttgaatagaaaaagtaatcaagttaaaac 
ctgcaaagctaaaattgaactacttgggagctatgatccacaaaaacaacttattattga 
agatccctattatgggaatgactctgactttgagacggtgtaccagcagtgtgtcaggtg 
ctgcagagcgttcttggagaaggcccactgaggcaggttcgtgccctgctgcggccagcc 
tgactagaccccaccctgaggtcctgcatttctcagtcggtgtgtaatcacgttccaggg 
cccaaagcccagctctttgttcagttgacttactgtttcttaccttaaaaagtaattgta 
gatggaaatcagttgtgtttggcaggagaatcaataaaaatctttgattcagaca 

SEQ ID NO: 96. >HUMAAPA_T17 # TY Transcript # LN 804 # Source Gene: HUMAAPA 

# Encoded protein: HUMAAPA_P11 

tgcgcaggcgcgcggggcaagaggctggcagtgcgcctgcgccgcgtcggcgtgcggaac 
gccgcggtgtctcggcgcctctgcgcgcgggaagatggcggaacaggctaccaagtccgt 
gctgtttgtgtgtctgggtaacatttgtcgatcacccattgcagaagcagttttcaggaa 
acttgtaaccgatcaaaacatctcagagaattggagggtagacagcgcggcaacttccgg 
tgggtcattgacagcggtgctgtttctgactggaacgtgggccggtccccagacccaaga 
gctgtgagctgcctaagaaatcatggcattcacacagcccataaagcaagacagattacc 
aaagaagattttgccacatttgattatatactatgtatggatgaaagcaatctgagagat 
ttgaatagaaaaagtaatcaagttaaaacctgcaaagctaaaattgaactacttgggagc 
tatgatccacaaaaacaacttattattgaagatccctattatgggaatgactctgacttt 
gagacggtgtaccagcagtgtgtcaggtgctgcagagcgttcttggagaaggcccactga 
ggcaggttcgtgccctgctgcggccagcctgactagaccccaccctgaggtcctgcattt 
ctcagtcggtgtgtaatcacgttccagggcccaaagcccagctctttgttcagttgactt 
actgtttcttaccttaaaaagtaattgtagatggaaatcagttgtgtttggcaggagaat 
caataaaaatctttgattcagaca 

SEQ ID NO: 97. >HUMAAPA_T18 # TY Transcript # LN 1042 # Source Gene: 

HUMAAPA # Encoded protein: HUMAAPA_P2 

tgcgcaggcgcgcggggcaagaggctggcagtgcgcctgcgccgcgtcggcgtgcggaac 
gccgcggtgtctcggcgcctctgcgcgcgggaagatggcggaacaggctaccaagtccgt 
gctgtttgtgtgtctgggtaacatttgtcgatcacccattgcagaagcagttttcaggaa 
acttgtaaccgatcaaaacatctcagagaattgggtcattgacagcggtgctgtttctga 
ctggaacgtgggccggtccccagacccaagagctgtgagctgcctaagaaatcatggcat 
tcacacagcccataaagcaagacagattaccaaagaagattttgccacatttgattatat 
actatgtatggatgaaagcaatctgagagatttgaatagaaaaagtaatcaagttaaaac 
ctgcaaagctaaaattgaactacttgggagctatgatccacaaaaacaacttattattga 
agatccctattatgggaatgactctgactttgagacggtgtaccagcagtgtgtcaggtg 
ctgcagagcgttcttggagaaggcccactgaggcaggttcgtgccctgctgcggccagcc 
tgactagaccccaccctgaggtcctgcatttctcagtcggtgtgtaatcacgttccaggg 
cccaaagcccagctctttgttcagttgacttactgtttcttaccttaaaaagtaattgta 
gatggaaatcagttgtgtttggcaggagaatcaataaaaatctttgattcagacagctta 
tggggtattttaagcattcttagactagttgaacatctcactttgcccagctcaaaattg 
tgacagcaacttaacaggaggaaaccccttgaggcaggtaactcaagcgtgaataaaaac 
agtaccctcagaggtgggagccgaggggagcctgggtccaggacactcttccaggcttga 
ggaaacccggggcagataaggtttccacccgccttgagagaagatcgccgccactggata 
agcgcgccccaaaagggtgact 

SEQ ID NO: 98. >HUMAAPA_T19 # TY Transcript # LN 787 # Source Gene: HUMAAPA 

# Encoded protein: 

aataatgaaggggattaaatatgtgacattttagtatgttgactgtattatacactgcta 
ctaaggaattgaagccgatgtataaacattgtgtctacacattatactattttatgatta 
ttgtatggaactttataatacaaaatttctttgcagtacaattttgagaaataggttaac 
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tctattttaacttaaaagtaccctaaaaattatttaattgtttattagtgagtaacaggc 
tcaaatacagcagttttttttttcttttagtattgctttgcatcctctaggcttgaatgg 
tataaacactgtgttttgacttcttattcaattttagattaccaaagaagattttgccac 
atttgattatatactatgtatggatgaaagcaatctgagagatttgaatagaaaaagtaa 
tcaagttaaaacctgcaaagctaaaattgaactacttgggagctatgatccacaaaaaca 
acttattattgaagatccctattatgggaatgactctgactttgagacggtgtaccagca 
gtgtgtcaggtgctgcagagcgttcttggagaaggcccactgaggcaggttcgtgccctg 
ctgcggccagcctgactagaccccaccctgaggtcctgcatttctcagtcggtgtgtaat 
cacgttccagggcccaaagcccagctctttgttcagttgacttactgtttcttaccttaa 
aaagtaattgtagatggaaatcagttgtgtttggcaggagaatcaataaaaatctttgat 
tcagaca 

SEQ ID NO: 99. >HUMAAPAJT20 # TY Transcript # LN 916 # Source Gene: HUMAAPA 
# Encoded protein: HUMAAPA_P8 

tgcgcaggcgcgcggggcaagaggctggcagtgcgcctgcgccgcgtcggcgtgcggaac 
gccgcggtgtctcggcgcctctgcgcgcgggaagatggcggaacaggctaccaagtccgt 
gctgtttgtgtgtctgggtaacatttgtcgatcacccattgcagaagcagttttcaggaa 
acttgtaaccgatcaaaacatctcagagaattggagggtagacagcgcggcaacttccgg 
gtatgagatagggaacccccctgactaccgagggcagagctgcatgaagaggcacggcat 
tcccatgagccacgttgcccggcagattaccaaagaagattttgccacatttgattatat 
actatgtatggatgaaagcaatctgagagatttgaatagaaaaagtaatcaagttaaaac 
ctgcaaagctaaaattgaactacttgggagctatgatccacaaaaacaacttattattga 
agatccctattatgtaagtacagttcacgttttagggctaatatgaagacccaacacatt 
tgtatcctgccatattaaataacagatgagattgtgttaaggatgtttttgttatgcagg 
ttttgccattttcttctttttcctgtccatttaggggaatgactctgactttgagacggt 
gtaccagcagtgtgtcaggtgctgcagagcgttcttggagaaggcccactgaggcaggtt 
cgtgccctgctgcggccagcctgactagaccccaccctgaggtcctgcatttctcagtcg 
gtgtgtaatcacgttccagggcccaaagcccagctctttgttcagttgacttactgtttc 
ttaccttaaaaagtaattgtagatggaaatcagttgtgtttggcaggagaatcaataaaa 
atctttgattcagaca 

SEQ ID NO: 100. >HUMAAPA_T21 # TY Transcript # LN 713 # Source Gene: 

HUMAAPA # Encoded protein: HUMAAPA_P7 

tgcgcaggcgcgcggggcaagaggctggcagtgcgcctgcgccgcgtcggcgtgcggaac 
gccgcggtgtctcggcgcctctgcgcgcgggaagatggcggaacaggctaccaagtccgt 
gctgtttgtgtgtctgggtaacatttgtcgatcacccattgcagaagcagttttcaggaa 
acttgtaaccgatcaaaacatctcagagaattggagggtagacagcgcggcaacttccgg 
gtatgagatagggaacccccctgactaccgagggcagagctgcatgaagaggcacggcat 
tcccatgagccacgttgcccggcagagatttgaatagaaaaagtaatcaagttaaaacct 
gcaaagctaaaattgaactacttgggagctatgatccacaaaaacaacttattattgaag 
atccctattatgggaatgactctgactttgagacggtgtaccagcagtgtgtcaggtgct 
gcagagcgttcttggagaaggcccactgaggcaggttcgtgccctgctgcggccagcctg 
actagaccccaccctgaggtcctgcatttctcagtcggtgtgtaatcacgttccagggcc 
caaagcccagctctttgttcagttgacttactgtttcttaccttaaaaagtaattgtaga 
tggaaatcagttgtgtttggcaggagaatcaataaaaatctttgattcagaca 

SEQ ID NO: 101. >HUMAAPAJF22 # TY Transcript # LN 1534 # Source Gene: 

HUMAAPA # Encoded protein: HUMAAPA_P5 

tgcgcaggcgcgcggggcaagaggctggcagtgcgcctgcgccgcgtcggcgtgcggaac 
gccgcggtgtctcggcgcctctgcgcgcgggaagatggcggaacaggctaccaagtccgt 
gctgtttgtgtgtctgggtaacatttgtcgatcacccattgcagaagcagttttcaggaa 
acttgtaaccgatcaaaacatctcagagaattggagggtagacagcgcggcaacttccgg 
tgggtcattgacagcggtgctgtttctgactggaacgtgggccggtccccagacccaaga 
gctgtgagctgcctaagaaatcatggcattcacacagcccataaagcaagacaggtagac 
aagctcttgttcaatttctaatatatagagtccagtaacttgagaagtagcgaaaggatt 
aaccagacttgtatattaatgaatgtgtttatttagggtgagcttaaccagctatggtgt 
gtccattttgtttcacttctggttgcacggtgttgaaagacttgcctgactttggaattt 
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acttattaaaatgcacataaaagctaggtaatttataatgagagagcctgactgtgagct 
ggggctgagcggtgctctgtcttctgttccttcctgcataatttttattaaacatttagg 
ccatagtaatcatcctgctgatattgcaagtttgttgctagaatgaggttatataatata 
tacaaaaacattttttcaactgtaaagtgccttagtaatatagggtaataccagcaacat 
tatggatatataattatagtctattgggccacacttaagtttggagtctaataaagtcac 
aatcaaattctgcaatttcaattgaagataaccttgtctttatattatgaattagaagct 
aaagttgatttttctaagagttctttatttaaatgaagtactctgggactgaccttttcg 
gaaatggaatcttcattggtcaggtgattcaacatttttatacaatttatccatcctcat 
ctcttcaggatttgcataccttgccagtttctactggccattgttgaaaatacatttatt 
tggagaagtccaaagccaaggggctcatggggctgtgaggtccttcttgctgcatcgtcc 
tgtggtagaaggtggaggagtcaagagagtgccccagagtgagtgagagcgagaactaga 
aaaacgggaagagggaaacagaggagagagagagagaggacccatcagtgtcaggagccc 
actcccaagatagtggcattaataaagatcctgcgtctcactattgttgcattggggatg 
aagtttccaacaaaggaactttgggggacacatccaaaccataacataggatttaaataa 
ttttacagagttcaagagttctgctactgaaccgtttgagatccctgttctgaggtctca 
tcactttccagttttagcaggaagagaagtggcaagtggcaggagtctgcagattggggc 
ctgcacctttttttgaggcaccttttttatgaac 

SEQ ID NO: 102 . >HUMAAPA_T23 # TY Transcript # LN 820 # Source Gene: 

HUMAAPA # Encoded protein: HUMAAPA_P9 

tgcgcaggcgcgcggggcaagaggctggcagtgcgcctgcgccgcgtcggcgtgcggaac 
gccgcggtgtctcggcgcctctgcgcgcgggaagatggcggaacaggctaccaagtccgt 
gctgtttgtgtgtctgggtaacatttgtcgatcacccattgcagaagcagttttcaggaa 
acttgtaaccgatcaaaacatctcagagaattggagggtagacagcgcggcaacttccgg 
gtatgagatagggaacccccctgactaccgagggcagagctgcatgaagaggcacggcat 
tcccatgagccacgttgcccggcagattaccaaagaagattttgccacatttgattatat 
actatgtatggatgaaagcaatctgaggtaatcctgtttttgaagaatatttctgttcaa 
ctctcagttcagcagtgggccaagtaatttgttgtccagatttactttttctattttaaa 
ggttttaatagtcagtgatggtcaccatatgtagaattattttatttagaacagcaaaaa 
acttagtaatctagaattgtctcttaggtatttacaaggaaatacaccttagaaaaggga 
aagtctagttgttaatagcatgatttaattggaaaactagcttggctgttatagttttgt 
atcaagcatattgttctgcatgagaacgattttgatatttttgctgtaatgattcctggg 
cagggcagtgttttattgattctcaggtcaaagctctgcaaccataggttcctattatta 
tcccatcttgtaaaagaaggaagtgagaaacggtaatgtt 

SEQ ID NO: 103 . >HUMAAPA_T24 # TY Transcript # LN 497 # Source Gene: 

HUMAAPA # Encoded protein: 

gttgtatttcattttaaaataagctggttgcaacccacttcattttcgtgattcactcct 
ggatcattagtagcaactctttaaacaataatttttgttgaaattgtttttctgatatct 
ttgtcatctgtacttggagtacttggctggttggtgtcaaatcactgaacttgagcctga 
taggctgttctccagcagggtctaccctcactggttggacaaaggcgtcaaaggatagtg 
ttggcctttgtttttcctgagggatagcacagcccccgtgtgtcaggtgtgtaaagaagg 
ggagctggcatgtgcccttccatccaacctaaccctgtttccccacccctcccttttttt 
aaaggtaacatttgtcgatcacccattgcagaagcagttttcaggaaacttgtaaccgat 
caaaacatctcagagaatgtaagtaccattcattatcttaaagaggccaacctgaactcc 
tctgggcaggaaattgc 

SEQ ID NO: 104 • >HUMAAPAJT25 # TY Transcript # LN 501 # Source Gene: 

HUMAAPA # Encoded protein: HUMAAPA_P10 

tgcgcaggcgcgcggggcaagaggctggcagtgcgcctgcgccgcgtcggcgtgcggaac 
gccgcggtgtctcggcgcctctgcgcgcgggaagatggcggaacaggctaccaagtccgt 
gctgtttgtgtgtctgggtaagattttttactgctgtggaaactacagtctctgtgggaa 
aaaagtaacatttgtcgatcacccattgcagaagcagttttcaggaaacttgtaaccgat 
caaaacatctcagagaattggagggtagacagcgcggcaacttccgggtatgagataggg 
aacccccctgactgctgcggccagcctgactagaccccaccctgaggtcctgcatttctc 
agtcggtgtgtaatcacgttccagggcccaaagcccagctctttgttcagttgacttact 
gtttcttaccttaaaaagtaattgtagatggaaatcagttgtgtttggcaggagaatcaa 
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taaaaatctttgattcagaca 

SEQ ID NO: 105. >HUMAAPA_P1 # TY Protein # CC #LN 158 # Source Gene 

HUMAAPA # Encoding Transcript: 1 

MAEQATKSVLFVCLGNICRSPIAEAVFRKLVTDQNISENWRVDSAATSGYEIGNPPDYRG 
QSCMKRHGI PMSHVARQI TKEDFAT FDYI LCMDESNLRDLNRKSNQVKTCKAKI ELLGS Y 
DPQKQLIIEDPYYGNDSDFETVYQQCVRCCRAFLEKAH 

SEQ ID NO: 106. >HUMAAPA_P2 # TY Protein # CC #LN 158 # Source Gene 

HUMAAPA # Encoding Transcript: 2 

MAEQATKSVLFVCLGNICRSPIAEAVFRKLVTDQNISENWVIDSGAVSDWNVGRSPDPRA 
VSCLRNHGIHTAHKARQITKEDFATFDYILCMDESNLRDLNRKSNQVKTCKAKIELLGSY • 
DPQKQLI I EDPYYGNDSDFETVYQQCVRCCRAFLEKAH 

SEQ ID NO: 107. >HUMAAPA_P3 # TY Protein # CC #LN 159 # Source Gene 

HUMAAPA # Encoding Transcript: 3 

E FPRRGRKS VL FVCLGN I CRS P I AEAV FRKL VT DQN I S EN WV I DS GAVS DWN VGRS P DPR 
AVSCLRNHGIHTAHKARQITKEDFATFDYILCMDESNLRDLNRKSNQVKTCKAKIELLGS 
Y DPQKQLI I EDPYYGNDSDFETVYQQCVRCCRAFLEKAH 

SEQ ID NO: 108. >HUMAAPA_P4 # TY Protein # CC #LN 14 8 # Source Gene 

HUMAAPA # Encoding Transcript: 4 

NSPARREAARPIAEAVFRKLVTDQNISENWRVDSAATSGYEIGNPPDYRGQSCMKRHGIP 
MSH VARQI TKEDFAT FDYI LCMDESNLRDLNRKSNQVKTCKAKI ELLGS YDPQKQLI I ED 
PYYGNDS DFET VYQQCVRCCRAFLEKAH 

SEQ ID NO: 109. >HUMAAPA_P11 # TY Protein # CC #LN 73 # Source Gene 

HUMAAPA # Encoding Transcript: 5 

MAEQATKSVLFVCLGNICRSPIAEAVFRKLVTDQNISENWRVDSAATSGGSLIAVLFLTG 
TWAGPQTKSCGAA 

SEQ ID NO: 110. >HUMAAPA_P6 # TY Protein # CC #LN 174 # Source Gene 

HUMAAPA # Encoding Transcript: 6 

MAEQATKSVLFVCLGKISLLLWKLQSLWEKSNICRSPIAEAVFRKLVTDQNISENWRVDS 
AATSGYEIGNPPDYRGQSCMKRHGIPMSHVARQITKEDFATFDYILCMDESNLRDLNRKS 
NQVKTCKAKI ELLGS YDPQKQLI I EDPYYGNDSDFETVYQQCVRCCRAFLEKAH 

SEQ ID NO: 111. >HUMAAPA_P7 # TY Protein # CC #LN 80 # Source Gene 

HUMAAPA # Encoding Transcript: 9 

MAEQATKSVLFVCLGNICRSPIAEAVFRKLVTDQNISENWRVDSAATSGYEIGNPPDYRG 
QSCMKRHGI PMSH VARQRFE 

SEQ ID NO: 112. >HUMAAPA_P8 # TY Protein # CC #LN 143 # Source Gene 

HUMAAPA # Encoding Transcript: 10 

MAEQATKSVLFVCLGNICRSPIAEAVFRKLVTDQNISENWRVDSAATSGYEIGNPPDYRG 
QSCMKRHG I PMSHVARQI TKEDFAT FDYI LCMDESNLRDLNRKSNQVKTCKAKI ELLGS Y 
DPQKQLI I EDPYYVSTVHVLGLI 

SEQ ID NO: 113. >HUMAAPA_P5 # TY Protein # CC #LN 70 # Source Gene 

HUMAAPA # Encoding Transcript: 22 

MAEQATKSVLFVCLGNICRSPIAEAVFRKLVTDQNISENWRVDSAATSGGSLTAVLFLTG 
TWAGPQTQEL 

SEQ ID NO: 114. >HUMAAPA_P9 # TY Protein # CC #LN 98 # Source Gene 

HUMAAPA # Encoding Transcript: 23 

MAEQATKSVLFVCLGNICRSPIAEAVFRKLVTDQNISENWRVDSAATSGYEIGNPPDYRG 
QSCMKRHGI PMSHVARQI TKEDFAT FDYI LCMDESNLR 
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SEQ ID NO:115. >HUMAAPA_P10 # TY Protein § CC #LN 46 # Source Gene: 

HCJMAAPA # Encoding Transcript: 25 
CAGARGKRLAVRLRRVGVRNAAVSRRLCAREDGGTGYQVRAVCVSG 



SEQ ID NO: 116. >HUMPTPB # TY Consensus # Length 2792 # Number of exons 17 

gaaccagtcagcgattagaggccgagtcttcggccacccaaaggcggagtaagaaaccag 

aagcggatctgattggttgctggaagacgccgcgcccacctcacagaaggacgaaccagt 

gagctaagctgcggggcgcgggctcggccggggcaccggtgagtcgccggcgctgcagag 

ggaggcggcactggtctcgacgtggggcggccagcgatgaagccgcccagttcaatacaa 

acaagtgagtttgactcatcagatgaagagcctattgaagatgaacagactccaattcat 

atatcatggctatctttgtcacgagtgaattgttctcagtttctcggtttatgtgctctt 

ccaggttgtaaatttaaagatgttagaagaaatgtccaaaaagatacagaagaactaaag 

agctgtggtatacaagacatatttgttttctgcaccagaggggaactgtcaaaatataga 

gtcccaaaccttctggatctctaccagcaatgtggaattatcacccatcatcatccaatc 

gcagatggagggactcctgacatagccagctgctgtgaaataatggaagagcttacaacc 

tgccttaaaaattaccgaaaaaccttaatacaaggtgcaacttccttcggtcgtcccgaa 

tccgggttcatccgacaccagccgcctccaccatgccgccgaagttcgaccccaacgaga 

tcaaagtcgtatacctgaggtgcaccggaggtgaagtcggtgccacttctgccctggccc 

ccaagatcggccccctgggtctgtctccaaaaaagttggtgatgacattgccaaggcaac 

gggtgactggaagggcctgaggattacagtgaaactgaccattcagaacagacaggccca 

gattgaggtggtgccttctgcctctgccctgatcatcaaagccctcaaggaaccaccaag 

agacagaaagaaacagaaaaacattaaacacagtgggaatatcacttttgatgagattgt 

caacattgctcgacagatgcggcaccgatccttagccagagaactctctggaaccattaa 

agagatcctggggactgcccagtcagtgggctgtaatgttgatggccgccatcctcatga 

catcatcgatgacatcaacagtggtgctgtggaatgcccagccagttaagcacaaaggaa 

aacatttcaataaaggatcatttgacaactggtgaaaaaaaaaaaaaaaggagatcttgt 

cttgctgctatggaggacttgggagatcttgtcttgcaattacaccagtcttctgtatca 

acagaagtcttattaaatataaaatgctgtaccctgtcactagttatttaataacatatt 

atttttctgtcatgttccattagtagctgcttgtctcctactatacctgtctgacacaat 

atcaccagagcaagccatagacagcctgcgagacctaagaggatccggggcaatacagac 

catcaaggatctacctacttctcagtttttgccccagtccgttttgggaatggatcctct 

cacctaggacaatgagatggttattgttaatacaaacttgcaatacaattatcttcatga 

gtttcgggacaaattagctgcacatctatcatcaagagattcacaatcaagatctgtatc 

aagataaaggaattcaaatagcatatatatgaccatgtctgaaatgtcagttctctagca 

taatttgtattgaaatgaaaccaccagtgttatcaacttgaatgtaaatgtacatgtgca 

gatattcctaaagttttattgacaaaaaaaaaaaraaaamrwaaaaaaaaaaaaaaaaaa 

aaaaggggggggggcaaaaaaatttcccggggggggcccaaacccccctttttcttgaaa 

acagggggcgactcctaaagagaggaggaatattatataaggacaaggggagcggacgcg 

cgttttaaaaaaacgtgttctgtgagtaggacaaaaatggcttgagtgggcatttctttg 

aggaaagcacaccatcagttggggagcagggaaaattagtagggacaccactcaccgaga 

agagtaaatgaaacgcacgcgagggggataaataaacaatggggagtagataatgttgat 

acaccgagcatcaaatgtgtggagaagaagattaccaagaaaacaacnncaccccccccc 

atttccccccacgcccccaaaatataggtccacctccctaaccccctttttccgcccccc 

aaaaacctttggtgggcccccttgcgcccctccaaaaaactttcttaaaaccattttttt 

tgggccccccgggccccccgaagggtaaaggtaaaaccgggtccaaaacctggttcccca 

aaggaaaaccctgggccaaataaccaaaagtgccgcgaactcccctccccaaataaaaaa 

acaccccccggggggcacccccaaacgtgtttcgtaaaacaattccccaaagggaggttt 

ccggagggggcccttactggggatatttaccaccccgggagacccccagagcgaaacctc 

ccaatacaaaatatttaagccccccgcccttggttaatcttctttcgcaaaaaaacgggg 
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gggtaaattccccctaagaccctttcccttccccgaaattggcaaatccccctccccaca 
aaccggcatttgatggacacccttaaattaccccccgcggggggtaattcaaacccccct 
ttggtcgccccttgtgccataataaaatattt 

SEQ ID NO: 117 . >HUMPTPB_T1 # TY Transcript # LN 1369 t Source Gene: 

HUMPTPB # Encoded protein: HUMPTPB_P1 

gaaccagtcagcgattagaggccgagtcttcggccacccaaaggcggagtaagaaaccag 
aagcggatctgattggttgctggaagacgccgcgcccacctcacagaaggacgaaccagt 
gagctaagctgcggggcgcgggctcggccggggcaccggtgagtcgccggcgctgcagag 
ggaggcggcactggtctcgacgtggggcggccagcgatgaagccgcccagttcaatacaa 
acaagtgagtttgactcatcagatgaagagcctattgaagatgaacagactccaattcat 
atatcatggctatctttgtcacgagtgaattgttctcagtttctcggtttatgtgctctt 
ccaggttgtaaatttaaagatgttagaagaaatgtccaaaaagatacagaagaactaaag 
agctgtggtatacaagacatatttgttttctgcaccagaggggaactgtcaaaatataga 
gtcccaaaccttctggatctctaccagcaatgtggaattatcacccatcatcatccaatc 
gcagatggagggactcctgacatagccagctgctgtgaaataatggaagagcttacaacc 
tgccttaaaaattaccgaaaaaccttaatacactgctatggaggacttgggagatcttgt 
cttgtagctgcttgtctcctactatacctgtctgacacaatatcaccagagcaagccata 
gacagcctgcgagacctaagaggatccggggcaatacagaccatcaagcaatacaattat 
cttcatgagtttcgggacaaattagctgcacatctatcatcaagagattcacaatcaaga 
tctgtatcaagataaaggaattcaaatagcatatatatgaccatgtctgaaatgtcagtt 
ctctagcataatttgtattgaaatgaaaccaccagtgttatcaacttgaatgtaaatgta 
catgtgcagatattcctaaagttttattgacaaaaaaaaaaaraaaamrwaaaaaaaaaa 
aaaaaaaaaaaaggggggggggcaaaaaaatttcccggggggggcccaaacccccctttt 
tcttgaaaacagggggcgactcctaaagagaggaggaatattatataaggacaaggggag 
cggacgcgcgttttaaaaaaacgtgttctgtgagtaggacaaaaatggcttgagtgggca 
tttctttgaggaaagcacaccatcagttggggagcagggaaaattagtagggacaccact 
caccgagaagagtaaatgaaacgcacgcgagggggataaataaacaatggggagtagata 
atgttgatacaccgagcatcaaatgtgtggagaagaagattaccaagaa 

SEQ ID NO: 118, >HUMPTPB_T2 # TY Transcript # LN 1586 # Source Gene: 

HUMPTPB # Encoded protein: HUMPTPB_P1 

gaaccagtcagcgattagaggccgagtcttcggccacccaaaggcggagtaagaaaccag 
aagcggatctgattggttgctggaagacgccgcgcccacctcacagaaggacgaaccagt 
gagctaagctgcggggcgcgggctcggccggggcaccggtgagtcgccggcgctgcagag 
ggaggcggcactggtctcgacgtggggcggccagcgatgaagccgcccagttcaatacaa 
acaagtgagtttgactcatcagatgaagagcctattgaagatgaacagactccaattcat 
atatcatggctatctttgtcacgagtgaattgttctcagtttctcggtttatgtgctctt 
ccaggttgtaaatttaaagatgttagaagaaatgtccaaaaagatacagaagaactaaag 
agctgtggtatacaagacatatttgttttctgcaccagaggggaactgtcaaaatataga 
gtcccaaaccttctggatctctaccagcaatgtggaattatcacccatcatcatccaatc 
gcagatggagggactcctgacatagccagctgctgtgaaataatggaagagcttacaacc 
tgccttaaaaattaccgaaaaaccttaatacactgctatggaggacttgggagatcttgt 
cttgtagctgcttgtctcctactatacctgtctgacacaatatcaccagagcaagccata 
gacagcctgcgagacctaagaggatccggggcaatacagaccatcaagcaatacaattat 
cttcatgagtttcgggacaaattagctgcacatctatcatcaagagattcacaatcaaga 
tctgtatcaagataaaggaattcaaatagcatatatatgaccatgtctgaaatgtcagtt 
ctctagcataatttgtattgaaatgaaaccaccagtgttatcaacttgaatgtaaatgta 
catgtgcagatattcctaaagttttattgacaaaaaacaacnncacccccccccatttcc 
ccccacgcccccaaaatataggtccacctccctaaccccctttttccgccccccaaaaac 
ctttggtgggcccccttgcgcccctccaaaaaactttcttaaaaccattttttttgggcc 
ccccgggccccccgaagggtaaaggtaaaaccgggtccaaaacctggttccccaaaggaa 
aaccctgggccaaataaccaaaagtgccgcgaactcccctccccaaataaaaaaacaccc 
cccggggggcacccccaaacgtgtttcgtaaaacaattccccaaagggaggtttccggag 
ggggcccttactggggatatttaccaccccgggagacccccagagcgaaacctcccaata 
caaaatatttaagccccccgcccttggttaatcttctttcgcaaaaaaacgggggggtaa 
attccccctaagaccctttcccttccccgaaattggcaaatccccctccccacaaaccgg 
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catttgatggacacccttaaattaccccccgcggggggtaattcaaacccccctttggtc 
gccccttgtgccataataaaatattt 

SEQ ID NO: 119. >HUMPTPB_T3 # TY Transcript # LN 1249 # Source Gene: 

HUMPTPB # Encoded protein: HUMPTPB_P2 

gaaccagtcagcgattagaggccgagtcttcggccacccaaaggcggagtaagaaaccag 
aagcggatctgattggttgctggaagacgccgcgcccacctcacagaaggacgaaccagt 
gagctaagctgcggggcgcgggctcggccggggcaccggtgagtcgccggcgctgcagag 
ggaggcggcactggtctcgacgtggggcggccagcgatgaagccgcccagttcaatacaa 
acaagttgtaaatttaaagatgttagaagaaatgtccaaaaagatacagaagaactaaag 
agctgtggtatacaagacatatttgttttctgcaccagaggggaactgtcaaaatataga 
gtcccaaaccttctggatctctaccagcaatgtggaattatcacccatcatcatccaatc 
gcagatggagggactcctgacatagccagctgctgtgaaataatggaagagcttacaacc 
tgccttaaaaattaccgaaaaaccttaatacactgctatggaggacttgggagatcttgt 
cttgtagctgcttgtctcctactatacctgtctgacacaatatcaccagagcaagccata 
gacagcctgcgagacctaagaggatccggggcaatacagaccatcaagcaatacaattat 
cttcatgagtttcgggacaaattagctgcacatctatcatcaagagattcacaatcaaga 
tctgtatcaagataaaggaattcaaatagcatatatatgaccatgtctgaaatgtcagtt 
ctctagcataatttgtattgaaatgaaaccaccagtgttatcaacttgaatgtaaatgta 
catgtgcagatattcctaaagttttattgacaaaaaaaaaaaraaaamrwaaaaaaaaaa 
aaaaaaaaaaaaggggggggggcaaaaaaatttcccggggggggcccaaacccccctttt 
tcttgaaaacagggggcgactcctaaagagaggaggaatattatataaggacaaggggag 
cggacgcgcgttttaaaaaaacgtgttctgtgagtaggacaaaaatggcttgagtgggca 
tttctttgaggaaagcacaccatcagttggggagcagggaaaattagtagggacaccact 
caccgagaagagtaaatgaaacgcacgcgagggggataaataaacaatggggagtagata 
atgttgatacaccgagcatcaaatgtgtggagaagaagattaccaagaa 

SEQ ID NO: 120. >HUMPTPB_T4 # TY Transcript # LN 1286 # Source Gene: 

HUMPTPB # Encoded protein: HUMPTPB_P8 

gaaccagtcagcgattagaggccgagtcttcggccacccaaaggcggagtaagaaaccag 
aagcggatctgattggttgctggaagacgccgcgcccacctcacagaaggacgaaccagt 
gagctaagctgcggggcgcgggctcggccggggcaccggtgagtcgccggcgctgcagag 
ggaggcggcactggtctcgacgtggggcggccagcgatgaagccggctatctttgtcacg 
agtgaattgttctcagtttctcggtttatgtgctcttccaggttgtaaatttaaagatgt 
tagaagaaatgtccaaaaagatacagaagaactaaagagctgtggtatacaagacatatt 
tgttttctgcaccagaggggaactgtcaaaatatagagtcccaaaccttctggatctcta 
ccagcaatgtggaattatcacccatcatcatccaatcgcagatggagggactcctgacat 
agccagctgctgtgaaataatggaagagcttacaacctgccttaaaaattaccgaaaaac 
cttaatacactgctatggaggacttgggagatcttgtcttgtagctgcttgtctcctact 
atacctgtctgacacaatatcaccagagcaagccatagacagcctgcgagacctaagagg 
atccggggcaatacagaccatcaagcaatacaattatcttcatgagtttcgggacaaatt 
agctgcacatctatcatcaagagattcacaatcaagatctgtatcaagataaaggaattc 
aaatagcatatatatgaccatgtctgaaatgtcagttctctagcataatttgtattgaaa 
tgaaaccaccagtgttatcaacttgaatgtaaatgtacatgtgcagatattcctaaagtt 
ttattgacaaaaaaaaaaaraaaamrwaaaaaaaaaaaaaaaaaaaaaaggggggggggc 
aaaaaaatttcccggggggggcccaaacccccctttttcttgaaaacagggggcgactcc 
taaagagaggaggaatattatataaggacaaggggagcggacgcgcgttttaaaaaaacg 
tgttctgtgagtaggacaaaaatggcttgagtgggcatttctttgaggaaagcacaccat 
cagttggggagcagggaaaattagtagggacaccactcaccgagaagagtaaatgaaacg 
cacgcgagggggataaataaacaatggggagtagataatgttgatacaccgagcatcaaa 
tgtgtggagaagaagattaccaagaa 

SEQ ID NO: 121. >HUMPTPB_T5 # TY Transcript # LN 1313 # Source Gene: 

HUMPTPB # Encoded protein: HUMPTPB_P8 

gaaccagtcagcgattagaggccgagtcttcggccacccaaaggcggagtaagaaaccag 
aagcggatctgattggttgctggaagacgccgcgcccacctcacagaaggacgaaccagt 
gagctaagctgcggggcgcgggctcggccggggcaccggtgagtcgccggcgctgcagag 
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ggaggcggcactggtctcgacgtggggcggccagcgatgaagccgcccagttcaatacaa 
acaagtgagtttgactcatcagatgaagagcctattgaagatgaacagactccaattcat 
atatcatggttgtaaatttaaagatgttagaagaaatgtccaaaaagatacagaagaact 
aaagagctgtggtatacaagacatatttgttttctgcaccagaggggaactgtcaaaata 
tagagtcccaaaccttctggatctctaccagcaatgtggaattatcacccatcatcatcc 
aatcgcagatggagggactcctgacatagccagctgctgtgaaataatggaagagcttac 
aacctgccttaaaaattaccgaaaaaccttaatacactgctatggaggacttgggagatc 
ttgtcttgtagctgcttgtctcctactatacctgtctgacacaatatcaccagagcaagc 
catagacagcctgcgagacctaagaggatccggggcaatacagaccatcaagcaatacaa 
ttatcttcatgagtttcgggacaaattagctgcacatctatcatcaagagattcacaatc 
aagatctgtatcaagataaaggaattcaaatagcatatatatgaccatgtctgaaatgtc 
agttctctagcataatttgtattgaaatgaaaccaccagtgttatcaacttgaatgtaaa 
tgtacatgtgcagatattcctaaagttttattgacaaaaaaaaaaaraaaamrwaaaaaa 
aaaaaaaaaaaaaaaaggggggggggcaaaaaaatttcccggggggggcccaaacccccc 
tttttcttgaaaacagggggcgactcctaaagagaggaggaatattatataaggacaagg 
ggagcggacgcgcgttttaaaaaaacgtgttctgtgagtaggacaaaaatggcttgagtg 
ggcatttctttgaggaaagcacaccatcagttggggagcagggaaaattagtagggacac 
cactcaccgagaagagtaaatgaaacgcacgcgagggggataaataaacaatggggagta 
gataatgttgatacaccgagcatcaaatgtgtggagaagaagattaccaagaa 

SEQ ID NO: 122. >HUMPTPB_T6 # TY Transcript # LN 1462 # Source Gene: 

HUMPTPB # Encoded protein: HUMPTPB_P9 

gaaccagtcagcgattagaggccgagtcttcggccacccaaaggcggagtaagaaaccag 
aagcggatctgattggttgctggaagacgccgcgcccacctcacagaaggacgaaccagt 
gagctaagctgcggggcgcgggctcggccggggcaccggtgagtcgccggcgctgcagag 
ggaggcggcactggtctcgacgtggggcggccagcgatgaagccgcccagttcaatacaa 
acaagtgagtttgactcatcagatgaagagcctattgaagatgaacagactccaattcat 
atatcatggctatctttgtcacgagtgaattgttctcagtttctcggtttatgtgctctt 
ccaggttgtaaatttaaagatgttagaagaaatgtccaaaaagatacagaagaactaaag 
agctgtggtatacaagacatatttgttttctgcaccagaggggaactgtcaaaatataga 
gtcccaaaccttctggatctctaccagcaatgtggaattatcacccatcatcatccaatc 
gcagatggagggactcctgacatagccagctgctgtgaaataatggaagagcttacaacc 
tgccttaaaaattaccgaaaaaccttaatacactgctatggaggacttgggagatcttgt 
cttgtagctgcttgtctcctactatacctgtctgacacaatatcaccagagcaagccata 
gacagcctgcgagacctaagaggatccggggcaatacagaccatcaaggatctacctact 
tctcagtttttgccccagtccgttttgggaatggatcctctcacctaggacaatgagatg 
gttattgttaatacaaacttgcaatacaattatcttcatgagtttcgggacaaattagct 
gcacatctatcatcaagagattcacaatcaagatctgtatcaagataaaggaattcaaat 
agcatatatatgaccatgtctgaaatgtcagttctctagcataatttgtattgaaatgaa 
accaccagtgttatcaacttgaatgtaaatgtacatgtgcagatattcctaaagttttat 
tgacaaaaaaaaaaaraaaamrwaaaaaaaaaaaaaaaaaaaaaaggggggggggcaaaa 
aaatttcccggggggggcccaaacccccctttttcttgaaaacagggggcgactcctaaa 
gagaggaggaatattatataaggacaaggggagcggacgcgcgttttaaaaaaacgtgtt 
ctgtgagtaggacaaaaatggcttgagtgggcatttctttgaggaaagcacaccatcagt 
tggggagcagggaaaattagtagggacaccactcaccgagaagagtaaatgaaacgcacg 
cgagggggataaataaacaatggggagtagataatgttgatacaccgagcatcaaatgtg 
tggagaagaagattaccaagaa 

SEQ ID NO: 123 . >HUMPTPB_T7 # TY Transcript # LN 1348 # Source Gene: 

HUMPTPB # Encoded protein: HUMPTPB_P10 

gaaccagtcagcgattagaggccgagtcttcggccacccaaaggcggagtaagaaaccag 
aagcggatctgattggttgctggaagacgccgcgcccacctcacagaaggacgaaccagt 
gagctaagctgcggggcgcgggctcggccggggcaccggtgagtcgccggcgctgcagag 
ggaggcggcactggtctcgacgtggggcggccagcgatgaagccgcccagttcaatacaa 
acaagtgagtttgactcatcagatgaagagcctattgaagatgaacagactccaattcat 
atatcatggctatctttgtcacgagtgaattgttctcagtttctcggtttatgtgctctt 
ccaggttgtaaatttaaagatgttagaagaaatgtccaaaaagatacagaagaactaaag 
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agctgtggtatacaagacatatttgttttctgcaccagaggggaactgtcaaaatataga 
gtcccaaaccttctggatctctaccagcaatgtggaattatcacccatcatcatccaatc 
gcagatggagggactcctgacatagccagctgctgtgaaataatggaagagcttacaacc 
tgccttaaaaactgctatggaggacttgggagatcttgtcttgtagctgcttgtctccta 
ctatacctgtctgacacaatatcaccagagcaagccatagacagcctgcgagacctaaga 
ggatccggggcaatacagaccatcaagcaatacaattatcttcatgagtttcgggacaaa 
ttagctgcacatctatcatcaagagattcacaatcaagatctgtatcaagataaaggaat 
tcaaatagcatatatatgaccatgtctgaaatgtcagttctctagcataatttgtattga 
aatgaaaccaccagtgttatcaacttgaatgtaaatgtacatgtgcagatattcctaaag 
ttttattgacaaaaaaaaaaaraaaamrwaaaaaaaaaaaaaaaaaaaaaaggggggggg 
gcaaaaaaatttcccggggggggcccaaacccccctttttcttgaaaacagggggcgact 
cctaaagagaggaggaatattatataaggacaaggggagcggacgcgcgttttaaaaaaa 
cgtgttctgtgagtaggacaaaaatggcttgagtgggcatttctttgaggaaagcacacc 
atcagttggggagcagggaaaattagtagggacaccactcaccgagaagagtaaatgaaa 
cgcacgcgagggggataaataaacaatggggagtagataatgttgatacaccgagcatca 
aatgtgtggagaagaagattaccaagaa 

SEQ ID NO: 124. >HUMPTPB_T8 # TY Transcript # LN 1337 # Source Gene: 

HUMPTPB # Encoded protein: HUMPTPBJP7 

aggtgcaacttccttcggtcgtcccgaatccgggttcatccgacaccagccgcctccacc 
atgccgccgaagttcgaccccaacgagatcaaagtcgtatacctgaggtgcaccggaggt 
gaagtcggtgccacttctgccctggcccccaagatcggccccctgggtctgtctccaaaa 
aagttggtgatgacattgccaaggcaacgggtgactggaagggcctgaggattacagtga 
aactgaccattcagaacagacaggcccagattgaggtggtgccttctgcctctgccctga 
tcatcaaagccctcaaggaaccaccaagagacagaaagaaacagaaaaacattaaacaca 
gtgggaatatcacttttgatgagattgtcaacattgctcgacagatgcggcaccgatcct 
tagccagagaactctctggaaccattaaagagatcctggggactgcccagtcagtgggct 
gtaatgttgatggccgccatcctcatgacatcatcgatgacatcaacagtggtgctgtgg 
aatgcccagccagttaagcacaaaggaaaacatttcaataaaggatcatttgacaactgg 
tgaaaaaaaaaaaaaaaggagatcttgtcttgtagctgcttgtctcctactatacctgtc 
tgacacaatatcaccagagcaagccatagacagcctgcgagacctaagaggatccggggc 
aatacagaccatcaagcaatacaattatcttcatgagtttcgggacaaattagctgcaca 
tctatcatcaagagattcacaatcaagatctgtatcaagataaaggaattcaaatagcat 
atatatgaccatgtctgaaatgtcagttctctagcataatttgtattgaaatgaaaccac 
cagtgttatcaacttgaatgtaaatgtacatgtgcagatattcctaaagttttattgaca 
aaaaaaaaaaraaaamrwaaaaaaaaaaaaaaaaaaaaaaggggggggggcaaaaaaatt 
tcccggggggggcccaaacccccctttttcttgaaaacagggggcgactcctaaagagag 
gaggaatattatataaggacaaggggagcggacgcgcgttttaaaaaaacgtgttctgtg 
agtaggacaaaaatggcttgagtgggcatttctttgaggaaagcacaccatcagttgggg 
agcagggaaaattagtagggacaccactcaccgagaagagtaaatgaaacgcacgcgagg 
gggataaataaacaatggggagtagataatgttgatacaccgagcatcaaatgtgtggag 
aagaagattaccaagaa 

SEQ ID NO: 125 . >HUMPTPB_T9 ft TY Transcript # LN 812 # Source Gene: HUMPTPB 
# Encoded protein: 

caattacaccagtcttctgtatcaacagaagtcttattaaatataaaatgctgtaccctg 
tcactagttatttaataacatattatttttctgtcatgttccattagtagctgcttgtct 
cctactatacctgtctgacacaatatcaccagagcaagccatagacagcctgcgagacct 
aagaggatccggggcaatacagaccatcaagcaatacaattatcttcatgagtttcggga 
caaattagctgcacatctatcatcaagagattcacaatcaagatctgtatcaagataaag 
gaattcaaatagcatatatatgaccatgtctgaaatgtcagttctctagcataatttgta 
ttgaaatgaaaccaccagtgttatcaacttgaatgtaaatgtacatgtgcagatattcct 
aaagttttattgacaaaaaaaaaaaraaaamrwaaaaaaaaaaaaaaaaaaaaaaggggg 
gggggcaaaaaaatttcccggggggggcccaaacccccctttttcttgaaaacagggggc 
gactcctaaagagaggaggaatattatataaggacaaggggagcggacgcgcgttttaaa 
aaaacgtgttctgtgagtaggacaaaaatggcttgagtgggcatttctttgaggaaagca 
caccatcagttggggagcagggaaaattagtagggacaccactcaccgagaagagtaaat 
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gaaacgcacgcgagggggataaataaacaatggggagtagataatgttgatacaccgagc 
atcaaatgtgtggagaagaagattaccaagaa 

SEQ ID NO: 126 . >HUMPTPB_P1 # TY Protein # CC #LN 212 # Source Gene: 

HUMPTPB # Encoding Transcript: 1 

MKPPSSIQTSEFDSSDEEPIEDEQTPIHISWLSLSRVNCSQFLGLCALPGCKFKDVRRNV 
QKDTEELKSCGIQDIFVFCTRGELSKYRVPNLLDLYQQCGIITHHHPIADGGTPDIASCC 
EIMEELTTCLKNYRKTLIHCYGGLGRSCLVAACLLLYLSDTISPEQAIDSLRDLRGSGAI 
QTIKQYNYLHEFRDKLAAHLSSRDSQSRSVSR 

SEQ ID NO: 127. >HUMPTPB_P2 # TY Protein # CC #LN 172 # Source Gene: 

HUMPTPB # Encoding Transcript: 3 

MKPPSSIQTSCKFKDVRRNVQKDTEELKSCGIQDIFVFCTRGELSKYRVPNLLDLYQQCG 
IITHHHPIADGGTPDIASCCEIMEELTTCLKNYRKTLIHCYGGLGRSCLVAACLLLYLSD 
TISPEQAIDSLRDLRGSGAIQTIKQYNYLHEFRDKIAAHLSSRDSQSRSVSR 

SEQ ID NO: 128. >HUMPTPB_P8 # TY Protein # CC #LN 166 # Source Gene: 

HUMPTPB # Encoding Transcript: 4 

MKPPSSIKDVRRNVQKDTEELKSCGIQDIFVFCTRGELSKYRVPNLLDLYQQCGIITHHH 
PIADGGTPDIASCCEIMEELTTCLKNYRKTLIHCYGGLGRSCLVAACLLLYLSDTISPEQ 
AIDSLRDLRGSGAIQTIKQYNYLHEFRDKLAAHLSSRDSQSRSVSR 

SEQ ID NO: 129. >HUMPTPB_P9 # TY Protein # CC #LN 136 # Source Gene: 

HUMPTPB # Encoding Transcript: 6 

MKPPSSIQTSEFDSSDEEPIEDEQTPIHISWLSLSRVNCSQFLGLCALPGCKFKDVRRNV 
QKDTEELKSCGIQDIFVFCTRGELSKYRVPNLLDLYQQCGIITHHHPIADGGTPDIASCC 
EIMEELTTCLKNYRKT 



SEQ ID NO: 130. >HUMPTPB__P10 # TY Protein # CC #LN 207 # Source Gene: 

HUMPTPB # Encoding Transcript: 7 

MKPPSSIQTSEFDSSDEEPIEDEQTPIHISWLSLSRVNCSQFLGLCALPGCKFKDVRRNV 
QKDTEELKSCGIQDIFVFCTRGELSKYRVPNLLDLYQQCGIITHHHPVADGGTPDIASCC 
EIMEELTTCLTL I HCYGGLGRSCLVAACLLLYLSDTI S PEQAI DSLRDLRGSGAIQT I KQ 
YSYLHEFRDILAAHLSSRDSQSRSVSR 



SEQ ID NO: 131. >HUMPTPB_P7 # TY Protein # CC #LN 185 # Source Gene: 

HUMPTPB # Encoding Transcript: 8 

RCNFLRSSRIRVHPTPAASTMPPKFDPNEIKWYLRCTGGEVRCHFCPGPQDRPPGSVSK 
KVGDDIAKATGDWKGLRITVKLTIQNRQAQI EWPSASALI IKALKEPPRDRKKQKNIKH 
SGNITFDEIVNIARQMRHRSLARELSGTIKEILGTAQSVGCNVDGRHPHDIIDDINSGAV 
ECPAS 



SEQ ID NO: 132. >R09837 # TY Consensus # Length 7115 # Number of exons 31 

gagcgaggcctgggtggcgttctcaagacgaggtgactggtgggcggtgaggggggaaag 

ccccggatgttcgttggggattcaacatggcggcgggagtgtccgcggtggtggcggtgc 

aagagagctgagggaggcgcgagggcgcggagttccaggtcgagcagttaggccgcgagc 

gactgcggcgccgagccgccttcctgctgcccaccccgcgtccctccagactgggtcccg 

tgctcccgtcccccagccctgaccaccctgggcagctgctgcatggggacagatgagtaa 

cccgaagcccctagaggagtggtcacctgcctgagggcacttctgtcccaccagcatcag 

accaggtgtgtgcaggctctctatgggatgccgccatcagaatgtctcttcctcagcccc 

tttgttttttaaagtggctttggtcgaccatttgctctgtgcttggtactgtgtcaaggg 

cccgattgaagaccacagatcctaaagataggggagtgtcgcgacaaggctgcctggggt 
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Figure 1 (Cont'd) 



gccctcttaacccaggggatcagtaaccctgtgctgagtcaatatccaaaggccgcaccg 
agtccccggcaccatgtttgggaagaggaagaagcgggtggagatctccgcgccgtccaa 
cttcgagcaccgcgtgcacacgggcttcgaccagcacgagcagaagttcacggggctgcc 
ccgccagtggcagagcctgatcgaggagtcggctcgccggcccaagcccctcgtcgaccc 
cgcctgcatcacctccatccagcccggggcccccaagaccatcgtgcggggcagcaaagg 
tgccaaagatggggccctcacgctgctgctggacgagtttgagaacatgtcggtgacacg 
ctccaactccctgcggagagacagcccgccgccgcccgcccgtgcccgccaggaaaatgg 
gatgccagaggagccggccaccacggccagagggggcccagggaaggcaggcagccgagg 
ccggttcgccggtcacagcgaggcgggtggcggcagtggtgacaggcgacgggcggggcc 
agagaagaggcccaagtcttccagggagggctcagggggtccccaggagtcctcccggga 
caaacgccccctctccgggcctgatgtcggcaccccccagcctgctggtctggccagtgg 
ggcgaaactggcagctggccggccctttaacacctacccgagggctgacacggaccaccc 
atcccggggtgcccagggggagcctcatgacgtggcccctaacgggccatcagcgggggg 
cctggccatcccccagtcctcctcctcctcctcccggcctcccacccgagcccgaggtgc 
ccccagccctggagtgctgggaccccacgcctcagagccccagctggcccctccagcctg 
cacccccgccgcccctgctgttcctgggccccccacaccagctcggggttgcacaccagc 
tcggggttgcacaccagctcggggttgcacaccagctcggggttgcacaccagctcgggg 
ttgcacaccaactcggggttgcacaccagctcggggttgcacaccagctcggggttgcac 
accagctcggggtttggcccccgctcaccacagcgggagccacagcgagtatcccatgag 
cagttccgggctgccctgcagctggtggtggacccaggcgacccccgctcctacctggac 
aacttcatcaagattggcgagggctccacgggcatcgtgtgcatcgccaccgtgcgcagc 
tcgggcaagctggtggccgtcaagaagatggacctgcgcaagcagcagaggcgcgagctg 
ctcttcaacgaggtggtaatcatgagggactaccagcacgagaatgtggtggagatgtac 
aacagctacctggtgggggacgagctctgggtggtcatggagttcctggaaggaggcgcc 
ctcaccgacatcgtcacccacaccaggatgaacgaggagcagatcgcggccgtgtgcctt 
gcagtgctgcaggccctgtcggtgctccacgcccagggcgtcatccaccgggacatcaag 
agcgactcgatcctgctgacccatgatggcagggtgaagctgtcagactttgggttctgc 
gcccaggtgagcaaggaagtgccccgaaggaagtcgctggtcggcacgccctactggatg 
gccccagagctcatctcccgccttccctacgggccagaggtagacatctggtcgctgggg 
ataatggtgattgagatggtggacggagagcccccctacttcaacgagccacccctcaaa 
gccatgaagatgattcgggacaacctgccaccccgactgaagaacctgcacaaggtgtcg 
ccatccctgaagggcttcctggaccgcctgctggtgcgagaccctgcccagcgggccacg 
gcagccgagctgctgaagcacccattcctggccaaggcagggccgcctgccagcatcgtg 
cccctcatgcgccagaaccgcaccagatgaggcccagcgcccttcccctcaaccaaagag 
ccccccgggtcacccccgccccactgaggccagtagggggccaggcctcccactcctccc 
agcccgggagatgctccgcgtggcaccaccctccttgctgggggtagatgagaccctact 
actgaactccagttttgatctcgtgacttttagaaaaacacagggactcgtgggagcaag 
cgaggctcccaggacccccaccctctgggacaggccctcccccatgttcttctgtctcca 
ggaagggcagcggccctcccatcactggaagtctgcagtgggggtcgctgggggtggaga 
gaacactaagaggtgaacatgtatgagtgtgtgcacgcgtgtgagtgtgcatgtgtgtgt 
gtgtgcaaaggtccagccaccccgtcctccagcctgcaaggggtgtctggcgccttgcct 
gacacccagccccctctccccctgagccattgtgggggtcgatcatgaatgtccgaagag 
tggccttttcccgtagccctgcgccccctttctgtggctggatggggagacaggtcaggg 
ccccccaccctctccagcccctgcagcaaatgactactgcacctggacagcctcctcttt 
tctagaagtctatttatattgtcattttataacactctagcccctgcccttattggggga 
cagatggtccctgtcctgcggggtggccctggcagaaccactgcctgaagaaccaggttc 
ctgcccggtcagcgcagccccagcccgcccacccctgcctcgagttagttttacaattaa 
aacattgtcttgttttcttgttttgtgtctgtgtgcgatgtgtggggggcagggggccct 
gcccggctgtcttgggtgggaatttgcagggagagggtctggatctgggagcaaaccacg 
attccagccaaggcagggcaagggtggggtggggagtggggagttcaggtcatagcagcc 
agtaagctcccccagcctgccactccccagaatggggcaggattgtccccacccctggaa 
gcagccagtttgccacagtccatgtgcagactgatcccagtttgccaaatctgcaatttc 
ctggaaccttttaaaggctgtcttgagcgcgtttggtgagtaggagctaacccaagttag 
taaattgaaggccatttggcaaattggtcagtgggcagatgggcttttggggattgactg 
aggctgactggcctggagctgctggcttcggagagacaccctgtgaagtgtgtccttcca 
cgcaggagcccagagccgagcccacgctggggggaatctgactggcatggaggtggccat 
gccaccatcgctgctgcagctgcatcctggcactttgcgcctcaggccctgttgggctcc 
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Figur 1 (Cont'd) 



actttctgcatcctccccagcccccagggaggcagtggagtggggagagagccaggagtg 
agcctccgtccccaaagccagccaggcgtcatcagcaccagagacctcagcctggtcctc 
tcgggaagtgagtggccagggcagagattccaggttagtccacgcctcccacccttcaca 
ggtcctgaccccaagaatcagagcactgtgtgtgtggcagggcctatgccaagtgcaaac 
acagcctagatggatcatcacagagtgaaacccagcggtgcaagcagctgtgctctctgc 
gatgtattggaggcttaggtgaggtggatgcctttctggaaaaaaaaaaaatgctaacat 
tggcaaaagaagaaatagaaaacaagaccaaaataactgtctcctcactgcacacacact 
ccagaataaataaaaggtttcaggcttgaatgcactttcaaatgagattttttttttttt 
tttgagacggagtatcgctctgtcgcccaggctggagtgcrgtggcacgrtcttggctcr 
ctgcaacttctgcctcctgggttcaagcgattctcctgcctcagcctcccargtrgctgg 
gattrcaggcgcttgccaccacgccrggctarttttttgtattttgrgtagagactggrt 
ttcgccatcttggccagactggtcttgaactcctgacctcgtgatccrcccacctcagcc 
tcccaargtgctgggattacaggcatgagccaccatgcctggcctcaaatgaggtttacc 
agactttgaaggagcaggtaattccttctaccttgtgaacaagttgttccagaaagatag 
cagctcaggaggcctctgtgaccatggttccagacccagataaggacggcaaagaacaga 
gcatctcagaaacgcaaggctcacagccagggtgcccggcgaccccacgggcactgagaa 
cagctagctctaggagctccactctcctgctgaagaaaccacgggctcagagacggggag 
ctccctcgcccagccacatctgtgaccccacggtaactctgctggttttggtgccttcag 
tcactcactgcaggtttgttttgttttgttttgttttgagtttttttgttttttgttttt 
tgtttttagagacrrgatgttgctatgttgcctaggctggtttcgaactcctggcctcar 
gggatcctccctccttggccccgcaargtgctgggattrcaggcgtgagccrccgtgccc 
rgcccgctcactgcagtttgaaggcatggctttgggtggcgtggggtgaaagctgcccga 
ggccccgttcctccccacgtggctgcctcctgccagagccagtcaggaaaacagacccca 
actagagttgtttcaaatggcagggatttggtaccggtggttggatcatgacaaagctct 
gagaaggctggaggagccacagagtgccaagtgcccagcaatcattagaggaaggaggct 
gctgccacctgtgtggctagaggaacagaggggccaatggcattccccaaaccccactct 
cgcctctgtctggccagagcagaatggcttcttccagcttcccaccctggactcccaccc 
aggagcctcctcctggcagacccttcctgaccccaccggcccggggggtctacagatcca 
tgtttcaggcgtccgcctggagcggaacaggggagtgcttaggacaagggtggtgccaga 
ggatccactctgcccacatttagttgaccagctgaggcactccacgggaatgaatgactc 
tcgacaggtgccggaggtgaggaggggcccggaggcccaggaggggcacagggatggatt 
cgtccgcctgggggctggaggtgtgtttacagagccccaaaataaacaatgcaaccaggt 
cagaccagcggttctcacagtgtggttcccagaccagcagcatcactgggagcttactgg 
acacgcaaataaatccctgtgtgccaccccagctgcatcagatgctggtggggcccagtg 
atctgtatttaacacaccctccgggggatgccggtgcccactcacgtttgagaacccctg 
cgatccacgactgccctcccgtgtaaaaggcccacctctgtgggactccaagt cat cage 
accctagggtccttccgtctttttccttcctcctgggacacctgcctctcccatgtcgta 
ttagagaattccttatgctcccaagtgggcacggggagaggaaggcactcctctttaagg 
accgacccagaggttttgccattgcttcactggccagagcttagtcacgcagcctcaccc 
agaggcaagggaggttggaaaatgtagtgtttgtgtgtgtctaacacaaattctattacc 
atgcagtcaggattctccactcttgctctttcattagatttgctgggcttcaccctggac 
tttctgatttagtgacagaacagagaacccagaggcagacccagatgtgtacaagggctt 
catatacaatcaggagatttaataatcatgctaggggccgggtgcagtggctcacgcctg 
taatcccaagcactttggggagccgaggcaggcrgatcacttgaggtcaggagtttgaga 
ccagcctgggcaacaaagtgagaccctgtctctactaaaaataacaaaaattagccgggc 
gtggtggtgggygcctgtaatcccagctcctcgggtggctgaggcatgagaatcacttga 
acccaggaggcagaggtttcagtgagctgagatcacatcactgcactccagcctgggtga 
cagagtgagattcygtcactgacaccattgtccgcagatctcggagcactcttattccgg 
gggcggacacggcggcgtttgcgacttcgccccgtggccattgctgtttctcggcccgcg 
ttttcttggtccgggtggcgaatctgccggtcccctatgggagccataatttcgtttcgc 
cggttcctgcaggagtctccggacaagccctctcgggcctgttcgggccctgctgtggtc 
tgcgtggggaactggtggcggcttaactaccgggtgcgccctccggtcccccctggctgg 
gcgctccctcccggagcggccccttggtgtccttg 

SEQ ID NO: 133. >R09837_T1 # TY Transcript # LN 2869 # Source Gene: R09837 
# Encoded protein: R09837_P1 

actggtgggcggtgaggggggaaagccccggatgttcgttggggattcaacatggcggcg 
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Figur 1 (Cont'd) 



ggagtgtccgcggtggtggcggtgcaagagagctgagggaggcgcgagggcgcggagttc 
caggtcgagcagttaggccgcgagcgactgcggcgccgagccgatgagtaacccgaagcc 
cctagaggagtggtcacctgcctgagggcacttctgtcccaccagcatcagaccaggtgt 
gtgcaggccgcaccgagtccccggcaccatgtttgggaagaggaagaagcgggtggagat 
ctccgcgccgtccaacttcgagcaccgcgtgcacacgggcttcgaccagcacgagcagaa 
gttcacggggctgccccgccagtggcagagcctgatcgaggagtcggctcgccggcccaa 
gcccctcgtcgaccccgcctgcatcacctccatccagcccggggcccccaagaccatcgt 
gcggggcagcaaaggtgccaaagatggggccctcacgctgctgctggacgagtttgagaa 
catgtcggtgacacgctccaactccctgcggagagacagcccgccgccgcccgcccgtgc 
ccgccaggaaaatgggatgccagaggagccggccaccacggccagagggggcccagggaa 
ggcaggcagccgaggccggttcgccggtcacagcgaggcgggtggcggcagtggtgacag 
gcgacgggcggggccagagaagaggcccaagtcttccagggagggctcagggggtcccca 
ggagtcctcccgggacaaacgccccctctccgggcctgatgtcggcaccccccagcctgc 
tggtctggccagtggggcgaaactggcagctggccggccctttaacacctacccgagggc 
tgacacggaccacccatcccggggtgcGcagggggagcctcatgacgtggcccctaacgg 
gccatcagcggggggcctggccatcccccagtcctcctcctcctcctcccggcctcccac 
ccgagcccgaggtgcccccagccctggagtgctgggaccccacgcctcagagccccagct 
ggcccctccagcctgcacccccgccgcccctgctgttcctgggccccctggcccccgctc 
accacagcgggagccacagcgagtatcccatgagcagttccgggctgccctgcagctggt 
ggtggacccaggcgacccccgctcctacctggacaacttcatcaagattggcgagggctc 
cacgggcatcgtgtgcatcgccaccgtgcgcagctcgggcaagctggtggccgtcaagaa 
gatggacctgcgcaagcagcagaggcgcgagctgctcttcaacgaggtggtaatcatgag 
ggactaccagcacgagaatgtggtggagatgtacaacagctacctggtgggggacgagct 
ctgggtggtcatggagttcctggaaggaggcgccctcaccgacatcgtcacccacaccag 
gatgaacgaggagcagatcgcggccgtgtgccttgcagtgctgcaggccctgtcggtgct 
ccacgcccagggcgtcatccaccgggacatcaagagcgactcgatcctgctgacccatga 
tggcagggtgaagctgtcagactttgggttctgcgcccaggtgagcaaggaagtgccccg 
aaggaagtcgctggtcggcacgccctactggatggccccagagctcatctcccgccttcc 
ctacgggccagaggtagacatctggtcgctggggataatggtgattgagatggtggacgg 
agagcccccctacttcaacgagccacccctcaaagccatgaagatgattcgggacaacct 
gccaccccgactgaagaacctgcacaaggtgtcgccatccctgaagggcttcctggaccg 
cctgctggtgcgagaccctgcccagcgggccacggcagccgagctgctgaagcacccatt 
cctggccaaggcagggccgcctgccagcatcgtgcccctcatgcgccagaaccgcaccag 
atgaggcccagcgcccttcccctcaaccaaagagccccccgggtcacccccgccccactg 
aggccagtagggggccaggcctcccactcctcccagcccgggagatgctccgcgtggcac 
caccctccttgctgggggtagatgagaccctactactgaactccagttttgatctcgtga 
cttttagaaaaacacagggactcgtgggagcaagcgaggctcccaggacccccaccctct 
gggacaggccctcccccatgttcttctgtctccaggaagggcagcggccctcccatcact 
ggaagtctgcagtgggggtcgctgggggtggagagaacactaagaggtgaacatgtatga 
gtgtgtgcacgcgtgtgagtgtgcatgtgtgtgtgtgtgcaaaggtccagccaccccgtc 
ctccagcctgcaaggggtgtctggcgccttgcctgacacccagccccctctccccctgag 
ccattgtgggggtcgatcatgaatgtccgaagagtggccttttcccgtagccctgcgccc 
cctttctgtggctggatggggagacaggtcagggccccccaccctctccagcccctgcag 
caaatgactactgcacctggacagcctcctcttttctagaagtctatttatattgtcatt 
ttataacactctagcccctgcccttattgggggacagatggtccctgtcctgcggggtgg 
ccctggcagaaccactgcctgaagaaccaggttcctgcccggtcagcgcagccccagccc 
gcccacccctgcctcgagttagttttacaattaaaacattgtcttgttt 

SEQ ID NO: 134. >R09837_T2 # TY Transcript # LN 2855 # Source Gene: R09837 
# Encoded protein: R09837JP1 

gagcgaggcctgggtggcgttctcaagacgaggtgacatggcggcgggagtgtccgcggt 
ggtggcggtgcaagagagctgagggaggcgcgagggcgcggagttccaggtcgagcagtt 
aggccgcgagcgactgcggcgccgagccgatgagtaacccgaagcccctagaggagtggt 
cacctgcctgagggcacttctgtcccaccagcatcagaccaggtgtgtgcaggccgcacc 
gagtccccggcaccatgtttgggaagaggaagaagcgggtggagatctccgcgccgtcca 
acttcgagcaccgcgtgcacacgggcttcgaccagcacgagcagaagttcacggggctgc 
cccgccagtggcagagcctgatcgaggagtcggctcgccggcccaagcccctcgtcgacc 
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Figure 1 (Cont'd) 



ccgcctgcatcacctccatccagcccggggcccccaagaccatcgtgcggggcagcaaag 
gtgccaaagatggggccctcacgctgctgctggacgagtttgagaacatgtcggtgacac 
gctccaactccctgcggagagacagcccgccgccgcccgcccgtgcccgccaggaaaatg 
ggatgccagaggagccggccaccacggccagagggggcccagggaaggcaggcagccgag 
gccggttcgccggtcacagcgaggcgggtggcggcagtggtgacaggcgacgggcggggc 
cagagaagaggcccaagtcttccagggagggctcagggggtccccaggagtcctcccggg 
acaaacgccccctctccgggcctgatgtcggcaccccccagcctgctggtctggccagtg 
gggcgaaactggcagctggccggccctttaacacctacccgagggctgacacggaccacc 
catcccggggtgcccagggggagcctcatgacgtggcccctaacgggccatcagcggggg 
gcctggccatcccccagtcctcctcctcctcctcccggcctcccacccgagcccgaggtg 
cccccagccctggagtgctgggaccccacgcctcagagccccagctggcccctccagcct 
gcacccccgccgcccctgctgttcctgggccccctggcccccgctcaccacagcgggagc 
cacagcgagtatcccatgagcagttccgggctgccctgcagctggtggtggacccaggcg 
acccccgctcctacctggacaacttcatcaagattggcgagggctccacgggcatcgtgt 
gcatcgccaccgtgcgcagctcgggcaagctggtggccgtcaagaagatggacctgcgca 
agcagcagaggcgcgagctgctcttcaacgaggtggtaatcatgagggactaccagcacg 
agaatgtggtggagatgtacaacagctacctggtgggggacgagctctgggtggtcatgg 
agttcctggaaggaggcgccctcaccgacatcgtcacccacaccaggatgaacgaggagc 
agatcgcggccgtgtgccttgcagtgctgcaggccctgtcggtgctccacgcccagggcg 
tcatccaccgggacatcaagagcgactcgatcctgctgacccatgatggcagggtgaagc 
tgtcagactttgggttctgcgcccaggtgagcaaggaagtgccccgaaggaagtcgctgg 
tcggcacgccctactggatggccccagagctcatctcccgccttccctacgggccagagg 
tagacatctggtcgctggggataatggtgattgagatggtggacggagagcccccctact 
tcaacgagccacccctcaaagccatgaagatgattcgggacaacctgccaccccgactga 
agaacctgcacaaggtgtcgccatccctgaagggcttcctggaccgcctgctggtgcgag 
accctgcccagcgggccacggcagccgagctgctgaagcacccattcctggccaaggcag 
ggccgcctgccagcatcgtgcccctcatgcgccagaaccgcaccagatgaggcccagcgc 
ccttcccctcaaccaaagagccccccgggtcacccccgccccactgaggccagtaggggg 
ccaggcctcccactcctcccagcccgggagatgctccgcgtggcaccaccctccttgctg 
ggggtagatgagaccctactactgaactccagttttgatctcgtgacttttagaaaaaca 
cagggactcgtgggagcaagcgaggctcccaggacccccaccctctgggacaggccctcc 
cccatgttcttctgtctccaggaagggcagcggccctcccatcactggaagtctgcagtg 
ggggtcgctgggggtggagagaacactaagaggtgaacatgtatgagtgtgtgcacgcgt 
gtgagtgtgcatgtgtgtgtgtgtgcaaaggtccagccaccccgtcctccagcctgcaag 
gggtgtctggcgccttgcctgacacccagccccctctccccctgagccattgtgggggtc 
gatcatgaatgtccgaagagtggccttttcccgtagccctgcgccccctttctgtggctg 
gatggggagacaggtcagggccccccaccctctccagcccctgcagcaaatgactactgc 
acctggacagcctcctcttttctagaagtctatttatattgtcattttataacactctag 
cccctgcccttattgggggacagatggtccctgtcctgcggggtggccctggcagaacca 
ctgcctgaagaaccaggttcctgcccggtcagcgcagccccagcccgcccacccctgcct 
cgagttagttttacaattaaaacattgtcttgttt 

SEQ ID NO: 135. >R09837_T3 # TY Transcript # LN 3085 # Source Gene: R09837 
# Encoded protein: R09837_P1 

actggtgggcggtgaggggggaaagccccggatgttcgttggggattcaacatggcggcg 
ggagtgtccgcggtggtggcggtgcaagagagctgagggaggcgcgagggcgcggagttc 
caggtcgagcagttaggccgcgagcgactgcggcgccgagccgatgagtaacccgaagcc 
cctagaggagtggtcacctgcctgagggcacttctgtcccaccagcatcagaccaggtgt 
gtgcaggctctctatgggatgccgccatcagaatgtctcttcctcagcccctttgttttt 
taaagtggctttggtcgaccatttgctctgtgcttggtactgtgtcaagggcccgattga 
agaccacagatcctaaagataggggagtgtcgcgacaaggctgcctggggtgccctctta 
acccaggggatcagtaaccctgtgctgagtcaatatccaaaggccgcaccgagtccccgg 
caccatgtttgggaagaggaagaagcgggtggagatctccgcgccgtccaacttcgagca 
ccgcgtgcacacgggcttcgaccagcacgagcagaagttcacggggctgccccgccagtg 
gcagagcctgatcgaggagtcggctcgccggcccaagcccctcgtcgaccccgcctgcat 
cacctccatccagcccggggcccccaagaccatcgtgcggggcagcaaaggtgccaaaga 
tggggccctcacgctgctgctggacgagtttgagaacatgtcggtgacacgctccaactc 
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Figure 1 (Cont'd) 

cctgcggagagacagcccgccgccgcccgcccgtgcccgccaggaaaatgggatgccaga 
ggagccggccaccacggccagagggggcccagggaaggcaggcagccgaggccggttcgc 
cggtcacagcgaggcgggtggcggcagtggtgacaggcgacgggcggggccagagaagag 
gcccaagtcttccagggagggctcagggggtccccaggagtcctcccgggacaaacgccc 
cctctccgggcctgatgtcggcaccccccagcctgctggtctggccagtggggcgaaact 
ggcagctggccggccctttaacacctacccgagggctgacacggaccacccatcccgggg 
tgcccagggggagcctcatgacgtggcccctaacgggccatcagcggggggcctggccat 
cccccagtcctcctcctcctcctcccggcctcccacccgagcccgaggtgcccccagccc 
tggagtgctgggaccccacgcctcagagccccagctggcccctccagcctgcacccccgc 
cgcccctgctgttcctgggccccctggcccccgctcaccacagcgggagccacagcgagt 
atcccatgagcagttccgggctgccctgcagctggtggtggacccaggcgacccccgctc 
ctacctggacaacttcatcaagattggcgagggctccacgggcatcgtgtgcatcgccac 
cgtgcgcagctcgggcaagctggtggccgtcaagaagatggacctgcgcaagcagcagag 
gcgcgagctgctcttcaacgaggtggtaatcatgagggactaccagcacgagaatgtggt 
ggagatgtacaacagctacctggtgggggacgagctctgggtggtcatggagttcctgga 
aggaggcgccctcaccgacatcgtcacccacaccaggatgaacgaggagcagatcgcggc 
cgtgtgccttgcagtgctgcaggccctgtcggtgctccacgcccagggcgtcatccaccg 
ggacatcaagagcgactcgatcctgctgacccatgatggcagggtgaagctgtcagactt 
tgggttctgcgcccaggtgagcaaggaagtgccccgaaggaagtcgctggtcggcacgcc 
ctactggatggccccagagctcatctcccgccttccctacgggccagaggtagacatctg 
gtcgctggggataatggtgattgagatggtggacggagagcccccctacttcaacgagcc 
acccctcaaagccatgaagatgattcgggacaacctgccaccccgactgaagaacctgca 
caaggtgtcgccatccctgaagggcttcctggaccgcctgctggtgcgagaccctgccca 
gcgggccacggcagccgagctgctgaagcacccattcctggccaaggcagggccgcctgc 
cagcatcgtgcccctcatgcgccagaaccgcaccagatgaggcccagcgcccttcccctc 
aaccaaagagccccccgggtcacccccgccccactgaggccagtagggggccaggcctcc 
cactcctcccagcccgggagatgctccgcgtggcaccaccctccttgctgggggtagatg 
agaccctactactgaactccagttttgatctcgtgacttttagaaaaacacagggactcg 
tgggagcaagcgaggctcccaggacccccaccctctgggacaggccctcccccatgttct 
tctgtctccaggaagggcagcggccctcccatcactggaagtctgcagtgggggtcgctg 
ggggtggagagaacactaagaggtgaacatgtatgagtgtgtgcacgcgtgtgagtgtgc 
atgtgtgtgtgtgtgcaaaggtccagccaccccgtcctccagcctgcaaggggtgtctgg 
cgccttgcctgacacccagccccctctccccctgagccattgtgggggtcgatcatgaat 
gtccgaagagtggccttttcccgtagccctgcgccccctttctgtggctggatggggaga 
caggtcagggccccccaccctctccagcccctgcagcaaatgactactgcacctggacag 
cctcctcttttctagaagtctatttatattgtcattttataacactctagcccctgccct 
tattgggggacagatggtccctgtcctgcggggtggccctggcagaaccactgcctgaag 
aaccaggttcctgcccggtcagcgcagccccagcccgcccacccctgcctcgagttagtt 
ttacaattaaaacattgtcttgttt 

SEQ ID NO: 136 . >R09837_T4 # TY Transcript # LN 2772 # Source Gene: R09837 
# Encoded protein: R09837_P1 

gagcgaggcctgggtggcgttctcaagacgaggtgacatggcggcgggagtgtccgcggt 
ggtggcggtgcaagagagctgagggaggcgcgagggcgcggagttccaggtcgagcagtt 
aggccgcgagcgactgcggcgccgagccggccgcaccgagtccccggcaccatgtttggg 
aagaggaagaagcgggtggagatctccgcgccgtccaacttcgagcaccgcgtgcacacg 
ggcttcgaccagcacgagcagaagttcacggggctgccccgccagtggcagagcctgatc 
gaggagtcggctcgccggcccaagcccctcgtcgaccccgcctgcatcacctccatccag 
cccggggcccccaagaccatcgtgcggggcagcaaaggtgccaaagatggggccctcacg 
ctgctgctggacgagtttgagaacatgtcggtgacacgctccaactccctgcggagagac 
agcccgccgccgcccgcccgtgcccgccaggaaaatgggatgccagaggagccggccacc 
acggccagagggggcccagggaaggcaggcagccgaggccggttcgccggtcacagcgag 
gcgggtggcggcagtggtgacaggcgacgggcggggccagagaagaggcccaagtcttcc 
agggagggctcagggggtccccaggagtcctcccgggacaaacgccccctctccgggcct 
gatgtcggcaccccccagcctgctggtctggccagtggggcgaaactggcagctggccgg 
ccctttaacacctacccgagggctgacacggaccacccatcccggggtgcccagggggag 
cctcatgacgtggcccctaacgggccatcagcggggggcctggccatcccccagtcctcc 
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tcctcctcctcccggcctcccacccgagcccgaggtgcccccagccctggagtgctggga 
ccccacgcctcagagccccagctggcccctccagcctgcacccccgccgcccctgctgtt 
cctgggccccctggcccccgctcaccacagcgggagccacagcgagtatcccatgagcag 
ttccgggctgccctgcagctggtggtggacccaggcgacccccgctcctacctggacaac 
ttcatcaagattggcgagggctccacgggcatcgtgtgcatcgccaccgtgcgcagctcg 
ggcaagctggtggccgtcaagaagatggacctgcgcaagcagcagaggcgcgagctgctc 
ttcaacgaggtggtaatcatgagggactaccagcacgagaatgtggtggagatgtacaac 
agctacctggtgggggacgagctctgggtggtcatggagttcctggaaggaggcgccctc 
accgacatcgtcacccacaccaggatgaacgaggagcagatcgcggccgtgtgccttgca 
gtgctgcaggccctgtcggtgctccacgcccagggcgtcatccaccgggacatcaagagc 
gactcgatcctgctgacccatgatggcagggtgaagctgtcagactttgggttctgcgcc 
caggtgagcaaggaagtgccccgaaggaagtcgctggtcggcacgccctactggatggcc 
ccagagctcatctcccgccttccctacgggccagaggtagacatctggtcgctggggata 
atggtgattgagatggtggacggagagcccccctacttcaacgagccacccctcaaagcc 
atgaagatgattcgggacaacctgccaccccgactgaagaacctgcacaaggtgtcgcca 
tccctgaagggcttcctggaccgcctgctggtgcgagaccctgcccagcgggccacggca 
gccgagctgctgaagcacccattcctggccaaggcagggccgcctgccagcatcgtgccc 
ctcatgcgccagaaccgcaccagatgaggcccagcgcccttcccctcaaccaaagagccc 
cccgggtcacccccgccccactgaggccagtagggggccaggcctcccactcctcccagc 
ccgggagatgctccgcgtggcaccaccctccttgctgggggtagatgagaccctactact 
gaactccagttttgatctcgtgacttttagaaaaacacagggactcgtgggagcaagcga 
ggctcccaggacccccaccctctgggacaggccctcccccatgttcttctgtctccagga 
agggcagcggccctcccatcactggaagtctgcagtgggggtcgctgggggtggagagaa 
cactaagaggtgaacatgtatgagtgtgtgcacgcgtgtgagtgtgcatgtgtgtgtgtg 
tgcaaaggtccagccaccccgtcctccagcctgcaaggggtgtctggcgccttgcctgac 
acccagccccctctccccctgagccattgtgggggtcgatcatgaatgtccgaagagtgg 
ccttttcccgtagccctgcgccccctttctgtggctggatggggagacaggtcagggccc 
cccaccctctccagcccctgcagcaaatgactactgcacctggacagcctcctcttttct 
agaagtctatttatattgtcattttataacactctagcccctgcccttattgggggacag 
atggtccctgtcctgcggggtggccctggcagaaccactgcctgaagaaccaggttcctg 
cccggtcagcgcagccccagcccgcccacccctgcctcgagttagttttacaattaaaac 
attgtcttgttt 

SEQ ID NO: 137. >R09837__T5 # TY Transcript # LN 2599 # Source Gene: R09837 
# Encoded protein: R09837_P2 

actggtgggcggtgaggggggaaagccccggatgttcgttggggattcaacatggcggcg 
ggagtgtccgcggtggtggcggtgcaagagagctgagggaggcgcgagggcgcggagttc 
caggtcgagcagttaggccgcgagcgactgcggcgccgagccgatgagtaacccgaagcc 
cctagaggagtggtcacctgcctgagggcacttctgtcccaccagcatcagaccaggtgt 
gtgcaggccgcaccgagtccccggcaccatgtttgggaagaggaagaagcgggtggagat 
ctccgcgccgtccaacttcgagcaccgcgtgcacacgggcttcgaccagcacgagcagaa 
gttcacggggctgccccgccagtggcagagcctgatcgaggagtcggctcgccggcccaa 
gcccctcgtcgaccccgcctgcatcacctccatccagcccggggcccccaagaccatcgt 
gcggggcagcaaaggtgccaaagatggggccctcacgctgctgctggacgagtttgagaa 
catgtcggtgacacgctccaactccctgcggagagacagcccgccgccgcccgcccgtgc 
ccgccaggaaaatgggatgccagaggagccggccaccacggccagagggggcccagggaa 
gggggagcctcatgacgtggcccctaacgggccatcagcggggggcctggccatccccca 
gtcctcctcctcctcctcccggcctcccacccgagcccgaggtgcccccagccctggagt 
gctgggaccccacgcctcagagccccagctggcccctccagcctgcacccccgccgcccc 
tgctgttcctgggccccctggcccccgctcaccacagcgggagccacagcgagtatccca 
tgagcagttccgggctgccctgcagctggtggtggacccaggcgacccccgctcctacct 
ggacaacttcatcaagattggcgagggctccacgggcatcgtgtgcatcgccaccgtgcg 
cagctcgggcaagctggtggccgtcaagaagatggacctgcgcaagcagcagaggcgcga 
gctgctcttcaacgaggtggtaatcatgagggactaccagcacgagaatgtggtggagat 
gtacaacagctacctggtgggggacgagctctgggtggtcatggagttcctggaaggagg 
cgccctcaccgacatcgtcacccacaccaggatgaacgaggagcagatcgcggccgtgtg 
ccttgcagtgctgcaggccctgtcggtgctccacgcccagggcgtcatccaccgggacat 
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caagagcgactcgatcctgctgacccatgatggcagggtgaagctgtcagactttgggtt 
ctgcgcccaggtgagcaaggaagtgccccgaaggaagtcgctggtcggcacgccctactg 
gatggccccagagctcatctcccgccttccctacgggccagaggtagacatctggtcgct 
ggggataatggtgattgagatggtggacggagagcccccctacttcaacgagccacccct 
caaagccatgaagatgattcgggacaacctgccaccccgactgaagaacctgcacaaggt 
gtcgccatccctgaagggcttcctggaccgcctgctggtgcgagaccctgcccagcgggc 
cacggcagccgagctgctgaagcacccattcctggccaaggcagggccgcctgccagcat 
cgtgcccctcatgcgccagaaccgcaccagatgaggcccagcgcccttcccctcaaccaa 
agagccccccgggtcacccccgccccactgaggccagtagggggccaggcctcccactcc 
tcccagcccgggagatgctccgcgtggcaccaccctccttgctgggggtagatgagaccc 
tactactgaactccagttttgatctcgtgacttttagaaaaacacagggactcgtgggag 
caagcgaggctcccaggacccccaccctctgggacaggccctcccccatgttcttctgtc 
tccaggaagggcagcggccctcccatcactggaagtctgcagtgggggtcgctgggggtg 
gagagaacactaagaggtgaacatgtatgagtgtgtgcacgcgtgtgagtgtgcatgtgt 
gtgtgtgtgcaaaggtccagccaccccgtcctccagcctgcaaggggtgtctggcgcctt 
gcctgacacccagccccctctccccctgagccattgtgggggtcgatcatgaatgtccga 
agagtggccttttcccgtagccctgcgccccctttctgtggctggatggggagacaggtc 
agggccccccaccctctccagcccctgcagcaaatgactactgcacctggacagcctcct 
cttttctagaagtctatttatattgtcattttataacactctagcccctgcccttattgg 
gggacagatggtccctgtcctgcggggtggccctggcagaaccactgcctgaagaaccag 
gttcctgcccggtcagcgcagccccagcccgcccacccctgcctcgagttagttttacaa 
ttaaaacattgtcttgttt 

SEQ ID NO: 138. >R09837_T6 # TY Transcript # LN 2313 # Source Gene: R09837 
# Encoded protein: R09837_P3 

gagcgaggcctgggtggcgttctcaagacgaggtgacatggcggcgggagtgtccgcggt 
ggtggcggtgcaagagagctgagggaggcgcgagggcgcggagttccaggtcgagcagtt 
aggccgcgagcgactgcggcgccgagccggccgcaccgagtccccggcaccatgtttggg 
aagaggaagaagcgggtggagatctccgcgccgtccaacttcgagcaccgcgtgcacacg 
ggcttcgaccagcacgagcagaagttcacggggctgccccgccagtggcagagcctgatc 
gaggagtcggctcgccggcccaagcccctcgtcgaccccgcctgcatcacctccatccag 
cccggggcccccaagggggagcctcatgacgtggcccctaacgggccatcagcggggggc 
ctggccatcccccagtcctcctcctcctcctcccggcctcccacccgagcccgaggtgcc 
cccagccctggagtgctgggaccccacgcctcagagccccagctggcccctccagcctgc 
acccccgccgcccctgctgttcctgggccccctggcccccgctcaccacagcgggagcca 
cagcgagtatcccatgagcagttccgggctgccctgcagctggtggtggacccaggcgac 
ccccgctcctacctggacaacttcatcaagattggcgagggctccacgggcatcgtgtgc 
atcgccaccgtgcgcagctcgggcaagctggtggccgtcaagaagatggacctgcgcaag 
cagcagaggcgcgagctgctcttcaacgaggtggtaatcatgagggactaccagcacgag 
aatgtggtggagatgtacaacagctacctggtgggggacgagctctgggtggtcatggag 
ttcctggaaggaggcgccctcaccgacatcgtcacccacaccaggatgaacgaggagcag 
atcgcggccgtgtgccttgcagtgctgcaggccctgtcggtgctccacgcccagggcgtc 
atccaccgggacatcaagagcgactcgatcctgctgacccatgatggcagggtgaagctg 
tcagactttgggttctgcgcccaggtgagcaaggaagtgccccgaaggaagtcgctggtc 
ggcacgccctactggatggccccagagctcatctcccgccttccctacgggccagaggta 
gacatctggtcgctggggataatggtgattgagatggtggacggagagcccccctacttc 
aacgagccacccctcaaagccatgaagatgattcgggacaacctgccaccccgactgaag 
aacctgcacaaggtgtcgccatccctgaagggcttcctggaccgcctgctggtgcgagac 
cctgcccagcgggccacggcagccgagctgctgaagcacccattcctggccaaggcaggg 
ccgcctgccagcatcgtgcccctcatgcgccagaaccgcaccagatgaggcccagcgccc 
ttcccctcaaccaaagagccccccgggtcacccccgccccactgaggccagtagggggcc 
aggcctcccactcctcccagcccgggagatgctccgcgtggcaccaccctccttgctggg 
ggtagatgagaccctactactgaactccagttttgatctcgtgacttttagaaaaacaca 
gggactcgtgggagcaagcgaggctcccaggacccccaccctctgggacaggccctcccc 
catgttcttctgtctccaggaagggcagcggccctcccatcactggaagtctgcagtggg 
ggtcgctgggggtggagagaacactaagaggtgaacatgtatgagtgtgtgcacgcgtgt 
gagtgtgcatgtgtgtgtgtgtgcaaaggtccagccaccccgtcctccagcctgcaaggg 
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gtgtctggcgccttgcctgacacccagccccctctccccctgagccattgtgggggtcga 
tcatgaatgtccgaagagtggccttttcccgtagccctgcgccccctttctgtggctgga 
tggggagacaggtcagggccccccaccctctccagcccctgcagcaaatgactactgcac 
ctggacagcctcctcttttctagaagtctatttatattgtcattttataacactctagcc 
cctgcccttattgggggacagatggtccctgtcctgcggggtggccctggcagaaccact 
gcctgaagaaccaggttcctgcccggtcagcgcagccccagcccgcccacccctgcctcg 
agttagttttacaattaaaacattgtcttgttt 

SEQ ID NO: 139. >R09837_T7 # TY Transcript # LN 3043 ft Source Gene: R09837 
# Encoded protein: R09837_P1 

actggtgggcggtgaggggggaaagccccggatgttcgttggggattcaacatggcggcg 
ggagtgtccgcggtggtggcggtgcaagagagctgagggaggcgcgagggcgcggagttc 
caggtcgagcagttaggccgcgagcgactgcggcgccgagccgatgagtaacccgaagcc 
cctagaggagtggtcacctgcctgagggcacttctgtcccaccagcatcagaccaggtgt 
gtgcaggccgcaccgagtccccggcaccatgtttgggaagaggaagaagcgggtggagat 
ctccgcgccgtccaacttcgagcaccgcgtgcacacgggcttcgaccagcacgagcagaa 
gttcacggggctgccccgccagtggcagagcctgatcgaggagtcggctcgccggcccaa 
gcccctcgtcgaccccgcctgcatcacctccatccagcccggggcccccaagaccatcgt 
gcggggcagcaaaggtgccaaagatggggccctcacgctgctgctggacgagtttgagaa 
catgtcggtgacacgctccaactccctgcggagagacagcccgccgccgcccgcccgtgc 
ccgccaggaaaatgggatgccagaggagccggccaccacggccagagggggcccagggaa 
ggcaggcagccgaggccggttcgccggtcacagcgaggcgggtggcggcagtggtgacag 
gcgacgggcggggccagagaagaggcccaagtcttccagggagggctcagggggtcccca 
ggagtcctcccgggacaaacgccccctctccgggcctgatgtcggcaccccccagcctgc 
tggtctggccagtggggcgaaactggcagctggccggccctttaacacctacccgagggc 
tgacacggaccacccatcccggggtgcccagggggagcctcatgacgtggcccctaacgg 
gccatcagcggggggcctggccatcccccagtcctcctcctcctcctcccggcctcccac 
ccgagcccgaggtgcccccagccctggagtgctgggaccccacgcctcagagccccagct 
ggcccctccagcctgcacccccgccgcccctgctgttcctgggccccctggcccccgctc 
accacagcgggagccacagcgagtatcccatgagcagttccgggctgccctgcagctggt 
ggtggacccaggcgacccccgctcctacctggacaacttcatcaagattggcgagggctc 
cacgggcatcgtgtgcatcgccaccgtgcgcagctcgggcaagctggtggccgtcaagaa 
gatggacctgcgcaagcagcagaggcgcgagctgctcttcaacgaggtggtaatcatgag 
ggactaccagcacgagaatgtggtggagatgtacaacagctacctggtgggggacgagct 
ctgggtggtcatggagttcctggaaggaggcgccctcaccgacatcgtcacccacaccag 
gatgaacgaggagcagatcgcggccgtgtgccttgcagtgctgcaggccctgtcggtgct 
ccacgcccagggcgtcatccaccgggacatcaagagcgactcgatcctgctgacccatga 
tggcagggtgaagctgtcagactttgggttctgcgcccaggtgagcaaggaagtgccccg 
aaggaagtcgctggtcggcacgccctactggatggccccagagctcatctcccgccttcc 
ctacgggccagaggtagacatctggtcgctggggataatggtgattgagatggtggacgg 
agagcccccctacttcaacgagccacccctcaaagccatgaagatgattcgggacaacct 
gccaccccgactgaagaacctgcacaaggtgtcgccatccctgaagggcttcctggaccg 
cctgctggtgcgagaccctgcccagcgggccacggcagccgagctgctgaagcacccatt 
cctggccaaggcagggccgcctgccagcatcgtgcccctcatgcgccagaaccgcaccag 
atgaggcccagcgcccttcccctcaaccaaagagccccccgggtcacccccgccccactg 
aggccagtagggggccaggcctcccactcctcccagcccgggagatgctccgcgtggcac 
caccctccttgctgggggtagatgagaccctactactgaactccagttttgatctcgtga 
cttttagaaaaacacagggactcgtgggagcaagcgaggctcccaggacccccaccctct 
gggacaggccctcccccatgttcttctgtctccaggaagggcagcggccctcccatcact 
ggaagtctgcagtgggggtcgctgggggtggagagaacactaagaggtgaacatgtatga 
gtgtgtgcacgcgtgtgagtgtgcatgtgtgtgtgtgtgcaaaggtccagccaccccgtc 
ctccagcctgcaaggggtgtctggcgccttgcctgacacccagccccctctccccctgag 
ccattgtgggggtcgatcatgaatgtccgaagagtggccttttcccgtagccctgcgccc 
cctttctgtggctggatggggagacaggtcagggccccccaccctctccagcccctgcag 
caaatgactactgcacctggacagcctcctcttttctagaagtctatttatattgtcatt 
ttataacactctagcccctgcccttattgggggacagatggtccctgtcctgcggggtgg 
ccctggcagaaccactgcctgaagaaccaggttcctgcccggtcagcgcagccccagccc 
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gcccacccctgcctcgagttagttttacaattaaaacattgtcttgtttactgacaccat 
tgtccgcagatctcggagcactcttattccgggggcggacacggcggcgtttgcgacttc 
gccccgtggccattgctgtttctcggcccgcgttttcttggtccgggtggcgaatctgcc 
ggtcccctatgggagccataatttcgtttcgccggttcctgca 

SEQ ID NO: 140. >R09837_T8 # TY Transcript # LN 6291 # Source Gene: R09837 
# Encoded protein: R09837_P1 

actggtgggcggtgaggggggaaagccccggatgttcgttggggattcaacatggcggcg 
ggagtgtccgcggtggtggcggtgcaagagagctgagggaggcgcgagggcgcggagttc 
caggtcgagcagttaggccgcgagcgactgcggcgccgagccgatgagtaacccgaagcc 
cctagaggagtggtcacctgcctgagggcacttctgtcccaccagcatcagaccaggtgt 
gtgcaggccgcaccgagtccccggcaccatgtttgggaagaggaagaagcgggtggagat 
ctccgcgccgtccaacttcgagcaccgcgtgcacacgggcttcgaccagcacgagcagaa 
gttcacggggctgccccgccagtggcagagcctgatcgaggagtcggctcgccggcccaa 
gcccctcgtcgaccccgcctgcatcacctccatccagcccggggcccccaagaccatcgt 
gcggggcagcaaaggtgccaaagatggggccctcacgctgctgctggacgagtttgagaa 
catgtcggtgacacgctccaactccctgcggagagacagcccgccgccgcccgcccgtgc 
ccgccaggaaaatgggatgccagaggagccggccaccacggccagagggggcccagggaa 
ggcaggcagccgaggccggttcgccggtcacagcgaggcgggtggcggcagtggtgacag 
gcgacgggcggggccagagaagaggcccaagtcttccagggagggctcagggggtcccca 
ggagtcctcccgggacaaacgccccctctccgggcctgatgtcggcaccccccagcctgc 
tggtctggccagtggggcgaaactggcagctggccggccctttaacacctacccgagggc 
tgacacggaccacccatcccggggtgcccagggggagcctcatgacgtggcccctaacgg 
gccatcagcggggggcctggccatcccccagtcctcctcctcctcctcccggcctcccac 
ccgagcccgaggtgcccccagccctggagtgctgggaccccacgcctcagagccccagct 
ggcccctccagcctgcacccccgccgcccctgctgttcctgggccccctggcccccgctc 
accacagcgggagccacagcgagtatcccatgagcagttccgggctgccctgcagctggt 
ggtggacccaggcgacccccgctcctacctggacaacttcatcaagattggcgagggctc 
cacgggcatcgtgtgcatcgccaccgtgcgcagctcgggcaagctggtggccgtcaagaa 
gatggacctgcgcaagcagcagaggcgcgagctgctcttcaacgaggtggtaatcatgag 
ggactaccagcacgagaatgtggtggagatgtacaacagctacctggtgggggacgagct 
ctgggtggtcatggagttcctggaaggaggcgccctcaccgacatcgtcacccacaccag 
gatgaacgaggagcagatcgcggccgtgtgccttgcagtgctgcaggccctgtcggtgct 
ccacgcccagggcgtcatccaccgggacatcaagagcgactcgatcctgctgacccatga 
tggcagggtgaagctgtcagactttgggttctgcgcccaggtgagcaaggaagtgccccg 
aaggaagtcgctggtcggcacgccctactggatggccccagagctcatctcccgccttcc 
ctacgggccagaggtagacatctggtcgctggggataatggtgattgagatggtggacgg 
agagcccccctacttcaacgagccacccctcaaagccatgaagatgattcgggacaacct 
gccaccccgactgaagaacctgcacaaggtgtcgccatccctgaagggcttcctggaccg 
cctgctggtgcgagaccctgcccagcgggccacggcagccgagctgctgaagcacccatt 
cctggccaaggcagggccgcctgccagcatcgtgcccctcatgcgccagaaccgcaccag 
atgaggcccagcgcccttcccctcaaccaaagagccccccgggtcacccccgccccactg 
aggccagtagggggccaggcctcccactcctcccagcccgggagatgctccgcgtggcac 
caccctccttgctgggggtagatgagaccctactactgaactccagttttgatctcgtga 
cttttagaaaaacacagggactcgtgggagcaagcgaggctcccaggacccccaccctct 
gggacaggccctcccccatgttcttctgtctccaggaagggcagcggccctcccatcact 
ggaagtctgcagtgggggtcgctgggggtggagagaacactaagaggtgaacatgtatga 
gtgtgtgcacgcgtgtgagtgtgcatgtgtgtgtgtgtgcaaaggtccagccaccccgtc 
ctccagcctgcaaggggtgtctggcgccttgcctgacacccagccccctctccccctgag 
ccattgtgggggtcgatcatgaatgtccgaagagtggccttttcccgtagccctgcgccc 
cctttctgtggctggatggggagacaggtcagggccccccaccctctccagcccctgcag 
caaatgactactgcacctggacagcctcctcttttctagaagtctatttatattgtcatt 
ttataacactctagcccctgcccttattgggggacagatggtccctgtcctgcggggtgg 
ccctggcagaaccactgcctgaagaaccaggttcctgcccggtcagcgcagccccagccc 
gcccacccctgcctcgagttagttttacaattaaaacattgtcttgttttcttgttttgt 
gtctgtgtgcgatgtgtggggggcagggggccctgcccggctgtcttgggtgggaatttg 
cagggagagggtctggatctgggagcaaaccacgattccagccaaggcagggcaagggtg 
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gggtggggagtggggagttcaggtcatagcagccagtaagctcccccagcctgccactcc 
ccagaatggggcaggattgtccccacccctggaagcagccagtttgccacagtccatgtg 
cagactgatcccagtttgccaaatctgcaatttcctggaaccttttaaaggctgtcttga 
gcgcgtttggtgagtaggagctaacccaagttagtaaattgaaggccatttggcaaattg 
gtcagtgggcagatgggcttttggggattgactgaggctgactggcctggagctgctggc 
ttcggagagacaccctgtgaagtgtgtccttccacgcaggagcccagagccgagcccacg 
ctggggggaatctgactggcatggaggtggccatgccaccatcgctgctgcagctgcatc 
ctggcactttgcgcctcaggccctgttgggctccactttctgcatcctccccagccccca 
gggaggcagtggagtggggagagagccaggagtgagcctccgtccccaaagccagccagg 
cgtcatcagcaccagagacctcagcctggtcctctcgggaagtgagtggccagggcagag 
attccaggttagtccacgcctcccacccttcacaggtcctgaccccaagaatcagagcac 
tgtgtgtgtggcagggcctatgccaagtgcaaacacagcctagatggatcatcacagagt 
gaaacccagcggtgcaagcagctgtgctctctgcgatgtattggaggcttaggtgaggtg 
gatgcctttctggaaaaaaaaaaaatgctaacattggcaaaagaagaaatagaaaacaag 
accaaaataactgtctcctcactgcacacacactccagaataaataaaaggtttcaggct 
tgaatgcactttcaaatgagattttttttttttttttgagacggagtatcgctctgtcgc 
ccaggctggagtgcrgtggcacgrtcttggctcrctgcaacttctgcctcctgggttcaa 
gcgattctcctgcctcagcctcccargtrgctgggattrcaggcgcttgccaccacgccr 
ggctarttttttgtattttgrgtagagactggrtttcgccatcttggccagactggtctt 
gaactcctgacctcgtgatccrcccacctcagcctcccaargtgctgggattacaggcat 
gagccaccatgcctggcctcaaatgaggtttaccagactttgaaggagcaggtaattcct 
tctaccttgtgaacaagttgttccagaaagatagcagctcaggaggcctctgtgaccatg 
gttccagacccagataaggacggcaaagaacagagcatctcagaaacgcaaggctcacag 
ccagggtgcccggcgaccccacgggcactgagaacagctagctctaggagctccactctc 
ctgctgaagaaaccacgggctcagagacggggagctccctcgcccagccacatctgtgac 
cccacggtaactctgctggttttggtgccttcagtcactcactgcaggtttgttttgttt 
tgttttgttttgagtttttttgttttttgttttttgtttttagagacrrgatgttgctat 
gttgcctaggctggtttcgaactcctggcctcargggatcctccctccttggccccgcaa 
rgtgctgggattrcaggcgtgagccrccgtgcccrgcccgctcactgcagtttgaaggca 
tggctttgggtggcgtggggtgaaagctgcccgaggccccgttcctccccacgtggctgc 
ctcctgccagagccagtcaggaaaacagaccccaactagagttgtttcaaatggcaggga 
tttggtaccggtggttggatcatgacaaagctctgagaaggctggaggagccacagagtg 
ccaagtgcccagcaatcattagaggaaggaggctgctgccacctgtgtggctagaggaac 
agaggggccaatggcattccccaaaccccactctcgcctctgtctggccagagcagaatg 
gcttcttccagcttcccaccctggactcccacccaggagcctcctcctggcagacccttc 
ctgaccccaccggcccggggggtctacagatccatgtttcaggcgtccgcctggagcgga 
acaggggagtgcttaggacaagggtggtgccagaggatccactctgcccacatttagttg 
accagctgaggcactccacgggaatgaatgactctcgacaggtgccggaggtgaggaggg 
gcccggaggcccaggaggggcacagggatggattcgtccgcctgggggctggaggtgtgt 
ttacagagccccaaaataaacaatgcaaccaggtcagaccagcggttctcacagtgtggt 
tcccagaccagcagcatcactgggagcttactggacacgcaaataaatccctgtgtgcca 
ccccagctgcatcagatgctggtggggcccagtgatctgtatttaacacaccctccgggg 
gatgccggtgcccactcacgtttgagaacccctgcgatccacgactgccctcccgtgtaa 
aaggcccacctctgtgggactccaagtcatcagcaccctagggtccttccgtctttttcc 
ttcctcctgggacacctgcctctcccatgtcgtattagagaattccttatgctcccaagt 
gggcacggggagaggaaggcactcctctttaaggaccgacccagaggttttgccattgct 
tcactggccagagcttagtcacgcagcctcacccagaggcaagggaggttggaaaatgta 
gtgtttgtgtgtgtctaacacaaattctattaccatgcagtcaggattctccactcttgc 
tctttcattagatttgctgggcttcaccctggactttctgatttagtgacagaacagaga 
acccagaggcagacccagatgtgtacaagggcttcatatacaatcaggagatttaataat 
catgctaggggccgggtgcagtggctcacgcctgtaatcccaagcactttggggagccga 
ggcaggcrgatcacttgaggtcaggagtttgagaccagcctgggcaacaaagtgagaccc 
tgtctctactaaaaataacaaaaattagccgggcgtggtggtgggygcctgtaatcccag 
ctcctcgggtggctgaggcatgagaatcacttgaacccaggaggcagaggtttcagtgag 
ctgagatcacatcactgcactccagcctgggtgacagagtgagattcygtc 
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SEQ ID NO: 14 1 . >R09837_T9 # TY Transcript # LN 2870 # Source Gene: R09837 

# Encoded protein: R09837_P4 

actggtgggcggtgaggggggaaagccccggatgttcgttggggattcaacatggcggcg 
ggagtgtccgcggtggtggcggtgcaagagagctgagggaggcgcgagggcgcggagttc 
caggtcgagcagttaggccgcgagcgactgcggcgccgagccgatgagtaacccgaagcc 
cctagaggagtggtcacctgcctgagggcacttctgtcccaccagcatcagaccaggtgt 
gtgcaggctctctatgggatgccgccatcagaatgtctcttcctcagcccctttgttttt 
taaagtggctttggtcgaccatttgctctgtgcttggtactgtgtcaagggcccgattga 
agaccacagatcctaaagataggggagtgtcgcgacaaggctgcctggggtgccctctta 
acccaggggatcagtaaccctgtgctgagtcaatatccaaaggccgcaccgagtccccgg 
caccatgtttgggaagaggaagaagcgggtggagatctccgcgccgtccaacttcgagca 
ccgcgtgcacacgggcttcgaccagcacgagcagaagttcacggggctgccccgccagtg 
gcagagcctgatcgaggagtcggctcgccggcccaagcccctcgtcgaccccgcctgcat 
cacctccatccagcccggggcccccaagaccatcgtgcggggcagcaaaggtgccaaaga 
tggggccctcacgctgctgctggacgagtttgagaacatgtcggtgacacgctccaactc 
cctgcggagagacagcccgccgccgcccgcccgtgcccgccaggaaaatgggatgccaga 
ggagccggcccagtggggcgaaactggcagctggccggccctttaacacctacccgaggg 
ctgacacggaccacccatcccggggtgcccagggggagcctcatgacgtggcccctaacg 
ggccatcagcggggggcctggccatcccccagtcctcctcctcctcctcccggcctccca 
cccgagcccgaggtgcccccagccctggagtgctgggaccccacgcctcagagccccagc 
tggcccctccagcctgcacccccgccgcccctgctgttcctgggccccctggcccccgct 
caccacagcgggagccacagcgagtatcccatgagcagttccgggctgccctgcagctgg 
tggtggacccaggcgacccccgctcctacctggacaacttcatcaagattggcgagggct 
ccacgggcatcgtgtgcatcgccaccgtgcgcagctcgggcaagctggtggccgtcaaga 
agatggacctgcgcaagcagcagaggcgcgagctgctcttcaacgaggtggtaatcatga 
gggactaccagcacgagaatgtggtggagatgtacaacagctacctggtgggggacgagc 
tctgggtggtcatggagttcctggaaggaggcgccctcaccgacatcgtcacccacacca 
ggatgaacgaggagcagatcgcggccgtgtgccttgcagtgctgcaggccctgtcggtgc 
tccacgcccagggcgtcatccaccgggacatcaagagcgactcgatcctgctgacccatg 
atggcagggtgaagctgtcagactttgggttctgcgcccaggtgagcaaggaagtgcccc 
gaaggaagtcgctggtcggcacgccctactggatggccccagagctcatctcccgccttc 
cctacgggccagaggtagacatctggtcgctggggataatggtgattgagatggtggacg 
gagagcccccctacttcaacgagccacccctcaaagccatgaagatgattcgggacaacc 
tgccaccccgactgaagaacctgcacaaggtgtcgccatccctgaagggcttcctggacc 
gcctgctggtgcgagaccctgcccagcgggccacggcagccgagctgctgaagcacccat 
tcctggccaaggcagggccgcctgccagcatcgtgcccctcatgcgccagaaccgcacca 
gatgaggcccagcgcccttcccctcaaccaaagagccccccgggtcacccccgccccact 
gaggccagtagggggccaggcctcccactcctcccagcccgggagatgctccgcgtggca 
ccaccctccttgctgggggtagatgagaccctactactgaactccagttttgatctcgtg 
acttttagaaaaacacagggactcgtgggagcaagcgaggctcccaggacccccaccctc 
tgggacaggccctcccccatgttcttctgtctccaggaagggcagcggccctcccatcac 
tggaagtctgcagtgggggtcgctgggggtggagagaacactaagaggtgaacatgtatg 
agtgtgtgcacgcgtgtgagtgtgcatgtgtgtgtgtgtgcaaaggtccagccaccccgt 
cctccagcctgcaaggggtgtctggcgccttgcctgacacccagccccctctccccctga 
gccattgtgggggtcgatcatgaatgtccgaagagtggccttttcccgtagccctgcgcc 
ccctttctgtggctggatggggagacaggtcagggccccccaccctctccagcccctgca 
gcaaatgactactgcacctggacagcctcctcttttctagaagtctatttatattgtcat 
tttataacactctagcccctgcccttattgggggacagatggtccctgtcctgcggggtg 
gccctggcagaaccactgcctgaagaaccaggttcctgcccggtcagcgcagccccagcc 
cgcccacccctgcctcgagttagttttacaattaaaacattgtcttgttt 

SEQ ID NO: 142. >R09837JT10 # TY Transcript < LN 2375 # Source Gene: R09837 

# Encoded protein: R09837_P9 

actggtgggcggtgaggggggaaagccccggatgttcgttggggattcaacatggcggcg 
ggagtgtccgcggtggtggcggtgcaagagagctgagggaggcgcgagggcgcggagttc 
caggtcgagcagttaggccgcgagcgactgcggcgccgagccgatgagtaacccgaagcc 
cctagaggagtggtcacctgcctgagggcacttctgtcccaccagcatcagaccaggtgt 
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gtgcaggccgcaccgagtccccggcaccatgtttgggaagaggaagaagcgggtggagat 
ctccgcgccgtccaacttcgagcaccgcgtgcacacgggcttcgaccagcacgagcagaa 
gttcacggggctgccccgccagtggcagagcctgatcgaggagtcggctcgccggcccaa 
gcccctcgtcgaccccgcctgcatcacctccatccagcccggggcccccaagaccatcgt 
gcggggcagcaaaggtgccaaagatggggccctcacgctgctgctggacgagtttgagaa 
catgtcggtgacacgctccaactccctgcggagagacagcccgccgccgcccgcccgtgc 
ccgccaggaaaatgggatgccagaggagccggcctggcccccgctcaccacagcgggagc 
cacagcgagtatcccatgagcagttccgggctgccctgcagctggtggtggacccaggcg 
acccccgctcctacctggacaacttcatcaagattggcgagggctccacgggcatcgtgt 
gcatcgccaccgtgcgcagctcgggcaagctggtggccgtcaagaagatggacctgcgca 
agcagcagaggcgcgagctgctcttcaacgaggtggtaatcatgagggactaccagcacg 
agaatgtggtggagatgtacaacagctacctggtgggggacgagctctgggtggtcatgg 
agttcctggaaggaggcgccctcaccgacatcgtcacccacaccaggatgaacgaggagc 
agatcgcggccgtgtgccttgcagtgctgcaggccctgtcggtgctccacgcccagggcg 
tcatccaccgggacatcaagagcgactcgatcctgctgacccatgatggcagggtgaagc 
tgtcagactttgggttctgcgcccaggtgagcaaggaagtgccccgaaggaagtcgctgg 
tcggcacgccctactggatggccccagagctcatctcccgccttccctacgggccagagg 
tagacatctggtcgctggggataatggtgattgagatggtggacggagagcccccctact 
tcaacgagccacccctcaaagccatgaagatgattcgggacaacctgccaccccgactga 
agaacctgcacaaggtgtcgccatccctgaagggcttcctggaccgcctgctggtgcgag 
accctgcccagcgggccacggcagccgagctgctgaagcacccattcctggccaaggcag 
ggccgcctgccagcatcgtgcccctcatgcgccagaaccgcaccagatgaggcccagcgc 
ccttcccctcaaccaaagagccccccgggtcacccccgccccactgaggccagtaggggg 
ccaggcctcccactcctcccagcccgggagatgctccgcgtggcaccaccctccttgctg 
ggggtagatgagaccctactactgaactccagttttgatctcgtgacttttagaaaaaca 
cagggactcgtgggagcaagcgaggctcccaggacccccaccctctgggacaggccctcc 
cccatgttcttctgtctccaggaagggcagcggccctcccatcactggaagtctgcagtg 
ggggtcgctgggggtggagagaacactaagaggtgaacatgtatgagtgtgtgcacgcgt 
gtgagtgtgcatgtgtgtgtgtgtgcaaaggtccagccaccccgtcctccagcctgcaag 
gggtgtctggcgccttgcctgacacccagccccctctccccctgagccattgtgggggtc 
gatcatgaatgtccgaagagtggccttttcccgtagccctgcgccccctttctgtggctg 
gatggggagacaggtcagggccccccaccctctccagcccctgcagcaaatgactactgc 
acctggacagcctcctcttttctagaagtctatttatattgtcattttataacactctag 
cccctgcccttattgggggacagatggtccctgtcctgcggggtggccctggcagaacca 
ctgcctgaagaaccaggttcctgcccggtcagcgcagccccagcccgcccacccctgcct 
cgagttagttttacaattaaaacattgtcttgttt 

SEQ ID NO: 143. >R09837JT11 # TY Transcript # LN 2734 # Source Gene: R09837 
# Encoded protein: R09837_P4 

gagcgaggcctgggtggcgttctcaagacgaggtgacatggcggcgggagtgtccgcggt 
ggtggcggtgcaagagagctgagggaggcgcgagggcgcggagttccaggtcgagcagtt 
aggccgcgagcgactgcggcgccgagccgccttcctgctgcccaccccgcgtccctccag 
actgggtcccgtgctcccgtcccccagccctgaccaccctgggcagctgctgcatgggga 
cagatgagtaacccgaagcccctagaggagtggtcacctgcctgagggcacttctgtccc 
accagcatcagaccaggtgtgtgcaggccgcaccgagtccccggcaccatgtttgggaag 
aggaagaagcgggtggagatctccgcgccgtccaacttcgagcaccgcgtgcacacgggc 
ttcgaccagcacgagcagaagttcacggggctgccccgccagtggcagagcctgatcgag 
gagtcggctcgccggcccaagcccctcgtcgaccccgcctgcatcacctccatccagccc 
ggggcccccaagaccatcgtgcggggcagcaaaggtgccaaagatggggccctcacgctg 
ctgctggacgagtttgagaacatgtcggtgacacgctccaactccctgcggagagacagc 
ccgccgccgcccgcccgtgcccgccaggaaaatgggatgccagaggagccggcccagtgg 
ggcgaaactggcagctggccggccctttaacacctacccgagggctgacacggaccaccc 
atcccggggtgcccagggggagcctcatgacgtggcccctaacgggccatcagcgggggg 
cctggccatcccccagtcctcctcctcctcctcccggcctcccacccgagcccgaggtgc 
ccccagccctggagtgctgggaccccacgcctcagagccccagctggcccctccagcctg 
cacccccgccgcccctgctgttcctgggccccctggcccccgctcaccacagcgggagcc 
acagcgagtatcccatgagcagttccgggctgccctgcagctggtggtggacccaggcga 
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cccccgctcctacctggacaacttcatcaagattggcgagggctccacgggcatcgtgtg 
catcgccaccgtgcgcagctcgggcaagctggtggccgtcaagaagatggacctgcgcaa 
gcagcagaggcgcgagctgctcttcaacgaggtggtaatcatgagggactaccagcacga 
gaatgtggtggagatgtacaacagctacctggtgggggacgagctctgggtggtcatgga 
gttcctggaaggaggcgccctcaccgacatcgtcacccacaccaggatgaacgaggagca 
gatcgcggccgtgtgccttgcagtgctgcaggccctgtcggtgctccacgcccagggcgt 
catccaccgggacatcaagagcgactcgatcctgctgacccatgatggcagggtgaagct 
gtcagactttgggttctgcgcccaggtgagcaaggaagtgccccgaaggaagtcgctggt 
cggcacgccctactggatggccccagagctcatctcccgccttccctacgggccagaggt 
agacatctggtcgctggggataatggtgattgagatggtggacggagagcccccctactt 
caacgagccacccctcaaagccatgaagatgattcgggacaacctgccaccccgactgaa 
gaacctgcacaaggtgtcgccatccctgaagggcttcctggaccgcctgctggtgcgaga 
ccctgcccagcgggccacggcagccgagctgctgaagcacccattcctggccaaggcagg 
gccgcctgccagcatcgtgcccctcatgcgccagaaccgcaccagatgaggcccagcgcc 
cttcccctcaaccaaagagccccccgggtcacccccgccccactgaggccagtagggggc 
caggcctcccactcctcccagcccgggagatgctccgcgtggcaccaccctccttgctgg 
gggtagatgagaccctactactgaactccagttttgatctcgtgacttttagaaaaacac 
agggactcgtgggagcaagcgaggctcccaggacccccaccctctgggacaggccctccc 
ccatgttcttctgtctccaggaagggcagcggccctcccatcactggaagtctgcagtgg 
gggtcgctgggggtggagagaacactaagaggtgaacatgtatgagtgtgtgcacgcgtg 
tgagtgtgcatgtgtgtgtgtgtgcaaaggtccagccaccccgtcctccagcctgcaagg 
ggtgtctggcgccttgcctgacacccagccccctctccccctgagccattgtgggggtcg 
atcatgaatgtccgaagagtggccttttcccgtagccctgcgccccctttctgtggctgg 
atggggagacaggtcagggccccccaccctctccagcccctgcagcaaatgactactgca 
cctggacagcctcctcttttctagaagtctatttatattgtcattttataacactctagc 
ccctgcccttattgggggacagatggtccctgtcctgcggggtggccctggcagaaccac 
tgcctgaagaaccaggttcctgcccggtcagcgcagccccagcccgcccacccctgcctc 
gagttagttttacaattaaaacattgtcttgttt 

SEQ ID NO: 144. >R09837_T12 # TY Transcript # LN 2396 # Source Gene: R09837 
# Encoded protein: R09837_P3 

gagcgaggcctgggtggcgttctcaagacgaggtgacatggcggcgggagtgtccgcggt 
ggtggcggtgcaagagagctgagggaggcgcgagggcgcggagttccaggtcgagcagtt 
aggccgcgagcgactgcggcgccgagccgatgagtaacccgaagcccctagaggagtggt 
cacctgcctgagggcacttctgtcccaccagcatcagaccaggtgtgtgcaggccgcacc 
gagtccccggcaccatgtttgggaagaggaagaagcgggtggagatctccgcgccgtcca 
acttcgagcaccgcgtgcacacgggcttcgaccagcacgagcagaagttcacggggctgc 
cccgccagtggcagagcctgatcgaggagtcggctcgccggcccaagcccctcgtcgacc 
ccgcctgcatcacctccatccagcccggggcccccaagggggagcctcatgacgtggccc 
ctaacgggccatcagcggggggcctggccatcccccagtcctcctcctcctcctcccggc 
ctcccacccgagcccgaggtgcccccagccctggagtgctgggaccccacgcctcagagc 
cccagctggcccctccagcctgcacccccgccgcccctgctgttcctgggccccctggcc 
cccgctcaccacagcgggagccacagcgagtatcccatgagcagttccgggctgccctgc 
agctggtggtggacccaggcgacccccgctcctacctggacaacttcatcaagattggcg 
agggctccacgggcatcgtgtgcatcgccaccgtgcgcagctcgggcaagctggtggccg 
tcaagaagatggacctgcgcaagcagcagaggcgcgagctgctcttcaacgaggtggtaa 
tcatgagggactaccagcacgagaatgtggtggagatgtacaacagctacctggtggggg 
acgagctctgggtggtcatggagttcctggaaggaggcgccctcaccgacatcgtcaccc 
acaccaggatgaacgaggagcagatcgcggccgtgtgccttgcagtgctgcaggccctgt 
cggtgctccacgcccagggcgtcatccaccgggacatcaagagcgactcgatcctgctga 
cccatgatggcagggtgaagctgtcagactttgggttctgcgcccaggtgagcaaggaag 
tgccccgaaggaagtcgctggtcggcacgccctactggatggccccagagctcatctccc 
gccttccctacgggccagaggtagacatctggtcgctggggataatggtgattgagatgg 
tggacggagagcccccctacttcaacgagccacccctcaaagccatgaagatgattcggg 
acaacctgccaccccgactgaagaacctgcacaaggtgtcgccatccctgaagggcttcc 
tggaccgcctgctggtgcgagaccctgcccagcgggccacggcagccgagctgctgaagc 
acccattcctggccaaggcagggccgcctgccagcatcgtgcccctcatgcgccagaacc 
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gcaccagatgaggcccagcgcccttcccctcaaccaaagagccccccgggtcacccccgc 
cccactgaggccagtagggggccaggcctcccactcctcccagcccgggagatgctccgc 
gtggcaccaccctccttgctgggggtagatgagaccctactactgaactccagttttgat 
ctcgtgacttttagaaaaacacagggactcgtgggagcaagcgaggctcccaggaccccc 
accctctgggacaggccctcccccatgttcttctgtctccaggaagggcagcggccctcc 
catcactggaagtctgcagtgggggtcgctgggggtggagagaacactaagaggtgaaca 
tgtatgagtgtgtgcacgcgtgtgagtgtgcatgtgtgtgtgtgtgcaaaggtccagcca 
ccccgtcctccagcctgcaaggggtgtctggcgccttgcctgacacccagccccctctcc 
ccctgagccattgtgggggtcgatcatgaatgtccgaagagtggccttttcccgtagccc 
tgcgccccctttctgtggctggatggggagacaggtcagggccccccaccctctccagcc 
cctgcagcaaatgactactgcacctggacagcctcctcttttctagaagtctatttatat 
tgtcattttataacactctagcccctgcccttattgggggacagatggtccctgtcctgc 
ggggtggccctggcagaaccactgcctgaagaaccaggttcctgcccggtcagcgcagcc 
ccagcccgcccacccctgcctcgagttagttttacaattaaaacattgtcttgttt 

SEQ ID NO: 145 . >R09837_T13 # TY Transcript # LN 2502 # Source Gene: R09837 
# Encoded protein: R09837_P2 

gagcgaggcctgggtggcgttctcaagacgaggtgacatggcggcgggagtgtccgcggt 
ggtggcggtgcaagagagctgagggaggcgcgagggcgcggagttccaggtcgagcagtt 
aggccgcgagcgactgcggcgccgagccggccgcaccgagtccccggcaccatgtttggg 
aagaggaagaagcgggtggagatctccgcgccgtccaacttcgagcaccgcgtgcacacg 
ggcttcgaccagcacgagcagaagttcacggggctgccccgccagtggcagagcctgatc 
gaggagtcggctcgccggcccaagcccctcgtcgaccccgcctgcatcacctccatccag 
cccggggcccccaagaccatcgtgcggggcagcaaaggtgccaaagatggggccctcacg 
ctgctgctggacgagtttgagaacatgtcggtgacacgctccaactccctgcggagagac 
agcccgccgccgcccgcccgtgcccgccaggaaaatgggatgccagaggagccggccacc 
acggccagagggggcccagggaagggggagcctcatgacgtggcccctaacgggccatca 
gcggggggcctggccatcccccagtcctcctcctcctcctcccggcctcccacccgagcc 
cgaggtgcccccagccctggagtgctgggaccccacgcctcagagccccagctggcccct 
ccagcctgcacccccgccgcccctgctgttcctgggccccctggcccccgctcaccacag 
cgggagccacagcgagtatcccatgagcagttccgggctgccctgcagctggtggtggac 
ccaggcgacccccgctcctacctggacaacttcatcaagattggcgagggctccacgggc 
atcgtgtgcatcgccaccgtgcgcagctcgggcaagctggtggccgtcaagaagatggac 
ctgcgcaagcagcagaggcgcgagctgctcttcaacgaggtggtaatcatgagggactac 
cagcacgagaatgtggtggagatgtacaacagctacctggtgggggacgagctctgggtg 
gtcatggagttcctggaaggaggcgccctcaccgacatcgtcacccacaccaggatgaac 
gaggagcagatcgcggccgtgtgccttgcagtgctgcaggccctgtcggtgctccacgcc 
cagggcgtcatccaccgggacatcaagagcgactcgatcctgctgacccatgatggcagg 
gtgaagctgtcagactttgggttctgcgcccaggtgagcaaggaagtgccccgaaggaag 
tcgctggtcggcacgccctactggatggccccagagctcatctcccgccttccctacggg 
ccagaggtagacatctggtcgctggggataatggtgattgagatggtggacggagagccc 
ccctacttcaacgagccacccctcaaagccatgaagatgattcgggacaacctgccaccc 
cgactgaagaacctgcacaaggtgtcgccatccctgaagggcttcctggaccgcctgctg 
gtgcgagaccctgcccagcgggccacggcagccgagctgctgaagcacccattcctggcc 
aaggcagggccgcctgccagcatcgtgcccctcatgcgccagaaccgcaccagatgaggc 
ccagcgcccttcccctcaaccaaagagccccccgggtcacccccgccccactgaggccag 
tagggggccaggcctcccactcctcccagcccgggagatgctccgcgtggcaccaccctc 
cttgctgggggtagatgagaccctactactgaactccagttttgatctcgtgacttttag 
aaaaacacagggactcgtgggagcaagcgaggctcccaggacccccaccctctgggacag 
gccctcccccatgttcttctgtctccaggaagggcagcggccctcccatcactggaagtc 
tgcagtgggggtcgctgggggtggagagaacactaagaggtgaacatgtatgagtgtgtg 
cacgcgtgtgagtgtgcatgtgtgtgtgtgtgcaaaggtccagccaccccgtcctccagc 
ctgcaaggggtgtctggcgccttgcctgacacccagccccctctccccctgagccattgt 
gggggtcgatcatgaatgtccgaagagtggccttttcccgtagccctgcgccccctttct 
gtggctggatggggagacaggtcagggccccccaccctctccagcccctgcagcaaatga 
ctactgcacctggacagcctcctcttttctagaagtctatttatattgtcattttataac 
actctagcccctgcccttattgggggacagatggtccctgtcctgcggggtggccctggc 
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Figur 1 (Cont'd) 



agaaccactgcctgaagaaccaggttcctgcccggtcagcgcagccccagcccgcccacc 
cctgcctcgagttagttttacaattaaaacattgtcttgttt 

SEQ ID NO: 14 6. >R09837_T14 # TY Transcript # LN 2769 # Source Gene: R09837 

# Encoded protein: R09837_P1 

actggtgggcggtgaggggggaaagccccggatgttcgttggggattcaacatggcggcg 
ggagtgtccgcggtggtggcggtgcaagagagctgagggaggcgcgagggcgcggagttc 
caggtcgagcagttaggccgcgagcgactgcggcgccgagccgatgagtaacccgaagcc 
cctagaggagtggtcacctgcctgagggcacttctgtcccaccagcatcagaccaggtgt 
gtgcaggccgcaccgagtccccggcaccatgtttgggaagaggaagaagcgggtggagat 
ctccgcgccgtccaacttcgagcaccgcgtgcacacgggcttcgaccagcacgagcagaa 
gttcacggggctgccccgccagtggcagagcctgatcgaggagtcggctcgccggcccaa 
gcccctcgtcgaccccgcctgcatcacctccatccagcccggggcccccaagaccatcgt 
gcggggcagcaaaggtgccaaagatggggccctcacgctgctgctggacgagtttgagaa 
catgtcggtgacacgctccaactccctgcggagagacagcccgccgccgcccgcccgtgc 
ccgccaggaaaatgggatgccagaggagccggccaccacggccagagggggcccagggaa 
ggcaggcagccgaggccggttcgccggtcacagcgaggcgggtggcggcagtggtgacag 
gcgacgggcggggccagagaagaggcccaagtcttccagggagggctcagggggtcccca 
ggagtcctcccgggacaaacgccccctctccgggcctgatgtcggcaccccccagcctgc 
tggtctggccagtggggcgaaactggcagctggccggccctttaacacctacccgagggc 
tgacacggaccacccatcccggggtgcccagggggagcctcatgacgtggcccctaacgg 
gccatcagcggggggcctggccatcccccagtcctcctcctcctcctcccggcctcccac 
ccgagcccgaggtgcccccagccctggagtgctgggaccccacgcctcagagccccagct 
ggcccctccagcctgcacccccgccgcccctgctgttcctgggccccctggcccccgctc 
accacagcgggagccacagcgagtatcccatgagcagttccgggctgccctgcagctggt 
ggtggacccaggcgacccccgctcctacctggacaacttcatcaagattggcgagggctc 
cacgggcatcgtgtgcatcgccaccgtgcgcagctcgggcaagctggtggccgtcaagaa 
gatggacctgcgcaagcagcagaggcgcgagctgctcttcaacgaggtggtaatcatgag 
ggactaccagcacgagaatgtggtggagatgtacaacagctacctggtgggggacgagct 
ctgggtggtcatggagttcctggaaggaggcgccctcaccgacatcgtcacccacaccag 
gatgaacgaggagcagatcgcggccgtgtgccttgcagtgctgcaggccctgtcggtgct 
ccacgcccagggcgtcatccaccgggacatcaagagcgactcgatcctgctgacccatga 
tggcagggtgaagctgtcagactttgggttctgcgcccaggtgagcaaggaagtgccccg 
aaggaagtcgctggtcggcacgccctactggatggccccagagctcatctcccgccttcc 
ctacgggccagaggtagacatctggtcgctggggataatggtgattgagatggtggacgg 
agagcccccctacttcaacgagccacccctcaaagccatgaagatgattcgggacaacct 
gccaccccgactgaagaacctgcacaaggtgtcgccatccctgaagggcttcctggaccg 
cctgctggtgcgagaccctgcccagcgggccacggcagccgagctgctgaagcacccatt 
cctggccaaggcagggccgcctgccagcatcgtgcccctcatgcgccagaaccgcaccag 
atgaggcccagcgcccttcccctcaaccaaagagccccccgggtcacccccgccccactg 
aggccagtagggggccaggcctcccactcctcccagcccgggagatgctccgcgtggcac 
caccctccttgctgggggtagatgagaccctactactgaactccagttttgatctcgtga 
cttttagaaaaacacagggactcgtgggagcaagcgaggctcccaggacccccaccctct 
gggacaggccctcccccatgttcttctgtctccaggaagggcagcggccctcccatcact 
ggaagtctgcagtgggggtcgctgggggtggagagaacactaagaggtgaacatgtatga 
gtgtgtgcacgcgtgtgagtgtgcatgtgtgtgtgtgtgcaaaggtccgaagagtggcct 
tttcccgtagccctgcgccccctttctgtggctggatggggagacaggtcagggcccccc 
accctctccagcccctgcagcaaatgactactgcacctggacagcctcctcttttctaga 
agtctatttatattgtcattttataacactctagcccctgcccttattgggggacagatg 
gtccctgtcctgcggggtggccctggcagaaccactgcctgaagaaccaggttcctgccc 
ggtcagcgcagccccagcccgcccacccctgcctcgagttagttttacaattaaaacatt 
gtcttgttt 

SEQ ID NO: 147. >R09837JT15 # TY Transcript # LN 5818 # Source Gene: R09837 

# Encoded protein: R09837_P3 

gagcgaggcctgggtggcgttctcaagacgaggtgacatggcggcgggagtgtccgcggt 
ggtggcggtgcaagagagctgagggaggcgcgagggcgcggagttccaggtcgagcagtt 
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Figure 1 (Cont'd) 



aggccgcgagcgactgcggcgccgagccgatgagtaacccgaagcccctagaggagtggt 
cacctgcctgagggcacttctgtcccaccagcatcagaccaggtgtgtgcaggccgcacc 
gagtccccggcaccatgtttgggaagaggaagaagcgggtggagatctccgcgccgtcca 
acttcgagcaccgcgtgcacacgggcttcgaccagcacgagcagaagttcacggggctgc 
cccgccagtggcagagcctgatcgaggagtcggctcgccggcccaagcccctcgtcgacc 
ccgcctgcatcacctccatccagcccggggcccccaagggggagcctcatgacgtggccc 
ctaacgggccatcagcggggggcctggccatcccccagtcctcctcctcctcctcccggc 
ctcccacccgagcccgaggtgcccccagccctggagtgctgggaccccacgcctcagagc 
cccagctggcccctccagcctgcacccccgccgcccctgctgttcctgggccccctggcc 
cccgctcaccacagcgggagccacagcgagtatcccatgagcagttccgggctgccctgc 
agctggtggtggacccaggcgacccccgctcctacctggacaacttcatcaagattggcg 
agggctccacgggcatcgtgtgcatcgccaccgtgcgcagctcgggcaagctggtggccg 
tcaagaagatggacctgcgcaagcagcagaggcgcgagctgctcttcaacgaggtggtaa 
tcatgagggactaccagcacgagaatgtggtggagatgtacaacagctacctggtggggg 
acgagctctgggtggtcatggagttcctggaaggaggcgccctcaccgacatcgtcaccc 
acaccaggatgaacgaggagcagatcgcggccgtgtgccttgcagtgctgcaggccctgt 
cggtgctccacgcccagggcgtcatccaccgggacatcaagagcgactcgatcctgctga 
cccatgatggcagggtgaagctgtcagactttgggttctgcgcccaggtgagcaaggaag 
tgccccgaaggaagtcgctggtcggcacgccctactggatggccccagagctcatctccc 
gccttccctacgggccagaggtagacatctggtcgctggggataatggtgattgagatgg 
tggacggagagcccccctacttcaacgagccacccctcaaagccatgaagatgattcggg 
acaacctgccaccccgactgaagaacctgcacaaggtgtcgccatccctgaagggcttcc 
tggaccgcctgctggtgcgagaccctgcccagcgggccacggcagccgagctgctgaagc 
acccattcctggccaaggcagggccgcctgccagcatcgtgcccctcatgcgccagaacc 
gcaccagatgaggcccagcgcccttcccctcaaccaaagagccccccgggtcacccccgc 
cccactgaggccagtagggggccaggcctcccactcctcccagcccgggagatgctccgc 
gtggcaccaccctccttgctgggggtagatgagaccctactactgaactccagttttgat 
ctcgtgacttttagaaaaacacagggactcgtgggagcaagcgaggctcccaggaccccc 
accctctgggacaggccctcccccatgttcttctgtctccaggaagggcagcggccctcc 
catcactggaagtctgcagtgggggtcgctgggggtggagagaacactaagaggtgaaca 
tgtatgagtgtgtgcacgcgtgtgagtgtgcatgtgtgtgtgtgtgcaaaggtccagcca 
ccccgtcctccagcctgcaaggggtgtctggcgccttgcctgacacccagccccctctcc 
ccctgagccattgtgggggtcgatcatgaatgtccgaagagtggccttttcccgtagccc 
tgcgccccctttctgtggctggatggggagacaggtcagggccccccaccctctccagcc 
cctgcagcaaatgactactgcacctggacagcctcctcttttctagaagtctatttatat 
tgtcattttataacactctagcccctgcccttattgggggacagatggtccctgtcctgc 
ggggtggccctggcagaaccactgcctgaagaaccaggttcctgcccggtcagcgcagcc 
ccagcccgcccacccctgcctcgagttagttttacaattaaaacattgtcttgttttctt 
gttttgtgtctgtgtgcgatgtgtggggggcagggggccctgcccggctgtcttgggtgg 
gaatttgcagggagagggtctggatctgggagcaaaccacgattccagccaaggcagggc 
aagggtggggtggggagtggggagttcaggtcatagcagccagtaagctcccccagcctg 
ccactccccagaatggggcaggattgtccccacccctggaagcagccagtttgccacagt 
ccatgtgcagactgatcccagtttgccaaatctgcaatttcctggaaccttttaaaggct 
gtcttgagcgcgtttggtgagtaggagctaacccaagttagtaaattgaaggccatttgg 
caaattggtcagtgggcagatgggcttttggggattgactgaggctgactggcctggagc 
tgctggcttcggagagacaccctgtgaagtgtgtccttccacgcaggagcccagagccga 
gcccacgctggggggaatctgactggcatggaggtggccatgccaccatcgctgctgcag 
ctgcatcctggcactttgcgcctcaggccctgttgggctccactttctgcatcctcccca 
gcccccagggaggcagtggagtggggagagagccaggagtgagcctccgtccccaaagcc 
agccaggcgtcatcagcaccagagacctcagcctggtcctctcgggaagtgagtggccag 
ggcagagattccaggttagtccacgcctcccacccttcacaggtcctgaccccaagaatc 
agagcactgtgtgtgtggcagggcctatgccaagtgcaaacacagcctagatggatcatc 
acagagtgaaacccagcggtgcaagcagctgtgctctctgcgatgtattggaggcttagg 
tgaggtggatgcctttctggaaaaaaaaaaaatgctaacattggcaaaagaagaaataga 
aaacaagaccaaaataactgtctcctcactgcacacacactccagaataaataaaaggtt 
tcaggcttgaatgcactttcaaatgagattttttttttttttttgagacggagtatcgct 
ctgtcgcccaggctggagtgcrgtggcacgrtcttggctcrctgcaacttctgcctcctg 
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Figure 1 (Cont'd) 



ggttcaagcgattctcctgcctcagcctcccargtrgctgggattrcaggcgcttgccac 
cacgccrggctarttttttgtattttgrgtagagactggrtttcgccatcttggccagac 
tggtcttgaactcctgacctcgtgatccrcccacctcagcctcccaargtgctgggatta 
caggcatgagccaccatgcctggcctcaaatgaggtttaccagactttgaaggagcaggt 
aattccttctaccttgtgaacaagttgttccagaaagatagcagctcaggaggcctctgt 
gaccatggttccagacccagataaggacggcaaagaacagagcatctcagaaacgcaagg 
ctcacagccagggtgcccggcgaccccacgggcactgagaacagctagctctaggagctc 
cactctcctgctgaagaaaccacgggctcagagacggggagctccctcgcccagccacat 
ctgtgaccccacggtaactctgctggttttggtgccttcagtcactcactgcaggtttgt 
tttgttttgttttgttttgagtttttttgttttttgttttttgtttttagagacrrgatg 
ttgctatgttgcctaggctggtttcgaactcctggcctcargggatcctccctccttggc 
cccgcaargtgctgggattrcaggcgtgagccrccgtgcccrgcccgctcactgcagttt 
gaaggcatggctttgggtggcgtggggtgaaagctgcccgaggccccgttcctccccacg 
tggctgcctcctgccagagccagtcaggaaaacagaccccaactagagttgtttcaaatg 
gcagggatttggtaccggtggttggatcatgacaaagctctgagaaggctggaggagcca 
cagagtgccaagtgcccagcaatcattagaggaaggaggctgctgccacctgtgtggcta 
gaggaacagaggggccaatggcattccccaaaccccactctcgcctctgtctggccagag 
cagaatggcttcttccagcttcccaccctggactcccacccaggagcctcctcctggcag 
acccttcctgaccccaccggcccggggggtctacagatccatgtttcaggcgtccgcctg 
gagcggaacaggggagtgcttaggacaagggtggtgccagaggatccactctgcccacat 
ttagttgaccagctgaggcactccacgggaatgaatgactctcgacaggtgccggaggtg 
aggaggggcccggaggcccaggaggggcacagggatggattcgtccgcctgggggctgga 
ggtgtgtttacagagccccaaaataaacaatgcaaccaggtcagaccagcggttctcaca 
gtgtggttcccagaccagcagcatcactgggagcttactggacacgcaaataaatccctg 
tgtgccaccccagctgcatcagatgctggtggggcccagtgatctgtatttaacacaccc 
tccgggggatgccggtgcccactcacgtttgagaacccctgcgatccacgactgccctcc 
cgtgtaaaaggcccacctctgtgggactccaagtcatcagcaccctagggtccttccgtc 
tttttccttcctcctgggacacctgcctctcccatgtcgtattagagaattccttatgct 
cccaagtgggcacggggagaggaaggcactcctctttaaggaccgacccagaggttttgc 
cattgcttcactggccagagcttagtcacgcagcctcacccagaggcaagggaggttgga 
aaatgtagtgtttgtgtgtgtctaacacaaattctattaccatgcagtcaggattctcca 
ctcttgctctttcattagatttgctgggcttcaccctggactttctgatttagtgacaga 
acagagaacccagaggcagacccagatgtgtacaagggcttcatatacaatcaggagatt 
taataatcatgctaggggccgggtgcagtggctcacgcctgtaatcccaagcactttggg 
gagccgaggcaggcrgatcacttgaggtcaggagtttgagaccagcctgggcaacaaagt 
gagaccctgtctctactaaaaataacaaaaattagccgggcgtggtggtgggygcctgta 
atcccagctcctcgggtggctgaggcatgagaatcacttgaacccaggaggcagaggttt 
cagtgagctgagatcacatcactgcactccagcctgggtgacagagtgagattcygtc 

SEQ ID NO: 148. >R09837JT16 # TY Transcript # LN 2644 # Source Gene: R09837 
# Encoded protein: R09837_P6 

actggtgggcggtgaggggggaaagccccggatgttcgttggggattcaacatggcggcg 
ggagtgtccgcggtggtggcggtgcaagagagctgagggaggcgcgagggcgcggagttc 
caggtcgagcagttaggccgcgagcgactgcggcgccgagccgatgagtaacccgaagcc 
cctagaggagtggtcacctgcctgagggcacttctgtcccaccagcatcagaccaggtgt 
gtgcaggccgcaccgagtccccggcaccatgtttgggaagaggaagaagcgggtggagat 
ctccgcgccgtccaacttcgagcaccgcgtgcacacgggcttcgaccagcacgagcagaa 
gttcacggggctgccccgccagtggcagagcctgatcgaggagtcggctcgccggcccaa 
gcccctcgtcgaccccgcctgcatcacctccatccagcccggggcccccaagaccatcgt 
gcggggcagcaaaggtgccaaagatggggccctcacgctgctgctggacgagtttgagaa 
catgtcggtgacacgctccaactccctgcggagagacagcccgccgccgcccgcccgtgc 
ccgccaggaaaatgggatgccagaggagccggccaccacggccagagggggcccagggaa 
ggcaggcagccgaggccggttcgccggtcacagcgaggcgggtggcggcagtggtgacag 
gcgacgggcggggccagagaagaggcccaagtcttccagggagggctcagggggtcccca 
ggagtcctcccgggacaaacgccccctctccgggcctgatgtcggcaccccccagcctgc 
tggtctggccagtggggcgaaactggcagctggccggccctttaacacctacccgagggc 
tgacacggaccacccatcccggggtgcccagggggagcctcatgacgtggcccctaacgg 
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Figure 1 (Cont'd) 



gccatcagcggggggcctggccatcccccagtcctcctcctcctcctcccggcctcccac 
ccgagcccgaggtgcccccagccctggagtgctgggaccccacgcctcagagccccagct 
ggcccctccagcctgcacccccgccgcccctgctgttcctgggccccctggcccccgctc 
accacagcgggagccacagcgagtatcccatgagcagttccgggctgccctgcagctggt 
ggtggacccaggcgacccccgctcctacctggacaacttcatcaagattggcgagggctc 
cacgggcatcgtgtgcatcgccaccgtgcgcagctcgggcaagctggtggccgtcaagaa 
gatggacctgcgcaagcagcagaggcgcgagctgctcttcaacgaggtggtaatcatgag 
ggactaccagcacgagaatgtggtgaagctgtcagactttgggttctgcgcccaggtgag 
caaggaagtgccccgaaggaagtcgctggtcggcacgccctactggatggccccagagct 
catctcccgccttccctacgggccagaggtagacatctggtcgctggggataatggtgat 
tgagatggtggacggagagcccccctacttcaacgagccacccctcaaagccatgaagat 
gattcgggacaacctgccaccccgactgaagaacctgcacaaggtgtcgccatccctgaa 
gggcttcctggaccgcctgctggtgcgagaccctgcccagcgggccacggcagccgagct 
gctgaagcacccattcctggccaaggcagggccgcctgccagcatcgtgcccctcatgcg 
ccagaaccgcaccagatgaggcccagcgcccttcccctcaaccaaagagccccccgggtc 
acccccgccccactgaggccagtagggggccaggcctcccactcctcccagcccgggaga 
tgctccgcgtggcaccaccctccttgctgggggtagatgagaccctactactgaactcca 
gttttgatctcgtgacttttagaaaaacacagggactcgtgggagcaagcgaggctccca 
ggacccccaccctctgggacaggccctcccccatgttcttctgtctccaggaagggcagc 
ggccctcccatcactggaagtctgcagtgggggtcgctgggggtggagagaacactaaga 
ggtgaacatgtatgagtgtgtgcacgcgtgtgagtgtgcatgtgtgtgtgtgtgcaaagg 
tccagccaccccgtcctccagcctgcaaggggtgtctggcgccttgcctgacacccagcc 
ccctctccccctgagccattgtgggggtcgatcatgaatgtccgaagagtggccttttcc 
cgtagccctgcgccccctttctgtggctggatggggagacaggtcagggccccccaccct 
ctccagcccctgcagcaaatgactactgcacctggacagcctcctcttttctagaagtct 
atttatattgtcattttataacactctagcccctgcccttattgggggacagatggtccc 
tgtcctgcggggtggccctggcagaaccactgcctgaagaaccaggttcctgcccggtca 
gcgcagccccagcccgcccacccctgcctcgagttagttttacaattaaaacattgtctt 
gttt 

SEQ ID NO: 149. >R09837_T17 # TY Transcript # LN 914 t Source Gene: R09837 

# Encoded protein: R09837_P7 

actggtgggcggtgaggggggaaagccccggatgttcgttggggattcaacatggcggcg 
ggagtgtccgcggtggtggcggtgcaagagagctgagggaggcgcgagggcgcggagttc 
caggtcgagcagttaggccgcgagcgactgcggcgccgagccgatgagtaacccgaagcc 
cctagaggagtggtcacctgcctgagggcacttctgtcccaccagcatcagaccaggtgt 
gtgcaggccgcaccgagtccccggcaccatgtttgggaagaggaagaagcgggtggagat 
ctccgcgccgtccaacttcgagcaccgcgtgcacacgggcttcgaccagcacgagcagaa 
gttcacggggctgccccgccagtggcagagcctgatcgaggagtcggctcgccggcccaa 
gcccctcgtcgaccccgcctgcatcacctccatccagcccggggcccccaagaccatcgt 
gcggggcagcaaaggtgccaaagatggggccctcacgctgctgctggacgagtttgagaa 
catgtcggtgacacgctccaactccctgcggagagacagcccgccgccgcccgcccgtgc 
ccgccaggaaaatgggatgccagaggagccggccaccacggccagagggggcccagggaa 
ggcaggcagccgaggccggttcgccggtcacagcgaggcgggtggcggcagtggtgacag 
gcgacgggcggggccagagaagaggcccaagtcttccagggagggctcagggagtctccg 
gacaagccctctcgggcctgttcgggccctgctgtggtctgcgtggggaactggtggcgg 
cttaactaccgggtgcgccctccggtcccccctggctgggcgctccctcccggagcggcc 
ccttggtgtccttg 

SEQ ID NO: 150. >R09837_T18 # TY Transcript # LN 510 # Source Gene: R09837 

# Encoded protein: R09837_P8 

cacaccagctcggggttgcacaccagctcggggttgcacaccagctcggggttgcacacc 
agctcggggttgcacaccagctcggggttgcacaccaactcggggttgcacaccagctcg 
gggttgcacaccagctcggggttgcacaccagctcggggttccattgtgggggtcgatca 
tgaatgtccgaagagtggccttttcccgtagccctgcgccccctttctgtggctggatgg 
ggagacaggtcagggccccccaccctctccagcccctgcagcaaatgactactgcacctg 
gacagcctcctcttttctagaagtctatttatattgtcattttataacactctagcccct 
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gcccttattgggggacagatggtccctgtcctgcggggtggccctggcagaaccactgcc 
tgaagaaccaggttcctgcccggtcagcgcagccccagcccgcccacccctgcctcgagt 
tagttttacaattaaaacattgtcttgttt 

SEQ ID NO: 151. >R09837_P1 # TY Protein # CC #LN 591 # Source Gene: R09837 

# Encoding Transcript: 1 " 

MFGKRKKRVEISAPSNFEHRVHTGFDQHEQKFTGLPRQWQSLIEESARRPKPLVDPACIT 
SIQPGAPKTIVRGSKGAKDGALTLLLDEFENMSVTRSNSLRRDSPPPPARARQENGMPEE 
PATTARGGPGKAGSRGRFAGHSEAGGGSGDRRRAGPEKRPKSSREGSGGPQESSRDKRPL 
SGPDVGTPQPAGLASGAKLAAGRPFNTYPRADTDHPSRGAQGEPHDVAPNGPSAGGLAIP 
QSSSSSSRPPTRARGAPSPGVLGPHASEPQLAPPACTPAAPAVPGPPGPRSPQREPQRVS 
HEQFRAALQLWDPGDPRSYLDNFIKIGEGSTGIVCIATVRSSGKLVAVKKMDLRKQQRR 
ELLFNEVVIMRDYQHENVVEMYNSYLVGDELWVVMEFLEGGALTDIVTHTRMNEEQIAAV 
CLAVLQALSVLHAQGV I HRDI KSDS ILLTH DGRVKLS DFGFCAQVS KEVPRRKSLVGT PY 
WMAPELISRLPYGPEVDIWSLGIMVIEMVDGEPPYFNEPPLKAMKMIRDNLPPRLKNLHK 
VSPSLKGFLDRLLVRDPAQRATAAELLKHPFLAKAGPPASIVPLMRQNRTR 

SEQ ID NO: 152. >R09837_P2 # TY Protein # CC #LN 501 # Source Gene: R09837 

# Encoding Transcript: 5 

MFGKRKKRVEISAPSNFEHRVHTGFDQHEQKFTGLPRQWQSLIEESARRPKPLVDPACIT 
SIQPGAPKTIVRGSKGAKDGALTLLLDEFENMSVTRSNSLRRDSPPPPARARQENGMPEE 
PATTARGGPGKGEPHDVAPNGPSAGGLAIPQSSSSSSRPPTRARGAPSPGVLGPHASEPQ 
LAPPACTPAAPAVPGPPGPRSPQREPQRVSHEQFRAALQLWDPGDPRSYLDNFIKIGEG 
STGIVCIATVRSSGKLVAVKKMDLRKQQRRELLFNEWIMRDYQHENWEMYNSYLVGDE 
LWVVMEFLEGGALTDIVTHTRMNEEQIAAVCLAVLQALSVLHAQGVIHRDIKSDSILLTH 
DGRVKLSDFGFCAQVSKEVPRRKSLVGTPYWMAPELISRLPYGPEVDIWSLGIMVIEMVD 
GEPPYFNEPPLKAMKMIRDNLPPRLKNLHKVSPSLKGFLDRLLVRDPAQRATAAELLKHP 
FLAKAGPPASIVPLMRQNRTR 

SEQ ID NO: 153. >R09837_P3 # TY Protein # CC #LN 438 # Source Gene: R09837 

# Encoding Transcript: 6 

MFGKRKKRVEISAPSNFEHRVHTGFDQHEQKFTGLPRQWQSLIEESARRPKPLVDPACIT 
SIQPGAPKGEPHDVAPNGPSAGGLAIPQSSSSSSRPPTRARGAPSPGVLGPHASEPQLAP 
PACTPAAPAVPGPPGPRSPQREPQRVSHEQFRAALQLWDPGDPRSYLDNFIKIGEGSTG 
IVCIATVRSSGKLVAVKKMDLRKQQRRELLFNEVVIMRDYQHENWEMYNSYLVGDELWV 
VMEFLEGGALTDIVTHTRMNEEQIAAVCLAVLQALSVLHAQGVIHRDIKSDSILLTHDGR 
VKLSDFGFCAQVSKEVPRRKSLVGTPYWMAPELISRLPYGPEVDIWSLGIMVIEMVDGEP 
PYFNEPPLKAMKMIRDNLPPRLKNLHKVSPSLKGFLDRLLVRDPAQRATAAELLKHPFLA 
KAG P PAS I V P LMRQNRT R 

SEQ ID NO: 154. >R09837_P4 # TY Protein # CC #LN 405 # Source Gene: R09837 

# Encoding Transcript: 9 

MGCQRSRPSGAKLAAGRPFNTYPRADTDHPSRGAQGEPHDVAPNGPSAGGLAIPQSSSSS 
SRPPTRARGAPSPGVLGPHASEPQLAPPACTPAAPAVPGPPGPRSPQREPQRVSHEQFRA 
ALQLWDPGDPRSYLDNFIKIGEGSTGIVCIATVRSSGKLVAVKKMDLRKQQRRELLFNE 
WIMRDYQHENWEMYNSYLVGDELWWMEFLEGGALTDIVTHTRMNEEQIAAVCLAVLQ 
ALSVLHAQGVI HRDI KSDS I LLTH DGRVKLS DFG FCAQVS KEVPRRKSLVGT PYWMAPEL 
ISRLPYGPEVDIWSLGIMVIEMVDGEPPYFNEPPLKAMKMIRDNLPPRLKNLHKVSPSLK 
GFLDRLLVRDPAQRATAAELLKHP FLAKAGPPASIVPLMRQNRTR 

SEQ ID NO: 155. >R09837_P9 # TY Protein # CC #LN 426 # Source Gene: R09837 

# Encoding Transcript: 10 

MFGKRKKRVEISAPSNFEHRVHTGFDQHEQKFTGLPRQWQSLIEESARRPKPLVDPACIT 
SIQPGAPKTIVRGSKGAKDGALTLLLDEFENMSVTRSNSLRRDSPPPPARARQENGMPEK 
PPGPRSPQREPQRVSHEQFRAALQLWDPGDPRSYLDNFIKIGEGSTGIVCIATVRSSGK 
LVAVKKMDLRKQQRRELLFNEWIMRDYQHENWEMYNSYLVGDELWWMEFLEGGALTD 
IVTHTRMNEEQIAAVCLAVLQALSVLHAQGVIHRDIKSDSILLTHDGRVKLSDFGFCAQV 
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SKEVPRRKSLVGTPYWMAPELISRLPYGPBVDIWSLGIMVIEMVDGEPPYFNEPPLKAMK 
MIRDNLPPRLKNLHKVSPSLKGFLDRLLVRDPAQRATAAELLKHPFLAKAGPPASIVPLM 
RQNRTR 

SEQ ID NO: 156. >R09837_P6 # TY Protein # CC #LN 516 # Source Gene: R09837 

# Encoding Transcript: 16 

MFGKRKKRVEISAPSNFEHRVHTGFDQHEQKFTGLPRQWQSLIEESARRPKPLVDPACIT 
SIQPGAPKTIVRGSKGAKDGALTLLLDEFENMSVTRSNSLRRDSPPPPARARQENGMPEE 
PATTARGGPGKAGSRGRFAGHSEAGGGSGDRRRAGPEKRPKSSREGSGGPQESSRDKRPL 
SGPDVGTPQPAGLASGAKLAAGRPFNTYPRADTDHPSRGAQGEPHDVAPNGPSAGGLAIP 
QSSSSSSRPPTRARGAPSPGVLGPHASEPQLAPPACTPAAPAVPGPPGPRSPQREPQRVS 
HEQFRAALQLWDPGDPRSYLDNFIKIGEGSTGIVCIATVRSSGKLVAVKKMDLRKQQRR 
ELLFNEVVIMRDYQHENWKLSDFGFCAQVSKEVPRRKSLVGTPYWMAPELISRLPYGPE 
VDIWSLGIMVIEMVDGEPPYFNEPPLKAMKMIRD^3LPPRLKNLHKVSPSLKGFLDRLLVR 
DPAQRATAAELLKHPFLAKAGPPASIVPLMRQNRTR 

SEQ ID NO: 157 . >R09837_P7 # TY Protein 8 CC #LN 215 # Source Gene: R09837 

# Encoding Transcript: 17 

MFGKRKKRVEI SAPSNFEHRVHTGFDQHEQKFTGLPRQWQSLI EES ARRPKPLVDPACIT 
SIQPGAPKTIVRGSKGAKDGALTLLLDEFENMSVTRSNSLRRDSPPPPARARQENGMPEE 
PATTARGGPGKAGSRGRFAGHSEAGGGSGDRRRAGPEKRPKSSREGSGVSGKPSRACSGP 
AWCVGNWWRLNYRVLPSGPPWLGAPSRSGPLVSL 

SEQ ID NO: 158 . >R09837_P8 # TY Protein # CC #LN 60 # Source Gene: R09837 

# Encoding Transcript: 18 

HTSSGLHTSSGLHTSSGLHTSSGLHTSSGLHTNSGLHTSSGLHTSSGLHTSSGFHCGGRS 



SEQ ID NO: 159. >R14324 # TY Consensus # Length 5786 # Number of exons 21 

ccaccgagaagccgaggaggcaaggctcgcgagagttcagggaggccgccctgagattcc 

ggcgaggccgcgggtcccacctcccgggggcggggcgagggcggagcggggagaagggag 

ctgacgggcgcccggccggctgcggtccgtgcggaggctgagccggccgcgggcgcgacc 

ggaggcagtttccgttactatggcaatgacggcagggactacaacaacctttcctatgag 

caaccatacccgggaaagagtgactgtagccaagctcacattggagaatttttatagcaa 

cctaattttacagcatgaagagagagaaaccaggtatagtaagggctaatcaaaactttt 

tttagttaaataagctgttgtgctgtgagtggaatagaaacttgtttaccatgaatgata 

tgaatattatggatccctgcatgctgtttttgagtttcctgtactgacagtgcacataga 

catgtgttttattttcacagagtagtttcctgaggtgccccaagtcttgtgcatctatat 

acccgtatgacatgccaattaacgttttaaacaaaccttttccaaacatgaaattatgat 

atgtctttacattttcttttaaagggatgcctggcttggtgtggtgtggctcatgcctat 

aatcccagcactttgggaagccgaggtgggcagatcacctgaggtcaggagttcaagacc 

agcctggccaacatggtgaaaccccatctctactgaaaatacaaaaaattagccgggtgt 

ggtggcgggtgcctgtaatcccagctactcaggaggctgaggcaggagaattgcttgaag 

gcagcagtgagccaaggctgcagtgagccaagatcacaccactgcactccagcctgggag 

gagaaaactggaaatcattcaacaagcagcactgaagtgcctaggggcagaagaaattag 

aagtggccatggaagaagaaggattagcagatgaagaggtaatgtaattacctataatta 

ctgtacaaaagcacccccagtgacttt teat teat ttgacatcagaaaaagttacgtcga 

tcacaacacgctcgcaaagaaacagagttcttacggctcaaaaggaccagacttggcttg 

gatgactttgagtctctgaaagttataggaagaggagcttttggagaggtgcggttggtc 

cagaagaaagatacaggccatatctatgcaatgaagatattgagaaagtctgatatgctt 

gaaaaagagcaggtggcccatatccgagcagaaagagatattttggtagaagcagatggt 

gcctgggtggtgaagatgttttacagttttcaggataagaggaatctttatctaatcatg 

gaatttctccctggaggtgacatgatgacattgctaatgaagaaagacaccttgacagaa 
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gaggaaacacagttctacatttcagagactgttctggcaatagatgcgatccaccagttg 
ggtttcatccatcgggatattaagccagacaaccttttattggatgccaagggtcatgta 
aaattatctgattttggtttatgtacgggattaaagaaagctcacaggactgaattttat 
agaaatctcacacacaacccaccaagtgacttctcatttcagaacatgaactcaaagagg 
aaagcagaaacttggaagaagaacaggagacaactggcdtattccacagttgggacacca 
gattacattgctccagaagtattcatgcagactggttacaacaaattgtgtgactggtgg 
tctttgggagtgattatgtatgaaatgctaataggatatccacctttctgctctgaaaca 
cctcaagaaacatacagaaaagtgatgaactggaaagaaactctggtatttcctccagag 
gtacctatatctgagaaagccaaggacttaattctcagattttgtattgattctgaaaac 
agaattggaaatagtggagtagaagaaataaaaggtcatcccttttttgaaggtgtcgac 
tgggagcacataagggaaaggccagcagcaatccctatagaaatcaaaagcattgatgat 
acttcaaattttgatgacttccctgaatctgatattttacaaccagtgccaaataccaca 
gaaccggactacaaatccaaagactgggtttttctcaattatacctataaaaggtttgaa 
gggttgactcaacgtggctctatccccacctacatgaaagctgggaagttatgaatgaag 
ataacattcacccataaccaagagaactcaggtagctgcatcaccaggcttgcttggcgt 
agataacaatacactgaaatactcctgaagatggtggtgcttattgactacaagaggaaa 
ttctacaggattaggatttctaagactactataggaattggttggcagtgccagctggct 
cttttttttaatattttattatttttgttaactttattatatgaaggtactggaataaaa 
ggaacagacatccctttctaactgcactgcctacatgcgtattaaggtccattctgcctg 
tgtgtgctgtggctttgaactgtaacacctctaatcaattcaggagaaacacatatcatt 
taaagcaacataggctaacctgtaggtaacactgcagtattgatgttttactgcaaatct 
tatgggtctagataatcagtaaaagccatcttccatagttggtgttagaacattgcccta 
ttggtttggacatctgtagaatatatatgaagacaatttctgtaatggttttaagagatt 
taaaaagaaattcactggttctttacaaaatagaatttatcatcaagttattacacaaac 
ttcacagtaaggagtgacaagtttataataaggaagacaaagtttaacaccttcactcaa 
gcactccactaatatatttacgttgcattcagaaatactgatgaccttcatatacgtagt 
ctgtatactcatagggagatgtactgtattatataacatgtaaagttgattttcttgtga 
caagagaacttctttttttaacaagaggacatggcattattttaatttgattatggtgag 
ttgaatttaagacatgaccatgaaggctgcttgtagaattagtgtatttttattaaacta 
tttttttaaatgtcaaacttctatcatgtaaatggacttatagagaacaaaaagctattt 
actttggttttctagaaagttgttacatatcatggctggttaacttttatttcttttgat 
gaaaatttttcctttgatagtacttgtattattgtgccattattttcttatgctccaaat 
gtaccaaagatcttgaacagagtggatgttcacaactgagtagaattttcctttcctgtg 
ggcatgctgtgttcagacctgacagatctttgatagaggtcagcttattaaagggcaata 
ttgttcttgtttagctacatcactgtggtgaatatagatggaattaaggaagtaaatgca 
ggccagggggttgtgatgagaggataggggagataatatcagcatcaaattctttgggta 
tctctctaagaattaaataatcttttctagcttaatattttaattctaattcaaacaact 
ctgaggttttggtttcattagtaatagttgaggaataatatactagcaaagaatggccta 
atgtttgtcataactgttaatggatgaaattttttaaagatacaaccatgataaccatta 
taaatgatctatgatcaaaatctaaagtgatgaattatttgtaggaatgtcttcctaatg 
gggaagaattgcataggagcattatgcaaatctacacaagcttttataaatgttgctgct 
gggtagctccacagtgtttcataaggccatcctgtttcccccaactcccccatttttggt 
ttgtttctttttaaatatttgttgagtacttacgtgtttatctaacagttcacttccatt 
tttctagtctggattttttgagtatttaggaaagagagctattaaaaactctggggattt 
ctcaatgtgactaactctaatttttctaattataactgcctttaattaacataatattaa 
cttttgctgaggtttatgagattttctcaccccacatcgctcccctttttttaaaaagga 
ctgttttgctagtgtgataatgaataggtaagatatgagataattgcaacattgtctagt 
tctagtatggtaactattcttgaaatggtattgaaaaataccgttaattcaaattgacag 
agattgataaaaagaaactgatttacctaagtttactttttaattgcataatagagcatt 
ttttgttttgagttccctcattcttattaccagaaagagcttgcaaatagttttactttc 
ttggcactggaagggtagttctggaaagctactttgttgagagtctcattcttccctgga 
gttaatagagtgattcacaatctttggggttttctcctcatcaaaagcatttcttaagtg 
cctatctaaaagcaattaaagactgtgtctgccctttagaagctaagaatttgattcatg 
atgcaaattaactagataatttgcaaagtacccttgagattgaattttctctattatata 
tttcccatatttcaggtgaataatttaatttaaatgacaaaaccctatctagtcaactgg 
gcataatgacattttctttaaattagactctattttgaattaaaagagttttattataaa 
ccgtgtgtttttggtttttctaagtatatagaaagcttgtataattcagatttatcaatt 
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tcctgatttaatgtagactttgacttttttattaaaaacctttgtattaaagcaagttat 
gttatttttcttttatgcatttattactaacatagctttaaatctttaaatgtattgaag 
cattgtgctgtctgaaaataaggaattgcttataaaccagccacttctgaatacaatatg 
tagctgatttaataagctagttagtgaatggaaaataagtgtggagtattaaaaatgttc 
tttggttggtaaggcctaagatagggtttcatttatttctatactttttctgttttttaa 
acacctgcatatttttatgtaaatctctaaatttaaaatattttaagtacatttattttt 
ggtgttttattgtataaaaccttagacaatcaatcagtcagtctttactgacaggagcag 
cagctatctgtcttttgctgatctacaaataaatgaattgagaatttagtccatagaggt 
ccctggctaccaaacacattctcctttgaattgttaaaattcagaacattcaaaataact 
gttttgctacaacccatgattattttcctgttgtgtttatttaaatttactttctcttta 
gaagtgcacttatttctgaaaaatcttaatgaaacaaacgcttagaacaaatataaatat 
gagacacttgggactactagagatattttagatttttatgaaaaaaatgtgaggggatat 
tgctgctttaaaaaggaataaagtaataaaaatatatctcagctatttttttaaagcaat 
ataattcagcaattgtctagaaaagtaatcatgaggctactgagtttggtgttcagttac 
tgagtttcaaaaatgttttggtggcatgaggacaaaatttcattgaaggtaagataagaa 
taaaaactatgtttacaaaaaaactg 

SEQ ID NO: 160. >R14324JT1 # TY Transcript 8 LN 5106 « Source Gene: R14324 
# Encoded protein: R14324_P1 

ccaccgagaagccgaggaggcaaggctcgcgagagttcagggaggccgccctgagattcc 
ggcgaggccgcgggtcccacctcccgggggcggggcgagggcggagcggggagaagggag 
ctgacgggcgcccggccggctgcggtccgtgcggaggctgagccggccgcgggcgcgacc 
ggaggcagtttccgttactatggcaatgacggcagggactacaacaacctttcctatgag 
caaccatacccgggaaagagtgactgtagccaagctcacattggagaatttttatagcaa 
cctaattttacagcatgaagagagagaaaccaggcagaagaaattagaagtggccatgga 
agaagaaggattagcagatgaagagaaaaagttacgtcgatcacaacacgctcgcaaaga 
aacagagttcttacggctcaaaaggaccagacttggcttggatgactttgagtctctgaa 
agttataggaagaggagcttttggagaggtgcggttggtccagaagaaagatacaggcca 
tatctatgcaatgaagatattgagaaagtctgatatgcttgaaaaagagcaggtggccca 
tatccgagcagaaagagatattttggtagaagcagatggtgcctgggtggtgaagatgtt 
ttacagttttcaggataagaggaatctttatctaatcatggaatttctccctggaggtga 
catgatgacattgctaatgaagaaagacaccttgacagaagaggaaacacagttctacat 
ttcagagactgttctggcaatagatgcgatccaccagttgggtttcatccatcgggatat 
taagccagacaaccttttattggatgccaagggtcatgtaaaattatctgattttggttt 
atgtacgggattaaagaaagctcacaggactgaattttatagaaatctcacacacaaccc 
accaagtgacttctcatttcagaacatgaactcaaagaggaaagcagaaacttggaagaa 
gaacaggagacaactggcatattccacagttgggacaccagattacattgctccagaagt 
attcatgcagactggttacaacaaattgtgtgactggtggtctttgggagtgattatgta 
tgaaatgctaataggatatccacctttctgctctgaaacacctcaagaaacatacagaaa 
agtgatgaactggaaagaaactctggtatttcctccagaggtacctatatctgagaaagc 
caaggacttaattctcagattttgtattgattctgaaaacagaattggaaatagtggagt 
agaagaaataaaaggtcatcccttttttgaaggtgtcgactgggagcacataagggaaag 
gccagcagcaatccctatagaaatcaaaagcattgatgatacttcaaattttgatgactt 
ccctgaatctgatattttacaaccagtgccaaataccacagaaccggactacaaatccaa 
agactgggtttttctcaattatacctataaaaggtttgaagggttgactcaacgtggctc 
tatccccacctacatgaaagctgggaagttatgaatgaagataacattcacccataacca 
agagaactcaggtagctgcatcaccaggcttgcttggcgtagataacaatacactgaaat 
actcctgaagatggtggtgcttattgactacaagaggaaattctacaggattaggatttc 
taagactactataggaattggttggcagtgccagctggctcttttttttaatattttatt 
atttttgttaactttattatatgaaggtactggaataaaaggaacagacatccctttcta 
actgcactgcctacatgcgtattaaggtccattctgcctgtgtgtgctgtggctttgaac 
tgtaacacctctaatcaattcaggagaaacacatatcatttaaagcaacataggctaacc 
tgtaggtaacactgcagtattgatgttttactgcaaatcttatgggtctagataatcagt 
aaaagccatcttccatagttggtgttagaacattgccctattggtttggacatctgtaga 
atatatatgaagacaatttctgtaatggttttaagagatttaaaaagaaattcactggtt 
ctttacaaaatagaatttatcatcaagttattacacaaacttcacagtaaggagtgacaa 
gtttataataaggaagacaaagtttaacaccttcactcaagcactccactaatatattta 
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cgttgcattcagaaatactgatgaccttcatatacgtagtctgtatactcatagggagat 
gtactgtattatataacatgtaaagttgattttcttgtgacaagagaacttcttttttta 
acaagaggacatggcattattttaatttgattatggtgagttgaatttaagacatgacca 
tgaaggctgcttgtagaattagtgtatttttattaaactatttttttaaatgtcaaactt 
ctatcatgtaaatggacttatagagaacaaaaagctatttactttggttttctagaaagt 
tgttacatatcatggctggttaacttttatttcttttgatgaaaatttttcctttgatag 
tacttgtattattgtgccattattttcttatgctccaaatgtaccaaagatcttgaacag 
agtggatgttcacaactgagtagaattttcctttcctgtgggcatgctgtgttcagacct 
gacagatctttgatagaggtcagcttattaaagggcaatattgttcttgtttagctacat 
cactgtggtgaatatagatggaattaaggaagtaaatgcaggccagggggttgtgatgag 
aggataggggagataatatcagcatcaaattctttgggtatctctctaagaattaaataa 
tcttttctagcttaatattttaattctaattcaaacaactctgaggttttggtttcatta 
gtaatagttgaggaataatatactagcaaagaatggcctaatgtttgtcataactgttaa 
tggatgaaattttttaaagatacaaccatgataaccattataaatgatctatgatcaaaa 
tctaaagtgatgaattatttgtaggaatgtcttcctaatggggaagaattgcataggagc 
attatgcaaatctacacaagcttttataaatgttgctgctgggtagctccacagtgtttc 
ataaggccatcctgtttcccccaactcccccatttttggtttgtttctttttaaatattt 
gttgagtacttacgtgtttatctaacagttcacttccatttttctagtctggattttttg 
agtatttaggaaagagagctattaaaaactctggggatttctcaatgtgactaactctaa 
tttttctaattataactgcctttaattaacataatattaacttttgctgaggtttatgag 
attttctcaccccacatcgctcccctttttttaaaaaggactgttttgctagtgtgataa 
tgaataggtaagatatgagataattgcaacattgtctagttctagtatggtaactattct 
tgaaatggtattgaaaaataccgttaattcaaattgacagagattgataaaaagaaactg 
atttacctaagtttactttttaattgcataatagagcattttttgttttgagttccctca 
ttcttattaccagaaagagcttgcaaatagttttactttcttggcactggaagggtagtt 
ctggaaagctactttgttgagagtctcattcttccctggagttaatagagtgattcacaa 
tctttggggttttctcctcatcaaaagcatttcttaagtgcctatctaaaagcaattaaa 
gactgtgtctgccctttagaagctaagaatttgattcatgatgcaaattaactagataat 
ttgcaaagtacccttgagattgaattttctctattatatatttcccatatttcaggtgaa 
taatttaatttaaatgacaaaaccctatctagtcaactgggcataatgacattttcttta 
aattagactctattttgaattaaaagagttttattataaaccgtgtgtttttggtttttc 
taagtatatagaaagcttgtataattcagatttatcaatttcctgatttaatgtagactt 
tgacttttttattaaaaacctttgtattaaagcaagttatgttatttttcttttatgcat 
ttattactaacatagctttaaatctttaaatgtattgaagcattgtgctgtctgaaaata 
aggaattgcttataaaccagccacttctgaatacaatatgtagctgatttaataagctag 
ttagtgaatggaaaataagtgtggagtattaaaaatgttctttggttggtaaggcctaag 
atagggtttcatttatttctatactttttctgttttttaaacacctgcatatttttatgt 
aaatctctaaatttaaaatattttaagtacatttatttttggtgttttattgtataaaac 
cttagacaatcaatcagtcagtctttactgacaggagcagcagctatctgtcttttgctg 
atctacaaataaatgaattgagaatttagtccatagaggtccctggctaccaaacacatt 
ctcctttgaattgttaaaattcagaacattcaaaataactgttttgctacaacccatgat 
tattttcctgttgtgtttatttaaatttactttctctttagaagtgcacttatttctgaa 
aaatcttaatgaaacaaacgcttagaacaaatataaatatgagacacttgggactactag 
agatattttagatttttatgaaaaaaatgtgaggggatattgctgctttaaaaaggaata 
aagtaataaaaatatatctcagctatttttttaaagcaatataattcagcaattgtctag 
aaaagtaatcatgaggctactgagtttggtgttcagttactgagtttcaaaaatgttttg 
gtggcatgaggacaaaatttcattgaaggtaagataagaataaaaactatgtttacaaaa 
aaactg 

SEQ ID N0:161. >R14324_T2 # TY Transcript # LN 4234 # Source Gene: R14324 
# Encoded protein: R14324_P1 

ccaccgagaagccgaggaggcaaggctcgcgagagttcagggaggccgccctgagattcc 
ggcgaggccgcgggtcccacctcccgggggcggggcgagggcggagcggggagaagggag 
ctgacgggcgcccggccggctgcggtccgtgcggaggctgagccggccgcgggcgcgacc 
ggaggcagtttccgttactatggcaatgacggcagggactacaacaacctttcctatgag 
caaccatacccgggaaagagtgactgtagccaagctcacattggagaatttttatagcaa 
cctaattttacagcatgaagagagagaaaccaggcagaagaaattagaagtggccatgga 
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agaagaaggattagcagatgaagagaaaaagttacgtcgatcacaacacgctcgcaaaga 
aacagagttcttacggctcaaaaggaccagacttggcttggatgactttgagtctctgaa 
agttataggaagaggagcttttggagaggtgcggttggtccagaagaaagatacaggcca 
tatctatgcaatgaagatattgagaaagtctgatatgcttgaaaaagagcaggtggccca 
tatccgagcagaaagagatattttggtagaagcagatggtgcctgggtggtgaagatgtt 
ttacagttttcaggataagaggaatctttatctaatcatggaatttctccctggaggtga 
catgatgacattgctaatgaagaaagacaccttgacagaagaggaaacacagttctacat 
ttcagagactgttctggcaatagatgcgatccaccagttgggtttcatccatcgggatat 
taagccagacaaccttttattggatgccaagggtcatgtaaaattatctgattttggttt 
atgtacgggattaaagaaagctcacaggactgaattttatagaaatctcacacacaaccc 
accaagtgacttctcatttcagaacatgaactcaaagaggaaagcagaaacttggaagaa 
gaacaggagacaactggcatattccacagttgggacaccagattacattgctccagaagt 
attcatgcagactggttacaacaaattgtgtgactggtggtctttgggagtgattatgta 
tgaaatgctaataggatatccacctttctgctctgaaacacctcaagaaacatacagaaa 
agtgatgaactggaaagaaactctggtatttcctccagaggtacctatatctgagaaagc 
caaggacttaattctcagattttgtattgattctgaaaacagaattggaaatagtggagt 
agaagaaataaaaggtcatcccttttttgaaggtgtcgactgggagcacataagggaaag 
gccagcagcaatccctatagaaatcaaaagcattgatgatacttcaaattttgatgactt 
ccctgaatctgatattttacaaccagtgccaaataccacagaaccggactacaaatccaa 
agactgggtttttctcaattatacctataaaaggtttgaagggttgactcaacgtggctc 
tatccccacctacatgaaagctgggaagttatgaatgaagataacattcacccataacca 
agagaactcaggtagctgcatcaccaggcttgcttggcgtagataacaatacactgaaat 
actcctgaagatggtggtgcttattgactacaagaggaaattctacaggattaggatttc 
taagactactataggaattggttggcagtgccagctggctcttttttttaatattttatt 
atttttgttaactttattatatgaaggtactggaataaaaggaacagacatccctttcta 
actgcactgcctacatgcgtattaaggtccattctgcctgtgtgtgctgtggctttgaac 
tgtaacacctctaatcaattcaggagaaacacatatcatttaaagcaacataggctaacc 
tgtaggtaacactgcagtattgatgttttactgcaaatcttatgggtctagataatcagt 
aaaagccatcttccatagttggtgttagaacattgccctattggtttggacatctgtaga 
atatatatgaagacaatttctgtaatggttttaagagatttaaaaagaaattcactggtt 
ctttacaaaatagaatttatcatcaagttattacacaaacttcacagtaaggagtgacaa 
gtttataataaggaagacaaagtttaacaccttcactcaagcactccactaatatattta 
cgttgcattcagaaatactgatgaccttcatatacgtagtctgtatactcatagggagat 
gtactgtattatataacatgtaaagttgattttcttgtgacaagagaacttcttttttta 
acaagaggacatggcattattttaatttgattatggtgagttgaatttaagacatgacca 
tgaaggctgcttgtagaattagtgtatttttattaaactatttttttaaatgtcaaactt 
ctatcatgtaaatggacttatagagaacaaaaagctatttactttggttttctagaaagt 
tgttacatatcatggctggttaacttttatttcttttgatgaaaatttttcctttgatag 
tacttgtattattgtgccattattttcttatgctccaaatgtaccaaagatcttgaacag 
agtggatgttcacaactgagtagaattttcctttcctgtgggcatgctgtgttcagacct 
gacagatctttgatagaggtcagcttattaaagggcaatattgttcttgtttagctacat 
cactgtggtgaatatagatggaattaaggaagtaaatgcaggccagggggttgtgatgag 
aggataggggagataatatcagcatcaaattctttgggtatctctctaagaattaaataa 
tcttttctagcttaatattttaattctaattcaaacaactctgaggttttggtttcatta 
gtaatagttgaggaataatatactagcaaagaatggcctaatgtttgtcataactgttaa 
tggatgaaattttttaaagatacaaccatgataaccattataaatgatctatgatcaaaa 
tctaaagtgatgaattatttgtaggaatgtcttcctaatggggaagaattgcataggagc 
attatgcaaatctacacaagcttttataaatgttgctgctgggtagctccacagtgtttc 
ataaggccatcctgtttcccccaactcccccatttttggtttgtttctttttaaatattt 
gttgagtacttacgtgtttatctaacagttcacttccatttttctagtctggattttttg 
agtatttaggaaagagagctattaaaaactctggggatttctcaatgtgactaactctaa 
tttttctaattataactgcctttaattaacataatattaacttttgctgaggtttatgag 
attttctcaccccacatcgctcccctttttttaaaaaggactgttttgctagtgtgataa 
tgaataggtaagatatgagataattgcaacattgtctagttctagtatggtaactattct 
tgaaatggtattgaaaaataccgttaattcaaattgacagagattgataaaaagaaactg 
atttacctaagtttactttttaattgcataatagagcattttttgttttgagttccctca 
ttcttattaccagaaagagcttgcaaatagttttactttcttggcactggaagggtagtt 
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ctggaaagctactttgttgagagtctcattcttccctggagttaatagagtgattcacaa 
tctttggggttttctcctcatcaaaagcatttcttaagtgcctatctaaaagcaattaaa 
gactgtgtctgccctttagaagctaagaatttgattcatgatgcaaattaactagataat 
ttgcaaagtacccttgagattgaattttctctattatatatttcccatatttcaggtgaa 
taatttaatttaaatgacaaaaccctatctagtcaactgggcataatgacattttcttta 
aattagactctattttgaattaaaagagttttattataaaccgtgtgtttttggtttttc 
taagtatatagaaagcttgtataattcagatttatcaatttcctgatttaatgtagactt 
tgacttttttattaaaaacctttgtattaaagca 

SEQ ID NO: 162. >R14324_T3 # TY Transcript # LN 4301 # Source Gene: R14324 
# Encoded protein: R14324_P2 

ccaccgagaagccgaggaggcaaggctcgcgagagttcagggaggccgccctgagattcc 
ggcgaggccgcgggtcccacctcccgggggcggggcgagggcggagcggggagaagggag 
ctgacgggcgcccggccggctgcggtccgtgcggaggctgagccggccgcgggcgcgacc 
ggaggcagtttccgttactatggcaatgacggcagggactacaacaacctttcctatgag 
caaccatacccgggaaagagtgactgtagccaagctcacattggagaatttttatagcaa 
cctaattttacagcatgaagagagagaaaccaggcagaagaaattagaagtggccatgga 
agaagaaggattagcagatgaagaggtaatgtaattacctataattactgtacaaaagca 
cccccagtgacttttcattcatttgacatcagaaaaagttacgtcgatcacaacacgctc 
gcaaagaaacagagttcttacggctcaaaaggaccagacttggcttggatgactttgagt 
ctctgaaagttataggaagaggagcttttggagaggtgcggttggtccagaagaaagata 
caggccatatctatgcaatgaagatattgagaaagtctgatatgcttgaaaaagagcagg 
tggcccatatccgagcagaaagagatattttggtagaagcagatggtgcctgggtggtga 
agatgttttacagttttcaggataagaggaatctttatctaatcatggaatttctccctg 
gaggtgacatgatgacattgctaatgaagaaagacaccttgacagaagaggaaaca cagt 
tctacatttcagagactgttctggcaatagatgcgatccaccagttgggtttcatccatc 
gggatattaagccagacaaccttttattggatgccaagggtcatgtaaaattatctgatt 
ttggtttatgtacgggattaaagaaagctcacaggactgaattttatagaaatctcacac 
acaacccaccaagtgacttctcatttcagaacatgaactcaaagaggaaagcagaaactt 
ggaagaagaacaggagacaactggcatattccacagttgggacaccagattacattgctc 
cagaagtattcatgcagactggttacaacaaattgtgtgactggtggtctttgggagtga 
ttatgtatgaaatgctaataggatatccacctttctgctctgaaacacctcaagaaacat 
acagaaaagtgatgaactggaaagaaactctggtatttcctccagaggtacctatatctg 
agaaagccaaggacttaattctcagattttgtattgattctgaaaacagaattggaaata 
gtggagtagaagaaataaaaggtcatcccttttttgaaggtgtcgactgggagcacataa 
ggga'aaggccagcagcaatccctatagaaatcaaaagcattgatgatacttcaaattttg 
atgacttccctgaatctgatattttacaaccagtgccaaataccacagaaccggactaca 
aatccaaagactgggtttttctcaattatacctataaaaggtttgaagggttgactcaac 
gtggctctatccccacctacatgaaagctgggaagttatgaatgaagataacattcaccc 
ataaccaagagaactcaggtagctgcatcaccaggcttgcttggcgtagataacaataca 
ctgaaatactcctgaagatggtggtgcttattgactacaagaggaaattctacaggatta 
ggatttctaagactactataggaattggttggcagtgccagctggctcttttttttaata 
ttttattatttttgttaactttattatatgaaggtactggaataaaaggaacagacatcc 
ctttctaactgcactgcctacatgcgtattaaggtccattctgcctgtgtgtgctgtggc 
tttgaactgtaacacctctaatcaattcaggagaaacacatatcatttaaagcaacatag 
gctaacctgtaggtaacactgcagtattgatgttttactgcaaatcttatgggtctagat 
aatcagtaaaagccatcttccatagttggtgttagaacattgccctattggtttggacat 
ctgtagaatatatatgaagacaatttctgtaatggttttaagagatttaaaaagaaattc 
actggttctttacaaaatagaatttatcatcaagttattacacaaacttcacagtaagga 
gtgacaagtttataataaggaagacaaagtttaacaccttcactcaagcactccactaat 
atatttacgttgcattcagaaatactgatgaccttcatatacgtagtctgtatactcata 
gggagatgtactgtattatataacatgtaaagttgattttcttgtgacaagagaacttct 
ttttttaacaagaggacatggcattattttaatttgattatggtgagttgaatttaagac 
atgaccatgaaggctgcttgtagaattagtgtatttttattaaactatttttttaaatgt 
caaacttctatcatgtaaatggacttatagagaacaaaaagctatttactttggttttct 
agaaagttgttacatatcatggctggttaacttttatttcttttgatgaaaatttttcct 
ttgatagtacttgtattattgtgccattattttcttatgctccaaatgtaccaaagatct 
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tgaacagagtggatgttcacaactgagtagaattttcctttcctgtgggcatgctgtgtt 
cagacctgacagatctttgatagaggtcagcttattaaagggcaatattgttcttgttta 
gctacatcactgtggtgaatatagatggaattaaggaagtaaatgcaggccagggggttg 
tgatgagaggataggggagataatatcagcatcaaattctttgggtatctctctaagaat 
taaataatcttttctagcttaatattttaattctaattcaaacaactctgaggttttggt 
ttcattagtaatagttgaggaataatatactagcaaagaatggcctaatgtttgtcataa 
ctgttaatggatgaaattttttaaagatacaaccatgataaccattataaatgatctatg 
atcaaaatctaaagtgatgaattatttgtaggaatgtcttcctaatggggaagaattgca 
taggagcattatgcaaatctacacaagcttttataaatgttgctgctgggtagctccaca 
gtgtttcataaggccatcctgtttcccccaactcccccatttttggtttgtttcttttta 
aatatttgttgagtacttacgtgtttatctaacagttcacttccatttttctagtctgga 
ttttttgagtatttaggaaagagagctattaaaaactctggggatttctcaatgtgacta 
actctaatttttctaattataactgcctttaattaacataatattaacttttgctgaggt 
ttatgagattttctcaccccacatcgctcccctttttttaaaaaggactgttttgctagt 
gtgataatgaataggtaagatatgagataattgcaacattgtctagttctagtatggtaa 
ctattcttgaaatggtattgaaaaataccgttaattcaaattgacagagattgataaaaa 
gaaactgatttacctaagtttactttttaattgcataatagagcattttttgttttgagt 
tccctcattcttattaccagaaagagcttgcaaatagttttactttcttggcactggaag 
ggtagttctggaaagctactttgttgagagtctcattcttccctggagttaatagagtga 
ttcacaatctttggggttttctcctcatcaaaagcatttcttaagtgcctatctaaaagc 
aattaaagactgtgtctgccctttagaagctaagaatttgattcatgatgcaaattaact 
agataatttgcaaagtacccttgagattgaattttctctattatatatttcccatatttc 
aggtgaataatttaatttaaatgacaaaaccctatctagtcaactgggcataatgacatt 
ttctttaaattagactctattttgaattaaaagagttttattataaaccgtgtgtttttg 
gtttttctaagtatatagaaagcttgtataattcagatttatcaatttcctgatttaatg 
tagactttgacttttttattaaaaacctttgtattaaagca 

SEQ ID NO: 163. >R14324_T4 # TY Transcript # LN 4282 # Source Gene: R14324 
# Encoded protein: R14324_P3 

ccaccgagaagccgaggaggcaaggctcgcgagagttcagggaggccgccctgagattcc 
ggcgaggccgcgggtcccacctcccgggggcggggcgagggcggagcggggagaagggag 
ctgacgggcgcccggccggctgcggtccgtgcggaggctgagccggccgcgggcgcgacc 
ggaggcagtttccgttactatggcaatgacggcagggactacaacaacctttcctatgag 
caaccatacccgggaaagagtgactgtagccaagctcacattggagaatttttatagcaa 
cctaattttacagcatgaagagagagaaaccagaggagaaaactggaaatcattcaacaa 
gcagcactgaagtgcctaggggcagaagaaattagaagtggccatggaagaagaaggatt 
agcagatgaagagaaaaagttacgtcgatcacaacacgctcgcaaagaaacagagttctt 
acggctcaaaaggaccagacttggcttggatgactttgagtctctgaaagttataggaag 
aggagcttttggagaggtgcggttggtccagaagaaagatacaggccatatctatgcaat 
gaagatattgagaaagtctgatatgcttgaaaaagagcaggtggcccatatccgagcaga 
aagagatattttggtagaagcagatggtgcctgggtggtgaagatgttttacagttttca 
ggataagaggaatctttatctaatcatggaatttctccctggaggtgacatgatgacatt 
gctaatgaagaaagacaccttgacagaagaggaaacacagttctacatttcagagactgt 
tctggcaatagatgcgatccaccagttgggtttcatccatcgggatattaagccagacaa 
ccttttattggatgccaagggtcatgtaaaattatctgattttggtttatgtacgggatt 
aaagaaagctcacaggactgaattttatagaaatctcacacacaacccaccaagtgactt 
ctcatttcagaacatgaactcaaagaggaaagcagaaacttggaagaagaacaggagaca 
actggcatattccacagttgggacaccagattacattgctccagaagtattcatgcagac 
tggttacaacaaattgtgtgactggtggtctttgggagtgattatgtatgaaatgctaat 
aggatatccacctttctgctctgaaacacctcaagaaacatacagaaaagtgatgaactg 
gaaagaaactctggtatttcctccagaggtacctatatctgagaaagccaaggacttaat 
tctcagattttgtattgattctgaaaacagaattggaaatagtggagtagaagaaataaa 
aggtcatcccttttttgaaggtgtcgactgggagcacataagggaaaggccagcagcaat 
ccctatagaaatcaaaagcattgatgatacttcaaattttgatgacttccctgaatctga 
tattttacaaccagtgccaaataccacagaaccggactacaaatccaaagactgggtttt 
tctcaattatacctataaaaggtttgaagggttgactcaacgtggctctatccccaccta 
catgaaagctgggaagttatgaatgaagataacattcacccataaccaagagaactcagg 
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tagctgcatcaccaggcttgcttggcgtagataacaatacactgaaatactcctgaagat 
ggtggtgcttattgactacaagaggaaattctacaggattaggatttctaagactactat 
aggaattggttggcagtgccagctggctcttttttttaatattttattatttttgttaac 
tttattatatgaaggtactggaataaaaggaacagacatccctttctaactgcactgcct 
acatgcgtattaaggtccattctgcctgtgtgtgctgtggctttgaactgtaacacctct 
aatcaattcaggagaaacacatatcatttaaagcaacataggctaacctgtaggtaacac 
tgcagtattgatgttttactgcaaatcttatgggtctagataatcagtaaaagccatctt 
ccatagttggtgttagaacattgccctattggtttggacatctgtaga'atatatatgaag 
acaatttctgtaatggttttaagagatttaaaaagaaattcactggttctttacaaaata 
gaatttatcatcaagttattacacaaacttcacagtaaggagtgacaagtttataataag 
gaagacaaagtttaacaccttcactcaagcactccactaatatatttacgttgcattcag 
aaatactgatgaccttcatatacgtagtctgtatactcatagggagatgtactgtattat 
ataacatgtaaagttgattttcttgtgacaagagaacttctttttttaacaagaggacat 
ggcattattttaatttgattatggtgagttgaatttaagacatgaccatgaaggctgctt 
gtagaattagtgtatttttattaaactatttttttaaatgtcaaacttctatcatgtaaa 
tggacttatagagaacaaaaagctatttactttggttttctagaaagttgttacatatca 
tggctggttaacttttatttcttttgatgaaaatttttcctttgatagtacttgtattat 
tgtgccattattttcttatgctccaaatgtaccaaagatcttgaacagagtggatgttca 
caactgagtagaattttcctttcctgtgggcatgctgtgttcagacctgacagatctttg 
atagaggtcagcttattaaagggcaatattgttcttgtttagctacatcactgtggtgaa 
tatagatggaattaaggaagtaaatgcaggccagggggttgtgatgagaggataggggag 
ataatatcagcatcaaattctttgggtatctctctaagaattaaataatcttttctagct 
taatattttaattctaattcaaacaactctgaggttttggtttcattagtaatagttgag 
gaataatatactagcaaagaatggcctaatgtttgtcataactgttaatggatgaaattt 
tttaaagatacaaccatgataaccattataaatgatctatgatcaaaatctaaagtgatg 
aattatttgtaggaatgtcttcctaatggggaagaattgcataggagcattatgcaaatc 
tacacaagcttttataaatgttgctgctgggtagctccacagtgtttcataaggccatcc 
tgtttcccccaactcccccatttttggtttgtttctttttaaatatttgttgagtactta 
cgtgtttatctaacagttcacttccatttttctagtctggattttttgagtatttaggaa 
agagagctattaaaaactctggggatttctcaatgtgactaactctaatttttctaatta 
taactgcctttaattaacataatattaacttttgctgaggtttatgagattttctcaccc 
cacatcgctcccctttttttaaaaaggactgttttgctagtgtgataatgaataggtaag 
atatgagataattgcaacattgtctagttctagtatggtaactattcttgaaatggtatt 
gaaaaataccgttaattcaaattgacagagattgataaaaagaaactgatttacctaagt 
ttactttttaattgcataatagagcattttttgttttgagttccctcattcttattacca 
gaaagagcttgcaaatagttttactttcttggcactggaagggtagttctggaaagctac 
tttgttgagagtctcattcttccctggagttaatagagtgattcacaatctttggggttt 
tctcctcatcaaaagcatttcttaagtgcctatctaaaagcaattaaagactgtgtctgc 
cctttagaagctaagaatttgattcatgatgcaaattaactagataatttgcaaagtacc 
cttgagattgaattttctctattatatatttcccatatttcaggtgaataatttaattta 
aatgacaaaaccctatctagtcaactgggcataatgacattttctttaaattagactcta 
ttttgaattaaaagagttttattataaaccgtgtgtttttggtttttctaagtatataga 
aagcttgtataattcagatttatcaatttcctgatttaatgtagactttgacttttttat 
taaaaacctttgtattaaagca 

SEQ ID NO: 164. >R14324_T5 # TY Transcript # LN 4183 # Source Gene: R14324 
# Encoded protein: R14324_P4 

ccaccgagaagccgaggaggcaaggctcgcgagagttcagggaggccgccctgagattcc 
ggcgaggccgcgggtcccacctcccgggggcggggcgagggcggagcggggagaagggag 
ctgacgggcgcccggccggctgcggtccgtgcggaggctgagccggccgcgggcgcgacc 
ggaggcagtttccgttactatggcaatgacggcagggactacaacaacctttcctatgag 
caaccatacccgggaaagagtgactgtagccaagctcacattggagaatttttatagcaa 
cctaattttacagcatgaagagagagaaaccaggcagaagaaattagaagtggccatgga 
agaagaaggattagcagatgaagagaaaaagttacgtcgatcacaacacgctcgcaaaga 
aacagagttcttacggctcaaaaggaccagacttggcttggatgactttgagtctctgaa 
agttataggaagaggagcttttggagaggtgcggttggtccagaagaaagatacaggcca 
tatctatgcaatgaagatattgagaaagtctgatatgcttgaaaaagagcaggtggccca 
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tatccgagcagaaagagatattttggtagaagcagatggtgcctgggtggtgaagatgtt 
ttacagttttcaggataagaggaatctttatctaatcatggaatttctccctggaggtga 
catgatgacattgctaatgaagaaagacaccttgacagaagaggaaacacagttctacat 
ttcagagactgttctggcaatagatgcgatccaccagttgggtttcatccatcgggatat 
taagccagacaaccttttattggatgccaagggtcatgtaaaattatctgattttggttt 
atgtacgggattaaagaaagctcacaggactgaattttatagaaatctcacacacaaccc 
accaagtgacttctcatttcagaacatgaactcaaagaggaaagcagaaacttggaagaa 
gaacaggagacaactggcatattccacagttgggacaccagattacattgctccagaagt 
attcatgcagactggttacaacaaattgtgtgactggtggtctttgggagtgattatgta 
tgaaatgctaataggatatccacctttctgctctgaaacacctcaagaaacatacagaaa 
agtgatgaactggaaagaaactctggtatttcctccagaggtacctatatctgagaaagc 
caaggacttaattctcagattttgtattgattctgaaaacagaattggaaatagtggagt 
agaagaaataaaaggtcatcccttttttgaaggtgtcgactgggagcacataagggaaag 
gccagcagcaatccctatagaaatcaaaagcattgtgccaaataccacagaaccggacta 
caaatccaaagactgggtttttctcaattatacctataaaaggtttgaagggttgactca 
acgtggctctatccccacctacatgaaagctgggaagttatgaatgaagataacattcac 
ccataaccaagagaactcaggtagctgcatcaccaggcttgcttggcgtagataacaata 
cactgaaatactcctgaagatggtggtgcttattgactacaagaggaaattctacaggat 
taggatttctaagactactataggaattggttggcagtgccagctggctcttttttttaa 
tattttattatttttgttaactttattatatgaaggtactggaataaaaggaacagacat 
ccctttctaactgcactgcctacatgcgtattaaggtccattctgcctgtgtgtgctgtg 
gctttgaactgtaacacctctaatcaattcaggagaaacacatatcatttaaagcaacat 
aggctaacctgtaggtaacactgcagtattgatgttttactgcaaatcttatgggtctag 
ataatcagtaaaagccatcttccatagttggtgttagaacattgccctattggtttggac 
atctgtagaatatatatgaagacaatttctgtaatggttttaagagatttaaaaagaaat 
tcactggttctttacaaaatagaatttatcatcaagttattacacaaacttcacagtaag 
gagtgacaagtttataataaggaagacaaagtttaacaccttcactcaagcactccacta 
atatatttacgttgcattcagaaatactgatgaccttcatatacgtagtctgtatactca 
tagggagatgtactgtattatataacatgtaaagttgattttcttgtgacaagagaactt 
ctttttttaacaagaggacatggcattattttaatttgattatggtgagttgaatttaag 
acatgaccatgaaggctgcttgtagaattagtgtatttttattaaactatttttttaaat 
gtcaaacttctatcatgtaaatggacttatagagaacaaaaagctatttactttggtttt 
ctagaaagttgttacatatcatggctggttaacttttatttcttttgatgaaaatttttc 
ctttgatagtacttgtattattgtgccattattttcttatgctccaaatgtaccaaagat 
cttgaacagagtggatgttcacaactgagtagaattttcctttcctgtgggcatgctgtg 
ttcagacctgacagatctttgatagaggtcagcttattaaagggcaatattgttcttgtt 
tagctacatcactgtggtgaatatagatggaattaaggaagtaaatgcaggccagggggt 
tgtgatgagaggataggggagataatatcagcatcaaattctttgggtatctctctaaga 
attaaataatcttttctagcttaatattttaattctaattcaaacaactctgaggttttg 
gtttcattagtaatagttgaggaataatatactagcaaagaatggcctaatgtttgtcat 
aactgttaatggatgaaattttttaaagatacaaccatgataaccattataaatgatcta 
tgatcaaaatctaaagtgatgaattatttgtaggaatgtcttcctaatggggaagaattg 
cataggagcattatgcaaatctacacaagcttttataaatgttgctgctgggtagctcca 
cagtgtttcataaggccatcctgtttcccccaactcccccatttttggtttgtttctttt 
taaatatttgttgagtacttacgtgtttatctaacagttcacttccatttttctagtctg 
gattttttgagtatttaggaaagagagctattaaaaactctggggatttctcaatgtgac 
taactctaatttttctaattataactgcctttaattaacataatattaacttttgctgag 
gtttatgagattttctcaccccacatcgctcccctttttttaaaaaggactgttttgcta 
gtgtgataatgaataggtaagatatgagataat'tgcaacattgtctagttctagtatggt 
aactattcttgaaatggtattgaaaaataccgttaattcaaattgacagagattgataaa 
aagaaactgatttacctaagtttactttttaattgcataatagagcattttttgttttga 
gttccct cat tct tat taccagaaagagcttgcaaatagttt tact ttcttggcactgga 
agggtagttctggaaagctactttgttgagagtctcattcttccctggagttaatagagt 
gattcacaatctttggggttttctcctcatcaaaagcatttcttaagtgcctatctaaaa 
gcaattaaagactgtgtctgccctttagaagctaagaatttgattcatgatgcaaattaa 
ctagataatttgcaaagtacccttgagattgaattttctctattatatatttcccatatt 
tcaggtgaataatttaatttaaatgacaaaaccctatctagtcaactgggcataatgaca 
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ttttctttaaattagactctattttgaattaaaagagttttattataaaccgtgtgtttt 
tggtttttctaagtatatagaaagcttgtataattcagatttatcaatttcctgatttaa 
tgtagactttgacttttttattaaaaacctttgtattaaagca 

SEQ ID NO: 165. >R14324_T6 # TY Transcript 8 LN 898 8 Source Gene: R14324 # 
Encoded protein: R14324_P5 

ccaccgagaagccgaggaggcaaggctcgcgagagttcagggaggccgccctgagattcc 
ggcgaggccgcgggtcccacctcccgggggcggggcgagggcggagcggggagaagggag 
ctgacgggcgcccggccggctgcggtccgtgcggaggctgagccggccgcgggcgcgacc 
ggaggcagtttccgttactatggcaatgacggcagggactacaacaacctttcctatgag 
caaccatacccgggaaagagtgactgtagccaagctcacattggagaatttttatagcaa 
cctaattttacagcatgaagagagagaaaccaggtatagtaagggctaatcaaaactttt 
tttagttaaataagctgttgtgctgtgagtggaatagaaacttgtttaccatgaatgata 
tgaatattatggatccctgcatgctgtttttgagtttcctgtactgacagtgcacataga 
catgtgttttattttcacagagtagtttcctgaggtgccccaagtcttgtgcatctatat 
acccgtatgacatgccaattaacgttttaaacaaaccttttccaaacatgaaattatgat 
atgtctttacattttcttttaaagggatgcctggcttggtgtggtgtggctcatgcctat 
aatcccagcactttgggaagccgaggtgggcagatcacctgaggtcaggagttcaagacc 
agcctggccaacatggtgaaaccccatctctactgaaaatacaaaaaattagccgggtgt 
ggtggcgggtgcctgtaatcccagctactcaggaggctgaggcaggagaattgcttgaag 
gcagcagtgagccaaggctgcagtgagccaagatcacaccactgcactccagcctggg 

SEQ ID NO: 166. >R14324_P1 # TY Protein # CC #LN 464 8 Source Gene: R14324 

# Encoding Transcript: 1 

MAMTAGTTTTFPMSNHTRERVTVAKLTLENFYSNLILQHEERETRQKKLEVAMEEEGLAD 
EEKKLRRSQHARKETEFLRLKRTRLGLDDFESLKVIGRGAFGEVRLVQKKDTGHIYAMKI 
LRKS DMLE KEQVAH I RAE RDI L VEADGAWWKMFY S FQDKRNL YL I ME FL PGGDMMTLLM 
KKDTLTEEETQFYISETVLAIDAIHQLGFIHRDIKPDNLLLDAKGHVKLSDFGLCTGLKK 
AHRTEFYRNLTHNPPSDFSFQNMNSKRKAETWKKNRRQLAYSTVGTPDYIAPEVFMQTGY 
NKLCDWWSLGVIMYEMLIGYPPFCSETPQETYRKVMNWKETLVFPPEVPISEKAKDLILR 
FCIDSENRIGNSGVEEIKGHPFFEGVDWEHIRERPAAIPIEIKSIDDTSNFDDFPESDIL 
QPVPNTTE PDYKSKDWVFLN YT YKRFEGLTQRGS I PT YMKAGKL 

SEQ ID NO: 167 . >R14324__P2 # TY Protein 8 CC 8LN 347 # Source Gene: R14324 

# Encoding Transcript: 3 

MKILRKSDMLEKEQVAHIRAERDILVEADGAWVVKMFYSFQDKRNLYLIMEFLPGGDMMT 
LLMKKDTLTEEETQFYISETVLAIDAIHQLGFIHRDIKPDNLLLDAKGHVKLSDFGLCTG 
LKKAHRTEFYRNLTHNPPSDFSFQNMNSKRKAETWKKNRRQLAYSTVGTPDYIAPEVFMQ 
TGYNKLCDWWSLGVIMYEMLIGYPPFCSETPQETYRKVMNWKETLVFPPEVPISEKAKDL 
ILRFCIDSENRIGNSGVEEIKGHPFFEGVDWEHIRERPAAIPIEIKSIDDTSNFDDFPES 
D I LQPVPNTTEPDYKSKDWVFLNYT YKRFEGLTQRGS I PT YMKAGKL 

SEQ ID NO: 168. >R14324_P3 # TY Protein 8 CC 8LN 412 # Source Gene: R14324 

# Encoding Transcript: 4 

MEEEGLADEEKKLRRSQHARKETEFLRLKRTRLGLDDFESLKVIGRGAFGEVRLVQKKDT 
GH I YAMKI LRKS DMLE KEQVAH I RAE RDI L VEADGAWWKMFY S FQDKRNL Y LI ME FLPG 
GDMMTLLMKKDTLTEEETQFYISETVLAIDAIHQLGFIHRDIKPDNLLLDAKGHVKLSDF 
GLCTGLKKAHRTEFYRNLTHNPPSDFSFQNMNSKRKAETWKKNRRQLAYSTVGTPDYIAP 
EVFMQTGYNKLCDWWSLGVIMYEMLIGYPPFCSETPQETYRKVMNWKETLVFPPEVPISE 
KAKDLILRFCIDSENRIGNSGVEEIKGHPFFEGVDWEHIRERPAAIPIEIKSIDDTSNFD 
DFPESDILQPVPNTTEPDYKSKDWVFLNYTYKRFEGLTQRGSIPTYMKAGKL 

SEQ ID NO: 169. >R14324_P4 # TY Protein 8 CC 8LN 447 # Source Gene: R14324 

# Encoding Transcript : 5 

MAMTAGTTTTFPMSNHTRERVTVAKLTLENFYSNLILQHEERETRQKKLEVAMEEEGLAD 
EEKKLRRSQHARKETEFLRLKRTRLGLDDFESLKVIGRGAFGEVRLVQKKDTGHIYAMKI 
LRKSDMLEKEQVAHIRAERDILVEADGAWWKMFYSFQDKRNLYLIMEFL PGGDMMTLLM 



91/365 



Figure 1 (Cont'd) 



KKDTLTEEETQFYISETVLAIDAIHQLGFIHRDIKPDNLLLDAKGHVKLSDFGLCTGLKK 
AHRTEFYRNLTHNPPSDFSFQNMNSKRKAETWKKNRRQLAYSTVGTPDYIAPEVFMQTGY 
NKLCDWWSLGVIMYEMLIGYPPFCSETPQETYRKVMNWKETLVFPPEVPISEKAKDLILR 
FCIDSENRIGNSGVEEIKGHPFFEGVDWEHIRERPAAIPIEIKSIVPNTTEPDYKSKDWV 
FLNYTYKRFEGLTQRGS I PT YMKAGKL 

SEQ ID NO:170. >R14324_P5 # TY Protein # CC #LN 49 # Source Gene: R14324 
# Encoding Transcript: 6 

MAMTAGTTTTFPMSNHTRERVTVAKLTLENFYSNLILQHEERETRYSKG 



SEQ ID NO: 171. >R25184 # TY Consensus # Length 5086 # Number of exons 9 

ctgaccctgggagcgcggggagcgccaagcctcttctagcgctcacctcgcgctcactcc 

tgcgctcgccgggctgcttgtgcggggaaggggcggggcccgcctgggctcctccccggt 

cggagcccggagcaagggggcagagcgcgccgggcgctggttaaagaggcgtgcggaggc 

gggcgcggcgcatactgctagtggcgcgcggaggagcgacgcgtggagaagcggcccacg 

tgtctgcccagagtcaagtcctgtgttcttcccgctccttacgcatccgcggtccagggc 

gccctttcagccccgctggtgttcgcccaccccgggccgcgtgagtggggccccacgcag 

ctccccgcactccgtgggccaacttggccaagcaactctgtccggggagcggtgcttgcg 

gggggtgagtaccgggcactgcgcatgcggagctccaaattcaaacagctgttttcagag 

gctggagggcgggcggactggtagcagctggggctaggagaggctttctctaggaggcgg 

ccgctcgggagccatggtggaccggggccctctgctcacctcggccatcatcttctacct 

ggccatcggggcggcgatcttcgaagtgctggaggagccacactggaaggaggccaagaa 

aaactactacacacagaagctgcatctgctcaaggagttcccgtgcctgggtcaggaggg 

cctggacaagatcctagaggtcagtcgccgcctgtacccccatatacatctgaagctggc 

ttgagctttagggggcgctgaaaggaagaccccttcacagctttgggtgaccctcagtcc 

ctgtggaaggtccagggggtgatgagagagcgtgtctcatcatgggtcaggcttcgagga 

tctgtcgctcacccgagggacacaggggtcccccaggcgcggagactctttggaacttag 

tttggccggagacaaattggagttgattcccggggtggtatctgatgctgcaggacaggg 

tgtggccatcacagggaaccagaccttcaacaactggaactggcccaatgcaatgatttt 

tgcagcgaccgtcattaccaccattggatatggcaatgtggctcccaagacccccgccgg 

tcgcctcttctgtgttttctatggtctcttcggggtgccgctctgcctgacgtggatcag 

tgccctgggcaagttcttcgggggacgtgccaagagactagggcagttccttaccaagag 

aggtgtgagtctgcggaaggcgcagatcacgtgcacagtcatcttcatcgtgtggggcgt 

cctagtccacctggtgatcccacccttcgtattcatggtgactgaggggtggaactacat 

cgagggcctctactactccttcatcaccatctccaccatcggcttcggtgactttgtggc 

cggtgagtccctgcctctgcctcccacctctgtgtgaaccccagcgccaactaccacgcc 

ctgtaccgctacttcgtggagctctggatctacttggggctggcctggctgtcccttttt 

gtcaactggaaggtgagcatgtttgtggaagtccacaaagccattaagaagcggcggcgg 

cgacggaaggagtcctttgagagctccccacactcccggaaggccctgcaggtgaagggg 

agcacagcctccaaggacgtcaacatcttcagctttctttccaagaaggaagagacctac 

aacgacctcatcaagcagatcgggaagaaggccatgaagacaagcgggggtggggagacg 

ggcccgggcccagggctggggcctcaaggcggtgggctcccagcactgcccccttccctg 

gtgcccctggtagtctactccaagaaccgggtgcccaccttggaagaggtgtcacagaca 

ctgaggagcaaaggccacgtatcaaggtccccagatgaggaggctgtggcacgggcccct 

gaagacagctcccctgcccccgaggtgttcatgaaccagctggaccgcatcagcgaggaa 

tgcgagccatgggacgcccaggactaccacccactcatcttccaggacgccagcatcacc 

ttcgtgaacacggaggctggcctctcagacgaggagacctccaagtcctcgctagaggac 

aacttggcaggggaggagagcccccagcagggggctgaagccaaggcgcccctgaacatg 

ggcgagttcccctcctcctccgagtccaccttcaccagcactgagtctgagctctctgtg 

ccttacgaacagctgatgaatgagtacaacaaggctaacagccccaagggcacatgaggc 

agggccggctccccaccccacctttgatggcctcttcccccctcaccctagggtgtcccg 

agatgaccgggacgcctggcccctggtgggggggcagcctcggaactgggagtggggggc 
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caggggccttcctaaccttccatcatcctcagctagatgtatgcccgggacagggcctct 
gttctccagctgaaccataccctggctgtgggggcatctgtcctgagcttggctggtgta 
tctcacaatgcaaagacatgctggctggcgggacaggtgggcaggactgaccctgaggag 
gccttgcctgcagggtctttgtctcaccatttggtggagtatcacacggttctctgaggt 
ctggggcctcagctgtttaagtttaccggtattactgagctcggcatttggagagggagc 
tctgaagtgtctggggaggtaccgctgtgcgtggggtcaggtgtttccgtaccacagcag 
gagcagggcccgcccgcatcccagctgtgggcctgccggtcaggtcgggcacctactaca 
aaccgtagtggggtggaggctgctggaggtgggagtgaggagatgagggcagggtctcaa 
acagtcctgactcacagggcctggaaacaagtcctatgtgggcctggggcctggggtcct 
catcctccttgttggtctactcaggcccagcccagagctgtgttccctgtctcaggtcaa 
gcagtggcagacgcaaggctttctgtgggcccccaagtggtaggagggagagtagcagag 
catgggttactggaagccgggactgctagggctggtggccagggagctgcaagagtgagg 
ctcagctctggctggttctgcccttacccctcctgcccgcctgagaactgcacaccctgc 
ccgctggccccaggacctgcactcccaatcctgctgtcttctccttccctgtgccctgaa 
caaggacctcactgcccgccttcccctcccaccagcccccttgggccaggcagggtgagg 
ccaaattgctcttggcccacaaatgggtgatggtcagatatgtgaatcaagctcctttct 
ctagctagtgtttgatgtgcacgtgtgtgtgcacagtgcgtgtgtgcacacgcacacctg 
tgcactcgtgtgtgtttaagaaaggaaaggatttgggctggggagcaaaagataatgtga 
aactgttggtggactctctggtgaggggtgggcagaacttgctgctactagagttcttgg 
gttctccatgatgttcaccctggggctggcccactgtgtcctgaatgtttttgttatttt 
ttgttttattttttaaacaaactgctgtttttatatacctggaatctgttgttggcttca 
gagccagtggttaaagagcagggtcccaaggattgggagatctagtgtctgccctcctgc 
cctgcaactcaattgggcctttttcggtgacctcatccaaggccatgatgtcaagggcca 
tgtccccaagcagaggtggagaaggggacactgaggtgagcaaaagcaggaaggggcatc 
cactgcgggtgactggaggccgggcaggaagcaagtcatcagagccgctcagctccgttc 
actctctgccttctgccccactactgtggggcagtggggccagagcccacctccccaaca 
tgtgaagacagtgatgggcacgtgcccacacccccacttctctagccgtttgcagaggcc 
gccacccagcaggggcctgaaaaggagcagcctcgtatttttctgtgaaatgttttaatg 
aaccatgttgttgctggttgtcctggcatcgcgcacactgtatgtacatactggcaacga 
tgtcaaatgtaatttattttaacatttttacaataaaacatgaggtggacaggccagctt 
ggtgtgtgctgtgtgtgggtcaggttagagaggggttggcttctggcctctgagcagcag 
gggagcctgggcgcatccacagaagatagagaggactcaggacagactgataagaggatt 
ggatggtctggaggggagaaaagaatgaaaagagatgaagagagcggggagaggcaggcc 
tctgtggggaggtaaggacgcgggaggaacattgcagtagaggtttgagaatcaggaagg 
agtgggaggagcaaggggagaggcggggtgatcatggcaaacgcatgtcactcaccactt 
tgtcaggcactaaatgcttgcactgaataatacgtgagtggactcatttgacctggcaag 
aaccctgagagggcatcagtacatctccattttatagacgaggaaacaggcacagagaag 
ttgccttgtgcaaaatctcagtaaaaatggtgagactagattcaaacccaggcagcccag 
ctcctgtcccctctaccactgtgatgagatgaaaaagactacctgtggttttgtggcaca 
cccaacatctctggaaagggctcctgtctccaggtaggtggaccagagaacttgagcgaa 
gctcaagccttctcaactcaaggggtgggcaggagatgtagagcctgagcgtgggcactg 
tggactggaaagaatgtgaaggaagaatggagccaggctcagccgggcatctgtctccat 
ctgtctaaagggaggctttgtcaggcttcagctgtttggggactggccccttctatgacc 
atttcagggctcagcaccagaaaggatgatttctgagttcattgga 

SEQ ID NO: 172 . >R25184_T1 # TY Transcript # LN 3693 # Source Gene: R25184 
# Encoded protein: R25184_P1 

ctgaccctgggagcgcggggagcgccaagcctcttctagcgctcacctcgcgctcactcc 
tgcgctcgccgggctgcttgtgcggggaaggggcggggcccgcctgggctcctccccggt 
cggagcccggagcaagggggcagagcgcgccgggcgctggttaaagaggcgtgcggaggc 
gggcgcggcgcatactgctagtggcgcgcggaggagcgacgcgtggagaagcggcccacg 
tgtctgcccagagtcaagtcctgtgttcttcccgctccttacgcatccgcggtccagggc 
gccctttcagccccgctggtgttcgcccaccccgggccgcgtgagtggggccccacgcag 
ctccccgcactccgtgggccaacttggccaagcaactctgtccggggagcggtgcttgcg 
gggggtgagtaccgggcactgcgcatgcggagctccaaattcaaacagctgttttcagag 
gctggagggcgggcggactggtagcagctggggctaggagaggctttctctaggaggcgg 
ccgctcgggagccatggtggaccggggccctctgctcacctcggccatcatcttctacct 
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ggccatcggggcggcgatcttcgaagtgctggaggagccacactggaaggaggccaagaa 
aaactactacacacagaagctgcatctgctcaaggagttcccgtgcctgggtcaggaggg 
cctggacaagatcctagaggtggtatctgatgctgcaggacagggtgtggccatcacagg 
gaaccagaccttcaacaactggaactggcccaatgcaatgatttttgcagcgaccgtcat 
taccaccattggatatggcaatgtggctcccaagacccccgccggtcgcctcttctgtgt 
tttctatggtctcttcggggtgccgctctgcctgacgtggatcagtgccctgggcaagtt 
cttcgggggacgtgccaagagactagggcagttccttaccaagagaggtgtgagtctgcg 
gaaggcgcagatcacgtgcacagtcatcttcatcgtgtggggcgtcctagtccacctggt 
gatcccacccttcgtattcatggtgactgaggggtggaactacatcgagggcctctacta 
ctccttcatcaccatctccaccatcggcttcggtgactttgtggccggtgtgaaccccag 
cgccaactaccacgccctgtaccgctacttcgtggagctctggatctacttggggctggc 
ctggctgtccctttttgtcaactggaaggtgagcatgtttgtggaagtccacaaagccat 
taagaagcggcggcggcgacggaaggagtcctttgagagctccccacactcccggaaggc 
cctgcaggtgaaggggagcacagcctccaaggacgtcaacatcttcagctttctttccaa 
gaaggaagagacctacaacgacctcatcaagcagatcgggaagaaggccatgaagacaag 
cgggggtggggagacgggcccgggcccagggctggggcctcaaggcggtgggctcccagc 
actgcccccttccctggtgcccctggtagtctactccaagaaccgggtgcccaccttgga 
agaggtgtcacagacactgaggagcaaaggccacgtatcaaggtccccagatgaggaggc 
tgtggcacgggcccctgaagacagctcccctgcccccgaggtgttcatgaaccagctgga 
ccgcatcagcgaggaatgcgagccatgggacgcccaggactaccacccactcatcttcca 
ggacgccagcatcaccttcgtgaacacggaggctggcctctcagacgaggagacctccaa 
gtcctcgctagaggacaacttggcaggggaggagagcccccagcagggggctgaagccaa 
ggcgcccctgaacatgggcgagttcccctcctcctccgagtccaccttcaccagcactga 
gtctgagctctctgtgccttacgaacagctgatgaatgagtacaacaaggctaacagccc 
caagggcacatgaggcagggccggctccccaccccacctttgatggcctcttcccccctc 
accctagggtgtcccgagatgaccgggacgcctggcccctggtgggggggcagcctcgga 
actgggagtggggggccaggggccttcctaaccttccatcatcctcagctagatgtatgc 
ccgggacagggcctctgttctccagctgaaccataccctggctgtgggggcatctgtcct 
gagcttggctggtgtatctcacaatgcaaagacatgctggctggcgggacaggtgggcag 
gactgaccctgaggaggccttgcctgcagggtctttgtctcaccatttggtggagtatca 
cacggttctctgaggtctggggcctcagctgtttaagtttaccggtattactgagctcgg 
catttggagagggagctctgaagtgtctggggaggtaccgctgtgcgtggggtcaggtgt 
ttccgtaccacagcaggagcagggcccgcccgcatcccagctgtgggcctgccggtcagg 
tcgggcacctactacaaaccgtagtggggtggaggctgctggaggtgggagtgaggagat 
gagggcagggtctcaaacagtcctgactcacagggcctggaaacaagtcctatgtgggcc 
tggggcctggggtcctcatcctccttgttggtctactcaggcccagcccagagctgtgtt 
ccctgtctcaggtcaagcagtggcagacgcaaggctttctgtgggcccccaagtggtagg 
agggagagtagcagagcatgggttactggaagccgggactgctagggctggtggccaggg 
agctgcaagagtgaggctcagctctggctggttctgcccttacccctcctgcccgcctga 
gaactgcacaccctgcccgctggccccaggacctgcactcccaatcctgctgtcttctcc 
ttccctgtgccctgaacaaggacctcactgcccgccttcccctcccaccagcccccctgg 
ggctggcccactgtgtcctgaatgtttttgttattttttgttttattttttaaacaaact 
gctgtttttatatacctggaatctgttgttggcttcagagccagtggttaaagagcaggg 
tcccaaggattgggagatctagtgtctgccctcctgccctgcaactcaattgggcctttt 
tcggtgacctcatccaaggccatgatgtcaagggccatgtccccaagcagaggtggagaa 
ggggacactgaggtgagcaaaagcaggaaggggcatccactgcgggtgactggaggccgg 
gcaggaagcaagtcatcagagccgctcagctccgttcactctctgccttctgccccacta 
ctgtggggcagtggggccagagcccacctccccaacatgtgaagacagtgatgggcacgt 
gcccacacccccacttctctagccgtttgcagaggccgccacccagcaggggcctgaaaa 
ggagcagcctcgtatttttctgtgaaatgttttaatgaaccatgttgttgctggttgtcc 
tggcatcgcgcacactgtatgtacatactggcaacgatgtcaaatgtaatttattttaac 
atttttacaataaaacatgaggtggacaggcca 

SEQ ID NO: 173. >R25184_T2 # TY Transcript # LN 3972 # Source Gene: R25184 
# Encoded protein: R25184_P1 

ctgaccctgggagcgcggggagcgccaagcctcttctagcgctcacctcgcgctcactcc 
tgcgctcgccgggctgcttgtgcggggaaggggcggggcccgcctgggctcctccccggt 
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cggagcccggagcaagggggcagagcgcgccgggcgctggttaaagaggcgtgcggaggc 
gggcgcggcgcatactgctagtggcgcgcggaggagcgacgcgtggagaagcggcccacg 
tgtctgcccagagtcaagtcctgtgttcttcccgctccttacgcatccgcggtccagggc 
gccctttcagccccgctggtgttcgcccaccccgggccgcgtgagtggggccccacgcag 
ctccccgcactccgtgggccaacttggccaagcaactctgtccggggagcggtgcttgcg 
gggggtgagtaccgggcactgcgcatgcggagctccaaattcaaacagctgttttcagag 
gctggagggcgggcggactggtagcagctggggctaggagaggctttctctaggaggcgg 
ccgctcgggagccatggtggaccggggccctctgctcacctcggccatcatcttctacct 
ggccatcggggcggcgatcttcgaagtgctggaggagccacactggaaggaggccaagaa 
aaactactacacacagaagctgcatctgctcaaggagttcccgtgcctgggtcaggaggg 
cctggacaagatcctagaggtggtatctgatgctgcaggacagggtgtggccatcacagg 
gaaccagaccttcaacaactggaactggcccaatgcaatgatttttgcagcgaccgtcat 
taccaccattggatatggcaatgtggctcccaagacccccgccggtcgcctcttctgtgt 
tttctatggtctcttcggggtgccgctctgcctgacgtggatcagtgccctgggcaagtt 
cttcgggggacgtgccaagagactagggcagttccttaccaagagaggtgtgagtctgcg 
gaaggcgcagatcacgtgcacagtcatcttcatcgtgtggggcgtcctagtccacctggt 
gatcccacccttcgtattcatggtgactgaggggtggaactacatcgagggcctctacta 
ctccttcatcaccatctccaccatcggcttcggtgactttgtggccggtgtgaaccccag 
cgccaactaccacgccctgtaccgctacttcgtggagctctggatctacttggggctggc 
ctggctgtccctttttgtcaactggaaggtgagcatgtttgtggaagtccacaaagccat 
taagaagcggcggcggcgacggaaggagtcctttgagagctccccacactcccggaaggc 
cctgcaggtgaaggggagcacagcctccaaggacgtcaacatcttcagctttctttccaa 
gaaggaagagacctacaacgacctcatcaagcagatcgggaagaaggccatgaagacaag 
cgggggtggggagacgggcccgggcccagggctggggcctcaaggcggtgggctcccagc 
actgcccccttccctggtgcccctggtagtctactccaagaaccgggtgcccaccttgga 
agaggtgtcacagacactgaggagcaaaggccacgtatcaaggtccccagatgaggaggc 
tgtggcacgggcccctgaagacagctcccctgcccccgaggtgttcatgaaccagctgga 
ccgcatcagcgaggaatgcgagccatgggacgcccaggactaccacccactcatcttcca 
ggacgccagcatcaccttcgtgaacacggaggctggcctctcagacgaggagacctccaa 
gtcctcgctagaggacaacttggcaggggaggagagcccccagcagggggctgaagccaa 
ggcgcccctgaacatgggcgagttcccctcctcctccgagtccaccttcaccagcactga 
gtctgagctctctgtgccttacgaacagctgatgaatgagtacaacaaggctaacagccc 
caagggcacatgaggcagggccggctccccaccccacctttgatggcctcttcccccctc 
accctagggtgtcccgagatgaccgggacgcctggcccctggtgggggggcagcctcgga 
actgggagtggggggccaggggccttcctaaccttccatcatcctcagctagatgtatgc 
ccgggacagggcctctgttctccagctgaaccataccctggctgtgggggcatctgtcct 
gagcttggctggtgtatctcacaatgcaaagacatgctggctggcgggacaggtgggcag 
gactgaccctgaggaggccttgcctgcagggtctttgtctcaccatttggtggagtatca 
cacggttctctgaggtctggggcctcagctgtttaagtttaccggtattactgagctcgg 
catttggagagggagctctgaagtgtctggggaggtaccgctgtgcgtggggtcaggtgt 
ttccgtaccacagcaggagcagggcccgcccgcatcccagctgtgggcctgccggtcagg 
tcgggcacctactacaaaccgtagtggggtggaggctgctggaggtgggagtgaggagat 
gagggcagggtctcaaacagtcctgactcacagggcctggaaacaagtcctatgtgggcc 
tggggcctggggtcctcatcctccttgttggtctactcaggcccagcccagagctgtgtt 
ccctgtctcaggtcaagcagtggcagacgcaaggctttctgtgggcccccaagtggtagg 
agggagagtagcagagcatgggttactggaagccgggactgctagggctggtggccaggg 
agctgcaagagtgaggctcagctctggctggttctgcccttacccctcctgcccgcctga 
gaactgcacaccctgcccgctggccccaggacctgcactcccaatcctgctgtcttctcc 
ttccctgtgccctgaacaaggacctcactgcccgccttcccctcccaccagcccccttgg 
gccaggcagggtgaggccaaattgctcttggcccacaaatgggtgatggtcagatatgtg 
aatcaagctcctttctctagctagtgtttgatgtgcacgtgtgtgtgcacagtgcgtgtg 
tgcacacgcacacctgtgcactcgtgtgtgtttaagaaaggaaaggatttgggctgggga 
gcaaaagataatgtgaaactgttggtggactctctggtgaggggtgggcagaacttgctg 
ctactagagttcttgggttctccatgatgttcaccctggggctggcccactgtgtcctga 
atgtttttgttattttttgttttattttttaaacaaactgctgtttttatatacctggaa 
tctgttgttggcttcagagccagtggttaaagagcagggtcccaaggattgggagatcta 
gtgtctgccctcctgccctgcaactcaattgggcctttttcggtgacctcatccaaggcc 
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atgatgtcaagggccatgtccccaagcagaggtggagaaggggacactgaggtgagcaaa 
agcaggaaggggcatccactgcgggtgactggaggccgggcaggaagcaagtcatcagag 
ccgctcagctccgttcactctctgccttctgccccactactgtggggcagtggggccaga 
gcccacctccccaacatgtgaagacagtgatgggcacgtgcccacacccccacttctcta 
gccgtttgcagaggccgccacccagcaggggcctgaaaaggagcagcctcgtatttttct 
gtgaaatgttttaatgaaccatgttgttgctggttgtcctggcatcgcgcacactgtatg 
tacatactggcaacgatgtcaaatgtaatttattttaacatttttacaataaaacatgag 
gtggacaggcca 

SEQ ID NO: 174. >R25184_T3 # TY Transcript # LN 4802 ft Source Gene: R25184 
# Encoded protein: R25184_P1 

ctgaccctgggagcgcggggagcgccaagcctcttctagcgctcacctcgcgctcactcc 
tgcgctcgccgggctgcttgtgcggggaaggggcggggcccgcctgggctcctccccggt 
cggagcccggagcaagggggcagagcgcgccgggcgctggttaaagaggcgtgcggaggc 
gggcgcggcgcatactgctagtggcgcgcggaggagcgacgcgtggagaagcggcccacg 
tgtctgcccagagtcaagtcctgtgttcttcccgctccttacgcatccgcggtccagggc 
gccctttcagccccgctggtgttcgcccaccccgggccgcgtgagtggggccccacgcag 
ctccccgcactccgtgggccaacttggccaagcaactctgtccggggagcggtgcttgcg 
gggggtgagtaccgggcactgcgcatgcggagctccaaat.tcaaacagctgttttcagag 
gctggagggcgggcggactggtagcagctggggctaggagaggctttctctaggaggcgg 
ccgctcgggagccatggtggaccggggccctctgctcacctcggccatcatcttctacct 
ggccatcggggcggcgatcttcgaagtgctggaggagccacactggaaggaggccaagaa 
aaactactacacacagaagctgcatctgctcaaggagttcccgtgcctgggtcaggaggg 
cctggacaagatcctagaggtggtatctgatgctgcaggacagggtgtggccatcacagg 
gaaccagaccttcaacaactggaactggcccaatgcaatgatttttgcagcgaccgtcat 
taccaccattggatatggcaatgtggctcccaagacccccgccggtcgcctcttctgtgt 
tttctatggtctcttcggggtgccgctctgcctgacgtggatcagtgccctgggcaagtt 
cttcgggggacgtgccaagagactagggcagttccttaccaagagaggtgtgagtctgcg 
gaaggcgcagatcacgtgcacagtcatcttcatcgtgtggggcgtcctagtccacctggt 
gatcccacccttcgtattcatggtgactgaggggtggaactacatcgagggcctctacta 
ctccttcatcaccatctccaccatcggcttcggtgactttgtggccggtgtgaaccccag 
cgccaactaccacgccctgtaccgctacttcgtggagctctggatctacttggggctggc 
ctggctgtccctttttgtcaactggaaggtgagcatgtttgtggaagtccacaaagccat 
taagaagcggcggcggcgacggaaggagtcctttgagagctccccacactcccggaaggc 
cctgcaggtgaaggggagcacagcctccaaggacgtcaacatcttcagctttctttccaa 
gaaggaagagacctacaacgacctcatcaagcagatcgggaagaaggccatgaagacaag 
cgggggtggggagacgggcccgggcccagggctggggcctcaaggcggtgggctcccagc 
actgcccccttccctggtgcccctggtagtctactccaagaaccgggtgcccaccttgga 
agaggtgtcacagacactgaggagcaaaggccacgtatcaaggtccccagatgaggaggc 
tgtggcacgggcccctgaagacagctcccctgcccccgaggtgttcatgaaccagctgga 
ccgcatcagcgaggaatgcgagccatgggacgcccaggactaccacccactcatcttcca 
ggacgccagcatcaccttcgtgaacacggaggctggcctctcagacgaggagacctccaa 
gtcctcgctagaggacaacttggcaggggaggagagcccccagcagggggctgaagccaa 
ggcgcccctgaacatgggcgagttcccctcctcctccgagtccaccttcaccagcactga 
gtctgagctctctgtgccttacgaacagctgatgaatgagtacaacaaggctaacagccc 
caagggcacatgaggcagggccggctccccaccccacctttgatggcctcttcccccctc 
accctagggtgtcccgagatgaccgggacgcctggcccctggtgggggggcagcctcgga 
actgggagtggggggccaggggccttcctaaccttccatcatcctcagctagatgtatgc 
ccgggacagggcctctgttctccagctgaaccataccctggctgtgggggcatctgtcct 
gagcttggctggtgtatctcacaatgcaaagacatgctggctggcgggacaggtgggcag 
gactgaccctgaggaggccttgcctgcagggtctttgtctcaccatttggtggagtatca 
cacggttctctgaggtctggggcctcagctgtttaagtttaccggtattactgagctcgg 
catttggagagggagctctgaagtgtctggggaggtaccgctgtgcgtggggtcaggtgt 
ttccgtaccacagcaggagcagggcccgcccgcatcccagctgtgggcctgccggtcagg 
tcgggcacctactacaaaccgtagtggggtggaggctgctggaggtgggagtgaggagat 
gagggcagggtctcaaacagtcctgactcacagggcctggaaacaagtcctatgtgggcc 
tggggcctggggtcctcatcctccttgttggtctactcaggcccagcccagagctgtgtt 
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ccctgtctcaggtcaagcagtggcagacgcaaggctttctgtgggcccccaagtggtagg 
agggagagtagcagagcatgggttactggaagccgggactgctagggctggtggccaggg 
agctgcaagagtgaggctcagctctggctggttctgcccttacccctcctgcccgcctga 
gaactgcacaccctgcccgctggccccaggacctgcactcccaatcctgctgtcttctcc 
ttccctgtgccctgaacaaggacctcactgcccgccttcccctcccaccagcccccttgg 
gccaggcagggtgaggccaaattgctcttggcccacaaatgggtgatggtcagatatgtg 
aatcaagctcctttctctagctagtgtttgatgtgcacgtgtgtgtgcacagtgcgtgtg 
tgcacacgcacacctgtgcactcgtgtgtgtttaagaaaggaaaggatttgggctgggga 
gcaaaagataatgtgaaactgttggtggactctctggtgaggggtgggcagaacttgctg 
ctactagagttcttgggttctccatgatgttcaccctggggctggcccactgtgtcctga 
atgtttttgttattttttgttttattttttaaacaaactgctgtttttatatacctggaa 
tctgttgttggcttcagagccagtggttaaagagcagggtcccaaggattgggagatcta 
gtgtctgccctcctgccctgcaactcaattgggcctttttcggtgacctcatccaaggcc 
atgatgtcaagggccatgtccccaagcagaggtggagaaggggacactgaggtgagcaaa 
agcaggaaggggcatccactgcgggtgactggaggccgggcaggaagcaagtcatcagag 
ccgctcagctccgttcactctctgccttctgccccactactgtggggcagtggggccaga 
gcccacctccccaacatgtgaagacagtgatgggcacgtgcccacacccccacttctcta 
gccgtttgcagaggccgccacccagcaggggcctgaaaaggagcagcctcgtatttttct 
gtgaaatgttttaatgaaccatgttgttgctggttgtcctggcatcgcgcacactgtatg 
tacatactggcaacgatgtcaaatgtaatttattttaacatttttacaataaaacatgag 
gtggacaggccagcttggtgtgtgctgtgtgtgggtcaggttagagaggggttggcttct 
ggcctctgagcagcaggggagcctgggcgcatccacagaagatagagaggactcaggaca 
gactgataagaggattggatggtctggaggggagaaaagaatgaaaagagatgaagagag 
cggggagaggcaggcctctgtggggaggtaaggacgcgggaggaacattgcagtagaggt 
ttgagaatcaggaaggagtgggaggagcaaggggagaggcggggtgatcatggcaaacgc 
atgtcactcaccactttgtcaggcactaaatgcttgcactgaataatacgtgagtggact 
catttgacctggcaagaaccctgagagggcatcagtacatctccattttatagacgagga 
aacaggcacagagaagttgccttgtgcaaaatctcagtaaaaatggtgagactagattca 
aacccaggcagcccagctcctgtcccctctaccactgtgatgagatgaaaaagactacct 
gtggttttgtggcacacccaacatctctggaaagggctcctgtctccaggtaggtggacc 
agagaacttgagcgaagctcaagccttctcaactcaaggggtgggcaggagatgtagagc 
ctgagcgtgggcactgtggactggaaagaatgtgaaggaagaatggagccaggctcagcc 
gggcatctgtctccatctgtctaaagggaggctttgtcaggcttcagctgtttggggact 
ggccccttctatgaccatttcagggctcagcaccagaaaggatgatttctgagttcattg 
ga 

SEQ ID NO: 175. >R25184JT4 # TY Transcript # LN 1216 # Source Gene: R25184 
# Encoded protein: R25184_P2 

ctgaccctgggagcgcggggagcgccaagcctcttctagcgctcacctcgcgctcactcc 
tgcgctcgccgggctgcttgtgcggggaaggggcggggcccgcctgggctcctccccggt 
cggagcccggagcaagggggcagagcgcgccgggcgctggttaaagaggcgtgcggaggc 
gggcgcggcgcatactgctagtggcgcgcggaggagcgacgcgtggagaagcggcccacg 
tgtctgcccagagtcaagtcctgtgttcttcccgctccttacgcatccgcggtccagggc 
gccctttcagccccgctggtgttcgcccaccccgggccgcgtgagtggggccccacgcag 
ctccccgcactccgtgggccaacttggccaagcaactctgtccggggagcggtgcttgcg 
gggggtgagtaccgggcactgcgcatgcggagctccaaattcaaacagctgttttcagag 
gctggagggcgggcggactggtagcagctggggctaggagaggctttctctaggaggcgg 
ccgctcgggagccatggtggaccggggccctctgctcacctcggccatcatcttctacct 
ggccatcggggcggcgatcttcgaagtgctggaggagccacactggaaggaggccaagaa 
aaactactacacacagaagctgcatctgctcaaggagttcccgtgcctgggtcaggaggg 
cctggacaagatcctagaggtggtatctgatgctgcaggacagggtgtggccatcacagg 
gaaccagaccttcaacaactggaactggcccaatgcaatgatttttgcagcgaccgtcat 
taccaccattggatatggcaatgtggctcccaagacccccgccggtcgcctcttctgtgt 
tttctatggtctcttcggggtgccgctctgcctgacgtggatcagtgccctgggcaagtt 
cttcgggggacgtgccaagagactagggcagttccttaccaagagaggtgtgagtctgcg 
gaaggcgcagatcacgtgcacagtcatcttcatcgtgtggggcgtcctagtccacctggt 
gatcccacccttcgtattcatggtgactgaggggtggaactacatcgagggcctctacta 
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ctccttcatcaccatctccaccatcggcttcggtgactttgtggccggtgagtccctgcc 
tctgcctcccacctct 

SEQ ID NO: 176. >R25184_T5 # TY Transcript # LN 994 # Source Gene: R25184 # 
Encoded protein: R25184_P3 

ctgaccctgggagcgcggggagcgccaagcctcttctagcgctcacctcgcgctcactcc 
tgcgctcgccgggctgcttgtgcggggaaggggcggggcccgcctgggctcctccccggt 
cggagcccggagcaagggggcagagcgcgccgggcgctggttaaagaggcgtgcggaggc 
gggcgcggcgcatactgctagtggcgcgcggaggagcgacgcgtggagaagcggcccacg 
tgtctgcccagagtcaagtcctgtgttcttcccgctccttacgcatccgcggtccagggc 
gccctttcagccccgctggtgttcgcccaccccgggccgcgtgagtggggccccacgcag 
ctccccgcactccgtgggccaacttggccaagcaactctgtccggggagcggtgcttgcg 
gggggtgagtaccgggcactgcgcatgcggagctccaaattcaaacagctgttttcagag 
gctggagggcgggcggactggtagcagctggggctaggagaggctttctctaggaggcgg 
ccgctcgggagccatggtggaccggggccctctgctcacctcggccatcatcttctacct 
ggccatcggggcggcgatcttcgaagtgctggaggagccacactggaaggaggccaagaa 
aaactactacacacagaagctgcatctgctcaaggagttcccgtgcctgggtcaggaggg 
cctggacaagatcctagaggtcagtcgccgcctgtacccccatatacatctgaagctggc 
ttgagctttagggggcgctgaaaggaagaccccttcacagctttgggtgaccctcagtcc 
ctgtggaaggtccagggggtgatgagagagcgtgtctcatcatgggtcaggcttcgagga 
tctgtcgctcacccgagggacacaggggtcccccaggcgcggagactctttggaacttag 
tttggccggagacaaattggagttgattcccggg 

SEQ ID NO: 177 . >R25184_P1 # TY Protein # CC #LN 4 99 # Source Gene: R25184 

# Encoding Transcript: 1 

MVDRGPLLTSAI I FYLAIGAAI FEVLEEPHWKEAKKNY YTQKLHLLKE FPCLGQEGLDKI 
LEWS DAAGQG VA I TGNQT FNN WNW PNAM I FAAT V I TT I G YGNVAPKT PAGRLFC VFYGL 
FGVPLCLTWISALGKFFGGRAKRLGQFLTKRGVSLRKAQITCTVIFIVWGVLVHLVIPPF 
VFMVTEGWNYIEGLYYSFITISTIGFGDFVAGVNPSANYHALYRYFVELWIYLGLAWLSL 
FVNWKVSMFVEVHKAIKKRRRRRKESFESSPHSRKALQVKGSTASKDVNIFSFLSKKEET 
YNDLIKQIGKKAMKTSGGGETGPGPGLGPQGGGLPALPPSLVPLVVYSKNRVPTLEEVSQ 
TLRSKGHVSRSPDEEAVARAPEDSSPAPEVFMNQLDRISEECEPWDAQDYHPLIFQDASI 
TFVNTEAGLSDEETSKSSLEDNLAGEESPQQGAEAKAPLNMGEFPSSSESTFTSTESELS 
VPYEQLMNEYNKANSPKGT 

SEQ ID NO: 178 - >R25184_P2 # TY Protein # CC #LN 221 # Source Gene: R25184 

# Encoding Transcript: 4 

MVDRGPLLTSAI I FYLAIGAAI FEVLEEPHWKEAKKNY YTQKLHLLKE FPCLGQEGLDKI 
LE WSDAAGQG VAI TGNQT FNNWNW PNAM I FAAT V I TTIG YGNVAPKT PAGRLFC VFYGL 
FGVPLCLTWISALGKFFGGRAKRLGQFLTKRGVSLRKAQITCTVIFIVWGVLVHLVIPPF 
VFMVTEGWNYIEGLYYSFITISTIGFGDFVAGESLPLPPTS 

SEQ ID NO: 179. >R25184_P3 # TY Protein # CC #LN 76 # Source Gene: R25184 

# Encoding Transcript: 5 

MVDRGPLLTSAI I FYLAIGAAI FEVLEEPHWKEAKKNY YTQKLHLLKE FPCLGQEGLDKI 
LEVSRRLYPHIHLKLA 



SEQ ID NO: 180. >T08090 # TY Consensus # Length 9500 # Number of exons 58 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 

tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 

agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 

ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
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aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgagcccccacacccc 
cgccccccccccccccccacctcaccccccacacccccgcccccccaccacccccaggcc 
ancgaagccctggcgtcgcgcaattcgcgcaccaagtagcgaggcctgggtggcgattca 
aacgcacgagcggcacgagctcgtgscgattcggcacggtaagtacttccctaaggggcg 
gggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtctcgcgcgcgg 
gcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaagaggtggccccg 
gatgctgcggcgcccgctggccgggctggctgcggccgccctgggccgggccccaccgga 
cggtgagtacgacaagcgcgatcgcgagggtgactcgggtcgaggcggcagcggtccctg 
cgctttgcggtcgcccagtccacgccgcctccggatctcctcctagccgagacgccctgt 
gccgcattcaactgtgggccctgagaacgggggagctctggagggcggacgggctggcaa 
cgcagcggagcccgggtcccggagggcggggcgctcctcaggagctgggcaggccggcag 
ttatcgcagtgtcgagctgctccccgggcgtataggagcgctcggtgtcttcccgacacc 
tgggcgcgccgccaaagagagcagctgggagtccctaccgctggcccggcgcggtgtgcg 
ctccggttcccacctcaccctgcaagccttggggtcctgggcggatggagccggctattc 
cggggctcccaggattctgcctactccgcggcacggcgcagagacggggacttggggaaa 
cagcaggccttcccgacgcatgggtgtggcatttggggttcggattctccaaagccagag 
tccgggttggggttcttgaagctgccctgctgccattcttgagcgtcgccttagaagtat 
gccccagtaaggactggtgtgggaagcaggtgcctcaaagttgattttgaccctgcgtgt 
ggtgggcctcggtttcgtctctgcatgaaagaggtcggtcctgaggccccgggctgggtc 
ctgctctgctgtctttgggctcatggccccttcctgtctcaggcttgctctgctctttac 
ctggggttgctgtcgaggaccctgtgcaagactcggccggtttttctttctccctgatgg 
acagacccaaacatagccgcgcagcatcgtgaagggctggggccttcactcctctgtggc 
tctggaagagcccgatttcctcaggagggtaaaggaatgttcctgtcccccccggggaaa 
gaaagtgaaaatgaaagcgccctggctgctcctggtgtggtcccatttaaaacgaggttg 
ctgaaggccaggatgcgtgctacccggatgtgtgcccagaaaccctgcagccaagagcag 
ccgcctggaccaccagcctccccttcatccacctggcccagcccagagcccaggcccacg 
tctggtggtttgtgtcttacgagaggccaggcgggggctgaattctctcggttctcagaa 
ggtcaagccctctgttagcatctcctcgaacagtgggccgggggcaaggtggtgactctg 
ttcagtccttaggatggctggaagccagaaggaggagattatgccgccacagcagggagt 
gcctttccaggaaagcggctgctggggcagcatgggagcccagtccagggagcgttctgg 
cagagaccattccggaacatggatctgcgccggcggctcccaggagaccttccatctcgg 
agctcctgtggagaccacgtgcctggcagagantcttangatggcnggaacaaaangang 
agattatgcngccacancagggaatgcctttncangaaaacagctgaangggcaaatggg 
ancacaatccanggaacgttctngcanaaaccattcnngaacanggatcngcgcnggcng 
ctcccaagagaccttccatctngganctcctgtngagaccacgtgcctnncaaatggttg 
gctggcacgtagaggaggtggcagggctgggtgagcagcaggagccaggttagtggggag 
gcccgagctaagcgcaggggcaaattctgcgggtgggagagccaggagggcatggggtgc 
atggtgagatgaaaacgatgtgtgcttgtgacaaaaagcccagaagaggcagtaaggagg 
gaggagtctatcccttatggagagcacatggacttctgggtgggtgtcctgcagttccct 
tcctcttgagtctttttaactgggggtccggtgtgtttgtgatcaatgacatgtatgcgt 
gtgcatgtgtacctttgcaggtacacatgtgtggtgtgtgtgcatttgcttggtgtgcgt 
gcatttgtgtgtgtgttttgtgtgtgcatgcatgtgtgtgtgtttgtggtaggtttgtgc 
acacatcctgcatacctttgtgcctgggagtggcatgtgtgtggtgggtgtcgctgtgca 
tgatggagttgcagccgtggtgcatttgtgagattacctcggtggcctaaattggggatt 
aggagggcatcctcaggtccttcccactgtctgctcgtctgcccacaggcggctgtgccc 
taaacaggaggaggccattcacgcctcgcctgagttgtgtccaaggtgtgcgtgtggcca 
ggggtccatccgcttccctctagcccagcccctgaacacagctgcagtgcacggccccac 
tcctcagctctgctccccatcccaactcgaagacgctgccctggccctgtgtgtgcagct 
catgtggactgggagggcagggcaggtgcaggtcttggggcaagagctggagctgtcttt 
tccttcctgcacagccgcagagcaggtggatggggctgcttccctgcaagggcccagggc 
caggccccctggggatttattcgtggcttagaagggtggggccagaagcaggcgtagtgg 
ggattagggactcagcacccccagctctcagtccagcagacagacccaccccaggctgac 
tacagaggctgcacctcagcaaacagcatcacagataggagtcccaagtcctatgagaaq 
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Figure 1 (Cont'd) 



ttcctggaataggtgtagattcagtagatctttacaagaccatatctgcagggcaaggta 
ccagaggacagaggcggggacagggacacttccattccagacctagcagcccagcactca 
gcaccatgcatgggagcaaatggctggactcctgggtggggtgggggtctcagagcaggc 
tcccagagggcttggaggtgactccaccaggtggggacggcagctcccaggtagggtgtc 
atcagagtagacagcattgcttgctagggacccctggggaggctgacagggtcagtgggt 
ttcagttggggggctcccctgctgagaacccagtaaagccggccttccattcgtctcccg 
tgtgcccagagcctggtctgagggccgccctgtgcatgccggcccttccaacgtggcaga 
gctcagggggaagaacacccaggctctcaggagactctcaggccaatgtctccatccctg 
ggtcagccctttcctgccatgaattcaggaaggcagaggcagctcagcagatggggacta 
gaggccgcactgctatccacagcctctcttctcacccccagaccaggcgatcttgctgcc 
tctcatcatcggtgtgatccttcttggcctcctcgctgtggcgcttattgctttctgcat 
cgtccggacacacccatccggctgaccaagccctctgagcatatgcttcaagacccactg 
cactaaagggtgcgccgtgcatgtcgggccccaggcctgtggtgctgagcgggccttcgg 
gagctgggaagagcaccctgctgaagaggctgctccaggagcacagcggcatctttggct 
tcagcgtgtcccgtgagtccagggctctcgtggaggggtgcgtagacctcaaggctgctg 
agtagtcctagcaccgtgagcaggccaggagcccaaacccaacaggcacacccaccctgc 
agactgtccgaactcttgcacactcccccccacacagaacctgaggttatcacactcctg 
ctgtcctgcgtgcctgtgtctcccttccctgggtctgttgagtactgataactgggccac 
agtgtttctttctgggagaaccctcgccttgtaggctcctgcgccttcccagtggtgtgc 
ttcactggctgcctgcatcctggggctcaagtgctgtcgggactgcaagggaaacgctgg 
gtggggcattgggctccgagcagcccccgatgggtgacaggtctctctgctagataccac 
gaggaacccgaggcccggcgaggagaacggcaaaggtgagtggggtggggccctatggct 
ggagcacccccagtgtgggcagggctgctgggccctgcagctgtgttggctgtgctgccc 
gtctcctgcccccatcaatccctaatctgtgagatgggtccttgcctccaagggccggtg 
aactcaatcagggtgtcagcgccacagcgtggtgtcgccttccttgggtacagtgtgaga 
ggccggccaaggcctggggctgtcttcctcccaccttggaggcggccacagtgctgctgt 
ccccagccctgtcctggactcggcacttatcagcacttttgagctgtcttctggggtcct 
ggtaaaaagggctactctgcctgcctgattcaagacaagggacccccttcccaacagcac 
ccccgccccttgccgtgcaacccagtggtctccagtcaccccaccacatcgtcccctctg 
taacctgacggtctccagttcccccaccaccttcccccagaacctgttggtctccagtcc 
ccaccccatcaccaccaactcccaactccccactggaacccagcagtctcgtatctccat 
cagtgaggacagtgtgagaaatggtgtctggctcaggcacttggcagcacttgaggggtc 
ctcagatgtctcctgcccagcaaggctgactaaagcagtcatgggtgtgggaggggcctg 
caggcatgcctgggttgggggcagctggccctgggcaccctggtgcaggtccagtctgcc 
ctctggatggcccctcctcttccccagattagctgagagcaagatgggtcaccagcagct 
gtactggagccacccgcgaaaattcggccagggttctcgctcttgtcgtgtctgttcaaa 
ccggcacggtctgatccggaaatatggcctcaatatgtgccgccagtgtttccgtcagta 
cgcgaaggatatcggtttcattaagttggactaaatgctcttccttcagaggattatccg 
gggcatctactcaatgaaaaaccatgataattctttgtatataaaataaacatttgaaaa 
aaccaaaaaaaaaaaaaaaaaaaaattactactttgtaaccagggaggtgatgcagcgtg 
acatagcagccggcgacttcatcgagcatgccgagttctcggggaacctgtatggcacga 
gctgttccctgtaggcggggcagggcgtgggggtagcaggtttgagatgctgtcgggtgc 
tgggtccaggccaggcctaggctcagctgtgggaggagaacgctgggcccgggagggcct 
gggtgtccctgaagctcctgtaggcctcagagagccctggcacccctgctgacctggcac 
ctctccccagcaaggtggcggtgcaggccgtgcaggccatgaacgtccggaattctccgg 
atctggtcctgcgtaccctgacctcccgtccttccctttctaattgagaggcccgcctct 
tcctactatacctcttcctgtgtgcctcaactgaggcctcgtgcctgaccattctaggct 
actcctctcccttgcctgggcctgtgtgaacccgtctcgctggattttagagccttggga 
cggcgtgcatagacatatatcccttcagagatacgtacaagctccgcatctgtgtgctgg 
acgtggacctgcagggtgtgcggaacatcaaggccaccgatctgcggcccatctacatct 
ctgtgcagccgccttcactgcacgtgctggtgtgtgctgggcagggttgggggctggggg 
ccagggcatgccaggctctgattgccaccccctttttaggagcagcggctgcggcagcgc 
aacactgaaaccgaggagagcctggtgaagcggctggctgctgcccaggccgacatggag 
agcagtgagtgtgccgtgggatcaccagggaatgccaggaggggagtcagggttctgagg 
tctgtggcaccagggaccctgtgggtccccagacctcctgacacctggagtccctgtgag 
ggtcctcagacctctcaactacctcccaacacctagagtccccgtgagggtccccagaac 
ccacccccagtcaccaagggtctcattgagggtcctcagatttccctctgttacccagag 
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Figure 1 (Cont'd) 



tctccgtgagggcccccagaccccccatcgcccagggtcccatgagatgtccccaacctt 
ctagccccgggggtgtcatgtgcatcctcttacagctgttgcctcttctctgggtctgac 
tgcagcccacaagaatgtgtgtgtagaggatagtgtagcccctaaccagagtcctgatgg 
gtgctggtgtccagacccaagttctggggctccagagagagcaggagtggtgcctgagga 
ctgaggcccagggggcggcccttccctaccctgcacaggcccggctgggctggaaagctg 
tcccacagccgcagtgaggacagccgcaggccagtgggctgctctgggggtcgtgtggga 
cctggggtggggctgcatgggctcactgtgccctgaccccaggccccacccacaggcaag 
gagcccggcctgtttgatgtggtcatcattaacgacagcctggaccaggcctacgcagag 
ctgaaggaggcgctctctgaggtgggcccatccttgtgcctacctgggcaaggcccaagg 
ggaggcctgggggccaggcctttgttgtccatgaggccactgaggttagatgggacagtc 
ctacccaagcactggcatgagacaccgaggtccacggtggagggagagcaggaagcccag 
cccttcctggataccagccctcccaactccctttcttcctcactggcaggaaatcaagaa 
agctcaaaggaccggcgcctgaggcttgctgtctgttctcggcaccccgggcccatacag 
gaccagggcagcagcattgagccacccccttggcaggcgatacggcagctctgtgccctt 
ggccagcgggggggtttttttgggggcgccccnccccccccccccctttttttttggggg 
gccccccccaaggganccccnaatgtggagtggaggagatgctgcccctgtggttggaac 
atcctggggtgacccccgacccagcctcgctgggctgtcccctgtccctatctctcactc 
tggacccagggctgacatcctaataaaataactgttggattagaaactccaaaaaaaraa 
caaraaaaaaaaacttgggcaaaccttggtcattcccccgcttcctaaaatcttccagag 
gcattctttggccgggtggccggaaaaaaaaaggggaaatgaaaaagggcccccaggggg 
ccaataaattgtgttttcctggaccccaggtttttaaaaaacgcgaacgagatcgcggcc 
cagcgaggcaacactcgaggcatctttggtccggtgcttgacaaaaaggcattgaaccgc 
ccacagggatcatgctagtctccgaaacccaggttatctttgaacaaacaggtaaaaaag 
cgggggcgaaccccgaatgggagcggcgcggtgcacctaccaaggaacgccgacgcgccg 
gcgcggcttgcagggagtggcccgccagcgaccctacaagatgcaccgcccctccaaaac 
aaccggncgctgcattcatccgcgtccggggaaaggcccacgatgcgcccatcgtcgacc 
agttagaaaaactacacgcgcgctaatacacaagaagacaagctccctctcgtaaaatcg 
actcaccgcttctctgtcacactgccccggcgtaaccgtccggctatgctcgcacccggg 
acasgggggattkragaaaaaccgctgccgcgccccgccccagcctccccgggcattccac 
tttcgcgcggccgggcaaaaaggcggtacactgcaccgggcccactgggggtcacacccc 
ctggttccgtcgcgacccactcttgtttttagcccccacaggaggcgggcaaatcattgc 
acggtgtcatcgccgcaatgaggaccctgtgcaagactcggccggtttttctttctccct 
gatggacagacccaaacatagccgcgcagcatcgtgaagggctggggccttcactcctct 
gtggctctggaagagcccgatttcctcaggaggcatgtcgggccccaggcctgtggtgct 
gagcgggccttcgggagctgggaagagcaccctgctgaagaggctgctccaggagcacag 
cggcatctttggcttcagcgtgtcccgtgagtccagggctctcgtggagggataccacga 
ggaacccgaggcccggcgaggagaacggcaaaaaaaaaaaaaaaaaaaaatacaaaaaaa 
aaaaaaaaaaaaaaaagrgggggcgaaaaaaaataacacctcccgggcttttatgtggcg 
aggtggcggcgaacaaaaggcatgtgaataggccgctggggcgtacaaagctgatttctc 
ggacacgtgcttttcttctgggacagaagacgaaaaagatcttcgcgcggctctccccta 
taacggagcgcggatatatc 

SEQ ID NO: 181. >T08090_T1 # TY Transcript # LN 1584 # Source Gene: T08090 

# Encoded protein: T08090_P25 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggcatgtcgggccccaggcctgtggtgctgagcgggccttcgggagctg 
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Figure 1 (Cont'd) 



ggaagagcaccctgctgaagaggctgctccaggagcacagcggcatctttggcttcagcg 
tgtcccataccacgaggaacccgaggcccggcgaggagaacggcaaagattactactttg 
taaccagggaggtgatgcagcgtgacatagcagccggcgacttcatcgagcatgccgagt 
tctcggggaacctgtatggcacgagcaaggtggcggtgcaggccgtgcaggccatgaacc 
gcatctgtgtgctggacgtggacctgcagggtgtgcggaacatcaaggccaccgatctgc 
ggcccatctacatctctgtgcagccgccttcactgcacgtgctggagcagcggctgcggc 
agcgcaacactgaaaccgaggagagcctggtgaagcggctggctgctgcccaggccgaca 
tggagagcagcaaggagcccggcctgtttgatgtggtcatcattaacgacagcctggacc 
aggcctacgcagagctgaaggaggcgctctctgaggaaatcaagaaagctcaaaggaccg 
gcgcctgaggcttgctgtctgttctcggcaccccgggcccatacaggaccagggcagcag 
cattgagccacccccttggcaggcgatacggcagctctgtgcccttggccagcatgtgga 
gtggaggagatgctgcccctgtggttggaacatcctggggtgacccccgacccagcctcg 
ctgggctgtcccctgtccctatctctcactctggacccagggctgacatcctaataaaat 
aactgttggattagaaactccaaa 

SEQ ID NO: 182. >T08090_T2 # TY Transcript # LN 958 # Source Gene: T08090 # 
Encoded protein: T08090_P25 

agtagcgaggcctgggtggcgattcaaacgaagaggtggccccggatgctgcggcgcccg 
ctggccgggctggctgcggccgccctgggccgggccccaccggacggcatgtcgggcccc 
aggcctgtggtgctgagcgggccttcgggagctgggaagagcaccctgctgaagaggctg 
ctccaggagcacagcggcatctttggcttcagcgtgtcccataccacgaggaacccgagg 
cccggcgaggagaacggcaaagattactactttgtaaccagggaggtgatgcagcgtgac 
atagcagccggcgacttcatcgagcatgccgagttctcggggaacctgtatggcacgagc 
aaggtggcggtgcaggccgtgcaggccatgaaccgcatctgtgtgctggacgtggacctg 
cagggtgtgcggaacatcaaggccaccgatctgcggcccatctacatctctgtgcagccg 
ccttcactgcacgtgctggagcagcggctgcggcagcgcaacactgaaaccgaggagagc 
ctggtgaagcggctggctgctgcccaggccgacatggagagcagcaaggagcccggcctg 
tttgatgtggtcatcattaacgacagcctggaccaggcctacgcagagctgaaggaggcg 
ctctctgaggaaatcaagaaagctcaaaggaccggcgcctgaggcttgctgtctgttctc 
ggcaccccgggcccatacaggaccagggcagcagcattgagccacccccttggcaggcga 
tacggcagctctgtgcccttggccagcatgtggagtggaggagatgctgcccctgtggtt 
ggaacatcctggggtgacccccgacccagcctcgctgggctgtcccctgtccctatctct 
cactctggacccagggctgacatcctaataaaataactgttggattagaaactccaaa 

SEQ ID NO: 183. >T08090_T3 # TY Transcript # LN 964 # Source Gene: T08090 # 
Encoded protein: T08090_P25 

gcacgagcggcacgagctcgtgscgattcggcacggaagaggtggccccggatgctgcgg 
cgcccgctggccgggctggctgcggccgccctgggccgggccccaccggacggcatgtcg 
ggccccaggcctgtggtgctgagcgggccttcgggagctgggaagagcaccctgctgaag 
aggctgctccaggagcacagcggcatctttggcttcagcgtgtcccataccacgaggaac 
ccgaggcccggcgaggagaacggcaaagattactactttgtaaccagggaggtgatgcag 
cgtgacatagcagccggcgacttcatcgagcatgccgagttctcggggaacctgtatggc 
acgagcaaggtggcggtgcaggccgtgcaggccatgaaccgcatctgtgtgctggacgtg 
gacctgcagggtgtgcggaacatcaaggccaccgatctgcggcccatctacatctctgtg 
cagccgccttcactgcacgtgctggagcagcggctgcggcagcgcaacactgaaaccgag 
gagagcctggtgaagcggctggctgctgcccaggccgacatggagagcagcaaggagccc 
ggcctgtttgatgtggtcatcattaacgacagcctggaccaggcctacgcagagctgaag 
gaggcgctctctgaggaaatcaagaaagctcaaaggaccggcgcctgaggcttgctgtct 
gttctcggcaccccgggcccatacaggaccagggcagcagcattgagccacccccttggc 
aggcgatacggcagctctgtgcccttggccagcatgtggagtggaggagatgctgcccct 
gtggttggaacatcctggggtgacccccgacccagcctcgctgggctgtcccctgtccct 
atctctcactctggacccagggctgacatcctaataaaataactgttggattagaaactc 
caaa 

SEQ ID NO: 184. >T08090JT4 # TY Transcript # LN 1749 # Source Gene: T08090 
# Encoded protein: T08090_P25 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
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Figur 1 (Cont'd) 



tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggcttgctctgctctttacctggggttgctgtcgaggaccctgtgcaag 
actcggccggtttttctttctccctgatggacagacccaaacatagccgcgcagcatcgt 
gaagggctggggccttcactcctctgtggctctggaagagcccgatttcctcaggaggca 
tgtcgggccccaggcctgtggtgctgagcgggccttcgggagctgggaagagcaccctgc 
tgaagaggctgctccaggagcacagcggcatctttggcttcagcgtgtcccataccacga 
ggaacccgaggcccggcgaggagaacggcaaagattactactttgtaaccagggaggtga 
tgcagcgtgacatagcagccggcgacttcatcgagcatgccgagttctcggggaacctgt 
atggcacgagcaaggtggcggtgcaggccgtgcaggccatgaaccgcatctgtgtgctgg 
acgtggacctgcagggtgtgcggaacatcaaggccaccgatctgcggcccatctacatct 
ctgtgcagccgccttcactgcacgtgctggagcagcggctgcggcagcgcaacactgaaa 
ccgaggagagcctggtgaagcggctggctgctgcccaggccgacatggagagcagcaagg 
agcccggcctgtttgatgtggtcatcattaacgacagcctggaccaggcctacgcagagc 
tgaaggaggcgctctctgaggaaatcaagaaagctcaaaggaccggcgcctgaggcttgc 
tgtctgttctcggcaccccgggcccatacaggaccagggcagcagcattgagccaccccc 
ttggcaggcgatacggcagctctgtgcccttggccagcatgtggagtggaggagatgctg 
cccctgtggttggaacatcctggggtgacccccgacccagcctcgctgggctgtcccctg 
tccctatctctcactctggacccagggctgacatcctaataaaataactgttggattaga 
aactccaaa 

SEQ ID NO: 185. >T08090_T5 # TY Transcript # LN 2509 # Source Gene: T08090 
# Encoded protein: T08090_P25 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggtgagtacgacaagcgcgatcgcgagggtgactcgggtcgaggcggca 
gcggtccctgcgctttgcggtcgcccagtccacgccgcctccggatctcctcctagccga 
gacgccctgtgccgcattcaactgtgggccctgagaacgggggagctctggagggcggac 
gggctggcaacgcagcggagcccgggtcccggagggcggggcgctcctcaggagctgggc 
aggccggcagttatcgcagtgtcgagctgctccccgggcgtataggagcgctcggtgtct 
tcccgacacctgggcgcgccgccaaagagagcagctgggagtccctaccgctggcccggc 
gcggtgtgcgctccggttcccacctcaccctgcaagccttggggtcctgggcggatggag 
ccggctattccggggctcccaggattctgcctactccgcggcacggcgcagagacgggga 
cttggggaaacagcaggccttcccgacgcatgggtgtggcatttggggttcggattctcc 
aaagccagagtccgggttggggttcttgaagctgccctgctgccattcttgagcgtcgcc 
ttagaagtatgccccagtaaggactggtgtgggaagcaggtgcctcaaagttgattttga 
ccctgcgtgtggtgggcctcggtttcgtctctgcatgaaagaggtcggtcctgaggcccc 
gggctgggtcctgctctgctgtctttgggctcatggccccttcctgtctcaggcttgctc 
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tgctctttacctggggttgctgtcgaggaccctgtgcaagactcggccggtttttctttc 
tccctgatggacagacccaaacatagccgcgcagcatcgtgaagggctggggccttcact 
cctctgtggctctggaagagcccgatttcctcaggaggcatgtcgggccccaggcctgtg 
gtgctgagcgggccttcgggagctgggaagagcaccctgctgaagaggctgctccaggag 
cacagcggcatctttggcttcagcgtgtcccataccacgaggaacccgaggcccggcgag 
gagaacggcaaagattactactttgtaaccagggaggtgatgcagcgtgacatagcagcc 
ggcgacttcatcgagcatgccgagttctcggggaacctgtatggcacgagcaaggtggcg 
gtgcaggccgtgcaggccatgaaccgcatctgtgtgctggacgtggacctgcagggtgtg 
cggaacatcaaggccaccgatctgcggcccatctacatctctgtgcagccgccttcactg 
cacgtgctggagcagcggctgcggcagcgcaacactgaaaccgaggagagcctggtgaag 
cggctggctgctgcccaggccgacatggagagcagcaaggagcccggcctgtttgatgtg 
gtcatcattaacgacagcctggaccaggcctacgcagagctgaaggaggcgctctctgag 
gaaatcaagaaagctcaaaggaccggcgcctgaggcttgctgtctgttctcggcaccccg 
ggcccatacaggaccagggcagcagcattgagccacccccttggcaggcgatacggcagc 
tctgtgcccttggccagcatgtggagtggaggagatgctgcccctgtggttggaacatcc 
tggggtgacccccgacccagcctcgctgggctgtcccctgtccctatctctcactctgga 
cccagggctgacatcctaataaaataactgttggattagaaactccaaa 

SEQ ID NO: 186 . >T08090_T6 # TY Transcript # LN 1380 # Source Gene: T08090 

# Encoded protein: T08090_P25 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggcatgtcgggcc 
ccaggcctgtggtgctgagcgggccttcgggagctgggaagagcaccctgctgaagaggc 
tgctccaggagcacagcggcatctttggcttcagcgtgtcccataccacgaggaacccga 
ggcccggcgaggagaacggcaaagattactactttgtaaccagggaggtgatgcagcgtg 
acatagcagccggcgacttcatcgagcatgccgagttctcggggaacctgtatggcacga 
gcaaggtggcggtgcaggccgtgcaggccatgaaccgcatctgtgtgctggacgtggacc 
tgcagggtgtgcggaacatcaaggccaccgatctgcggcccatctacatctctgtgcagc 
cgccttcactgcacgtgctggagcagcggctgcggcagcgcaacactgaaaccgaggaga 
gcctggtgaagcggctggctgctgcccaggccgacatggagagcagcaaggagcccggcc 
tgtttgatgtggtcatcattaacgacagcctggaccaggcctacgcagagctgaaggagg 
cgctctctgaggaaatcaagaaagctcaaaggaccggcgcctgaggcttgctgtctgttc 
tcggcaccccgggcccatacaggaccagggcagcagcattgagccacccccttggcaggc 
gatacggcagctctgtgcccttggccagcatgtggagtggaggagatgctgcccctgtgg 
ttggaacatcctggggtgacccccgacccagcctcgctgggctgtcccctgtccctatct 
ctcactctggacccagggctgacatcctaataaaataactgttggattagaaactccaaa 

SEQ ID NO: 187. >T08090_T7 # TY Transcript # LN 2291 # Source Gene: T08090 

# Encoded protein: T08090_P25 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
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ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggcttgctctgctctttacctggggttgctgtcgaggaccctgtgcaag 
actcggccggtttttctttctccctgatggacagacccaaacatagccgcgcagcatcgt 
gaagggctggggccttcactcctctgtggctctggaagagcccgatttcctcaggagggt 
aaaggaatgttcctgtcccccccggggaaagaaagtgaaaatgaaagcgccctggctgct 
cctggtgtggtcccatttaaaacgaggttgctgaaggccaggatgcgtgctacccggatg 
tgtgcccagaaaccctgcagccaagagcagccgcctggaccaccagcctccccttcatcc 
acctggcccagcccagagcccaggcccacgtctggtggtttgtgtcttacgagaggccag 
gcgggggctgaattctctcggttctcagaaggtcaagccctctgttagcatctcctcgaa 
cagtgggccgggggcaaggtggtgactctgttcagtccttaggatggctggaagccagaa 
ggaggagattatgccgccacagcagggagtgcctttccaggaaagcggctgctggggcag 
catgggagcccagtccagggagcgttctggcagagaccattccggaacatggatctgcgc 
cggcggctcccaggagaccttccatctcggagctcctgtggagaccacgtgcctggcagg 
catgtcgggccccaggcctgtggtgctgagcgggccttcgggagctgggaagagcaccct 
gctgaagaggctgctccaggagcacagcggcatctttggcttcagcgtgtcccataccac 
gaggaacccgaggcccggcgaggagaacggcaaagattactactttgtaaccagggaggt 
gatgcagcgtgacatagcagccggcgacttcatcgagcatgccgagttctcggggaacct 
gtatggcacgagcaaggtggcggtgcaggccgtgcaggccatgaaccgcatctgtgtgct 
ggacgtggacctgcagggtgtgcggaacatcaaggccaccgatctgcggcccatctacat 
ctctgtgcagccgccttcactgcacgtgctggagcagcggctgcggcagcgcaacactga 
aaccgaggagagcctggtgaagcggctggctgctgcccaggccgacatggagagcagcaa 
ggagcccggcctgtttgatgtggtcatcattaacgacagcctggaccaggcctacgcaga 
gctgaaggaggcgctctctgaggaaatcaagaaagctcaaaggaccggcgcctgaggctt 
gctgtctgttctcggcaccccgggcccatacaggaccagggcagcagcattgagccaccc 
ccttggcaggcgatacggcagctctgtgcccttggccagcatgtggagtggaggagatgc 
tgcccctgtggttggaacatcctggggtgacccccgacccagcctcgctgggctgtcccc 
tgtccctatctctcactctggacccagggctgacatcctaataaaataactgttggatta 
gaaactccaaa 

SEQ ID NO: 188. >T08090_T8 # TY Transcript # LN 3070 # Source Gene: T08090 

# Encoded protein: T08090_P25 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggtgagtacgacaagcgcgatcgcgagggtgactcgggtcgaggcggca 
gcggtccctgcgctttgcggtcgcccagtccacgccgcctccggatctcctcctagccga 
gacgccctgtgccgcattcaactgtgggccctgagaacgggggagctctggagggcggac 
gggctggcaacgcagcggagcccgggtcccggagggcggggcgctcctcaggagctgggc 
aggccggcagttatcgcagtgtcgagctgctccccgggcgtataggagcgctcggtgtct 
tcccgacacctgggcgcgccgccaaagagagcagctgggagtccctaccgctggcccggc 
gcggtgtgcgctccggttcccacctcaccctgcaagccttggggtcctgggcggatggag 
ccggctattccggggctcccaggattctgcctactccgcggcacggcgcagagacgggga 
cttggggaaacagcaggccttcccgacgcatgggtgtggcatttggggttcggattctcc 
aaagccagagtccgggttggggttcttgaagctgccctgctgccattcttgagcgtcgcc 
ttagaagtatgccccagtaaggactggtgtgggaagcaggtgcctcaaagttgattttga 
ccctgcgtgtggtgggcctcggtttcgtctctgcatgaaagaggtcggtcctgaggcccc 
gggctgggtcctgctctgctgtctttgggctcatggccccttcctgtctcaggcttgctc 
tgctctttacctggggttgctgtcgaggaccctgtgcaagactcggccggtttttctttc 
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tccctgatggacagacccaaacatagccgcgcagcatcgtgaagggctggggccttcact 
cctctgtggctctggaagagcccgatttcctcaggagggtaaaggaatgttcctgtcccc 
cccggggaaagaaagtgaaaatgaaagcgccctggctgctcctggtgtggtcccatttaa 
aacgaggttgctgaaggccaggatgcgtgctacccggatgtgtgcccagaaaccctgcag 
ccaagagcagccgcctggaccaccagcctccccttcatccacctggcccagcccagagcc 
caggcccacgtctggtggtttgtgtcttacgagaggccaggcgggggctgaattctctcg 
gttctcagaaggtcaagccctctgttagcatctcctcgaacagtgggccgggggcaaggt 
ggtgactctgttcagtccttaggatggctggaagccagaaggaggagattatgccgccac 
agcagggagtgcctttccaggaaagcggctgctggggcagcatgggagcccagtccaggg 
agcgttctggcagagaccattccggaacatggatctgcgccggcggctcccaggagacct 
tccatctcggagctcctgtggagaccacgtgcctggcagcctctcttctcacccccaggc 
atgtcgggccccaggcctgtggtgctgagcgggccttcgggagctgggaagagcaccctg 
ctgaagaggctgctccaggagcacagcggcatctttggcttcagcgtgtcccataccacg 
aggaacccgaggcccggcgaggagaacggcaaagattactactttgtaaccagggaggtg 
atgcagcgtgacatagcagccggcgacttcatcgagcatgccgagttctcggggaacctg 
tatggcacgagcaaggtggcggtgcaggccgtgcaggccatgaaccgcatctgtgtgctg 
gacgtggacctgcagggtgtgcggaacatcaaggccaccgatctgcggcccatctacatc 
tctgtgcagccgccttcactgcacgtgctggagcagcggctgcggcagcgcaacactgaa 
accgaggagagcctggtgaagcggctggctgctgcccaggccgacatggagagcagcaag 
gagcccggcctgtttgatgtggtcatcattaacgacagcctggaccaggcctacgcagag 
ctgaaggaggcgctctctgaggaaatcaagaaagctcaaaggaccggcgcctgaggcttg 
ctgtctgttctcggcaccccgggcccatacaggaccagggcagcagcattgagccacccc 
cttggcaggcgatacggcagctctgtgcccttggccagcatgtggagtggaggagatgct 
gcccctgtggttggaacatcctggggtgacccccgacccagcctcgctgggctgtcccct 
gtccctatctctcactctggacccagggctgacatcctaataaaataactgttggattag 
aaactccaaa 

SEQ ID NO: 189. >T08090_T9 # TY Transcript # LN 2710 # Source Gene: T08090 
# Encoded protein: T08090_P25 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggtgagtacgacaagcgcgatcgcgagggtgactcgggtcgaggcggca 
gcggtccctgcgctttgcggtcgcccagtccacgccgcctccggatctcctcctagccga 
gacgccctgtgccgcattcaactgtgggccctgagaacgggggagctctggagggcggac 
gggctggcaacgcagcggagcccgggtcccggagggcggggcgctcctcaggagctgggc 
aggccggcagttatcgcagtgtcgagctgctccccgggcgtataggagcgctcggtgtct 
tcccgacacctgggcgcgccgccaaagagagcagctgggagtccctaccgctggcccggc 
gcggtgtgcgctccggttcccacctcaccctgcaagccttggggtcctgggcggatggag 
ccggctattccggggctcccaggattctgcctactccgcggcacggcgcagagacgggga 
cttggggaaacagcaggccttcccgacgcatgggtgtggcatttggggttcggattctcc 
aaagccagagtccgggttggggttcttgaagctgccctgctgccattcttgagcgtcgcc 
ttagaagtatgccccagtaaggactggtgtgggaagcaggtgcctcaaagttgattttga 
ccctgcgtgtggtgggcctcggtttcgtctctgcatgaaagaggtcggtcctgaggcccc 
gggctgggtcctgctctgctgtctttgggctcatggccccttcctgtctcaggcttgctc 
tgctctttacctggggttgctgtcgaggaccctgtgcaagactcggccggtttttctttc 
tccctgatggacagacccaaacatagccgcgcagcatcgtgaagggctggggccttcact 
cctctgtggctctggaagagcccgatttcctcaggagttaggatggctggaagccagaag 
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gaggagattatgccgccacagcagggagtgcctttccaggaaagcggctgctggggcagc 
atgggagcccagtccagggagcgttctggcagagaccattccggaacatggatctgcgcc 
ggcggctcccaggagaccttccatctcggagctcctgtggagaccacgtgcctggcaggc 
atgtcgggccccaggcctgtggtgctgagcgggccttcgggagctgggaagagcaccctg 
ctgaagaggctgctccaggagcacagcggcatctttggcttcagcgtgtcccataccacg 
aggaacccgaggcccggcgaggagaacggcaaagattactactttgtaaccagggaggtg 
atgcagcgtgacatagcagccggcgacttcatcgagcatgccgagttctcggggaacctg 
tatggcacgagcaaggtggcggtgcaggccgtgcaggccatgaaccgcatctgtgtgctg 
gacgtggacctgcagggtgtgcggaacatcaaggccaccgatctgcggcccatctacatc 
tctgtgcagccgccttcactgcacgtgctggagcagcggctgcggcagcgcaacactgaa 
accgaggagagcctggtgaagcggctggctgctgcccaggccgacatggagagcagcaag 
gagcccggcctgtttgatgtggtcatcattaacgacagcctggaccaggcctacgcagag 
ctgaaggaggcgctctctgaggaaatcaagaaagctcaaaggaccggcgcctgaggcttg 
ctgtctgttctcggcaccccgggcccatacaggaccagggcagcagcattgagccacccc 
cttggcaggcgatacggcagctctgtgcccttggccagcatgtggagtggaggagatgct 
gcccctgtggttggaacatcctggggtgacccccgacccagcctcgctgggctgtcccct 
gtccctatctctcactctggacccagggctgacatcctaataaaataactgttggattag 
aaactccaaa 

SEQ ID NO: 190. >T08090_T10 # TY Transcript # LN 1905 # Source Gene: T08090 

# Encoded protein: T08090_P25 

atggttggctggcacgtagaggaggtggcagggctgggtgagcagcaggagccaggttag 
tggggaggcccgagctaagcgcaggggcaaattctgcgggtgggagagccaggagggcat 
ggggtgcatggtgagatgaaaacgatgtgtgcttgtgacaaaaagcccagaagaggcagt 
aaggagggaggagtctatcccttatggagagcacatggacttctgggtgggtgtcctgca 
gttcccttcctcttgagtctttttaactgggggtccggtgtgtttgtgatcaatgacatg 
tatgcgtgtgcatgtgtacctttgcaggtacacatgtgtggtgtgtgtgcatttgcttgg 
tgtgcgtgcatttgtgtgtgtgttttgtgtgtgcatgcatgtgtgtgtgtttgtggtagg 
tttgtgcacacatcctgcatacctttgtgcctgggagtggcatgtgtgtggtgggtgtcg 
ctgtgcatgatggagttgcagccgtggtgcatttgtgagattacctcggtggcctaaatt 
ggggattaggagggcatcctcaggtccttcccactgtctgctcgtctgcccacaggcggc 
tgtgccctaaacaggaggaggccattcacgcctcgcctgagttgtgtccaaggtgtgcgt 
gtggccaggggtccatccgcttccctctagcccagcccctgaacacagctgcagtgcacg 
gccccactcctcagctctgctccccatcccaactcgaagacgctgccctggccctgtgtg 
tgcagctcatgtggactgggagggcagggcaggtgcaggtcttggggcaagagctggagc 
tgtcttttccttcctgcacagccgcagagcaggtggatggggctgcttccctgcaagggc 
ccagggccaggccccctggggatttattcgtggcttagaagggtggggccagaagcaggc 
gtagtggggattagggactcagcacccccagctctcagtccagcagacagacccacccca 
ggctgactacagaggctgcacctcagcaaacaggcatgtcgggccccaggcctgtggtgc 
tgagcgggccttcgggagctgggaagagcaccctgctgaagaggctgctccaggagcaca 
gcggcatctttggcttcagcgtgtcccataccacgaggaacccgaggcccggcgaggaga 
acggcaaagattactactttgtaaccagggaggtgatgcagcgtgacatagcagccggcg 
acttcatcgagcatgccgagttctcggggaacctgtatggcacgagcaaggtggcggtgc 
aggccgtgcaggccatgaaccgcatctgtgtgctggacgtggacctgcagggtgtgcgga 
acatcaaggccaccgatctgcggcccatctacatctctgtgcagccgccttcactgcacg 
tgctggagcagcggctgcggcagcgcaacactgaaaccgaggagagcctggtgaagcggc 
tggctgctgcccaggccgacatggagagcagcaaggagcccggcctgtttgatgtggtca 
tcattaacgacagcctggaccaggcctacgcagagctgaaggaggcgctctctgaggaaa 
tcaagaaagctcaaaggaccggcgcctgaggcttgctgtctgttctcggcaccccgggcc 
catacaggaccagggcagcagcattgagccacccccttggcaggcgatacggcagctctg 
tgcccttggccagcatgtggagtggaggagatgctgcccctgtggttggaacatcctggg 
gtgacccccgacccagcctcgctgggctgtcccctgtccctatctctcactctggaccca 
gggctgacatcctaataaaataactgttggattagaaactccaaa 

SEQ ID NO: 191. >T08090JT11 # TY Transcript # LN 2713 # Source Gene: T08090 

# Encoded protein: T08090_P25 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
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tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggtgagtacgacaagcgcgatcgcgagggtgactcgggtcgaggcggca 
gcggtccctgcgctttgcggtcgcccagtccacgccgcctccggatctcctcctagccga 
gacgccctgtgccgcattcaactgtgggccctgagaacgggggagctctggagggcggac 
gggctggcaacgcagcggagcccgggtcccggagggcggggcgctcctcaggagctgggc 
aggccggcagttatcgcagtgtcgagctgctccccgggcgtataggagcgctcggtgtct 
tcccgacacctgggcgcgccgccaaagagagcagctgggagtccctaccgctggcccggc 
gcggtgtgcgctccggttcccacctcaccctgcaagccttggggtcctgggcggatggag 
ccggctattccggggctcccaggattptgcctactccgcggcacggcgcagagacgggga 
cttggggaaacagcaggccttcccgacgcatgggtgtggcatttggggttcggattctcc 
aaagccagagtccgggttggggttcttgaagctgccctgctgccattcttgagcgtcgcc 
ttagaagtatgccccagtaaggactggtgtgggaagcaggtgcctcaaagttgattttga 
ccctgcgtgtggtgggcctcggtttcgtctctgcatgaaagaggtcggtcctgaggcccc 
gggctgggtcctgctctgctgtctttgggctcatggccccttcctgtctcaggcttgctc 
tgctctttacctggggttgctgtcgaggaccctgtgcaagactcggccggtttttctttc 
tccctgatggacagacccaaacatagccgcgcagcatcgtgaagggctggggccttcact 
cctctgtggctctggaagagcccgatttcctcaggagagantcttangatggcnggaaca 
aaangangagattatgcngccacancagggaatgcctttncangaaaacagctgaanggg 
caaatgggancacaatccanggaacgttctngcanaaaccattcnngaacanggatcngc 
gcnggcngctcccaagagaccttccatctngganctcctgtngagaccacgtgcctnnca 
agcatgtcgggccccaggcctgtggtgctgagcgggccttcgggagctgggaagagcacc 
ctgctgaagaggctgctccaggagcacagcggcatctttggcttcagcgtgtcccatacc 
acgaggaacccgaggcccggcgaggagaacggcaaagattactactttgtaaccagggag 
gtgatgcagcgtgacatagcagccggcgacttcatcgagcatgccgagttctcggggaac 
ctgtatggcacgagcaaggtggcggtgcaggccgtgcaggccatgaaccgcatctgtgtg 
ctggacgtggacctgcagggtgtgcggaacatcaaggccaccgatctgcggcccatctac 
atctctgtgcagccgccttcactgcacgtgctggagcagcggctgcggcagcgcaacact 
gaaaccgaggagagcctggtgaagcggctggctgctgcccaggccgacatggagagcagc 
aaggagcccggcctgtttgatgtggtcatcattaacgacagcctggaccaggcctacgca 
gagctgaaggaggcgctctctgaggaaatcaagaaagctcaaaggaccggcgcctgaggc 
ttgctgtctgttctcggcaccccgggcccatacaggaccagggcagcagcattgagccac 
ccccttggcaggcgatacggcagctctgtgcccttggccagcatgtggagtggaggagat 
gctgcccctgtggttggaacatcctggggtgacccccgacccagcctcgctgggctgtcc 
cctgtccctatctctcactctggacccagggctgacatcctaataaaataactgttggat 
tagaaactccaaa 

SEQ ID NO: 192. >T08090JT12 # TY Transcript # LN 14 67 # Source Gene: T08090 
# Encoded protein: T08090_P25 

catcacagataggagtcccaagtcctatgagaagttcctggaataggtgtagattcagta 
gatctttacaagaccatatctgcagggcaaggtaccagaggacagaggcggggacaggga 
cacttccattccagacctagcagcccagcactcagcaccatgcatgggagcaaatggctg 
gactcctgggtggggtgggggtctcagagcaggctcccagagggcttggaggtgactcca 
ccaggtggggacggcagctcccaggtagggtgtcatcagagtagacagcattgcttgcta 
gggacccctggggaggctgacagggtcagtgggtttcagttggggggctcccctgctgag 
aacccagtaaagccggccttccattcgtctcccgtgtgcccagagcctggtctgagggcc 
gccctgtgcatgccggcccttccaacgtggcagagctcagggggaagaacacccaggctc 
tcaggagactctcaggccaatgtctccatccctgggtcagccctttcctgccatgaattc 
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aggaaggcagaggcagctcagcagatggggactagaggccgcactgctatccacagcctc 
tcttctcacccccaggcatgtcgggccccaggcctgtggtgctgagcgggccttcgggag 
ctgggaagagcaccctgctgaagaggctgctccaggagcacagcggcatctttggcttca 
gcgtgtcccataccacgaggaacccgaggcccggcgaggagaacggcaaagattactact 
ttgtaaccagggaggtgatgcagcgtgacatagcagccggcgacttcatcgagcatgccg 
agttctcggggaacctgtatggcacgagcaaggtggcggtgcaggccgtgcaggccatga 
accgcatctgtgtgctggacgtggacctgcagggtgtgcggaacatcaaggccaccgatc 
tgcggcccatctacatctctgtgcagccgccttcactgcacgtgctggagcagcggctgc 
ggcagcgcaacactgaaaccgaggagagcctggtgaagcggctggctgctgcccaggccg 
acatggagagcagcaaggagcccggcctgtttgatgtggtcatcattaacgacagcctgg 
accaggcctacgcagagctgaaggaggcgctctctgaggaaatcaagaaagctcaaagga 
ccggcgcctgaggcttgctgtctgttctcggcaccccgggcccatacaggaccagggcag 
cagcattgagccacccccttggcaggcgatacggcagctctgtgcccttggccagcatgt 
ggagtggaggagatgctgcccctgtggttggaacatcctggggtgacccccgacccagcc 
tcgctgggctgtcccctgtccctatctctcactctggacccagggctgacatcctaataa 
aataactgttggattagaaactccaaa 

SEQ ID NO: 193. >T08090_T13 # TY Transcript # LN 1603 # Source Gene: T08090 

# Encoded protein: T08090_P25 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacgcctctcttctcacccccaggcatgtcgggccccaggcctgtggtgctg 
agcgggccttcgggagctgggaagagcaccctgctgaagaggctgctccaggagcacagc 
ggcatctttggcttcagcgtgtcccataccacgaggaacccgaggcccggcgaggagaac 
ggcaaagattactactttgtaaccagggaggtgatgcagcgtgacatagcagccggcgac 
ttcatcgagcatgccgagttctcggggaacctgtatggcacgagcaaggtggcggtgcag 
gccgtgcaggccatgaaccgcatctgtgtgctggacgtggacctgcagggtgtgcggaac 
atcaaggccaccgatctgcggcccatctacatctctgtgcagccgccttcactgcacgtg 
ctggagcagcggctgcggcagcgcaacactgaaaccgaggagagcctggtgaagcggctg 
gctgctgcccaggccgacatggagagcagcaaggagcccggcctgtttgatgtggtcatc 
attaacgacagcctggaccaggcctacgcagagctgaaggaggcgctctctgaggaaatc 
aagaaagctcaaaggaccggcgcctgaggcttgctgtctgttctcggcaccccgggccca 
tacaggaccagggcagcagcattgagccacccccttggcaggcgatacggcagctctgtg 
cccttggccagcatgtggagtggaggagatgctgcccctgtggttggaacatcctggggt 
gacccccgacccagcctcgctgggctgtcccctgtccctatctctcactctggacccagg 
gctgacatcctaataaaataactgttggattagaaactccaaa 

SEQ ID NO: 194. >T08090_T14 # TY Transcript # LN 1624 # Source Gene: T08090 

# Encoded protein: T08090_P25 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
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gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacgaggccgcactgctatccacagcctctcttctcacccccaggcatgtcg 
ggccccaggcctgtggtgctgagcgggccttcgggagctgggaagagcaccctgctgaag 
aggctgctccaggagcacagcggcatctttggcttcagcgtgtcccataccacgaggaac 
ccgaggcccggcgaggagaacggcaaagattactactttgtaaccagggaggtgatgcag 
cgtgacatagcagccggcgacttcatcgagcatgccgagttctcggggaacctgtatggc 
acgagcaaggtggcggtgcaggccgtgcaggccatgaaccgcatctgtgtgctggacgtg 
gacctgcagggtgtgcggaacatcaaggccaccgatctgcggcccatctacatctctgtg 
cagccgccttcactgcacgtgctggagcagcggctgcggcagcgcaacactgaaaccgag 
gagagcctggtgaagcggctggctgctgcccaggccgacatggagagcagcaaggagccc 
ggcctgtttgatgtggtcatcattaacgacagcctggaccaggcctacgcagagctgaag 
gaggcgctctctgaggaaatcaagaaagctcaaaggaccggcgcctgaggcttgctgtct 
gttctcggcaccccgggcccatacaggaccagggcagcagcattgagccacccccttggc 
aggcgatacggcagctctgtgcccttggccagcatgtggagtggaggagatgctgcccct 
gtggttggaacatcctggggtgacccccgacccagcctcgctgggctgtcccctgtccct 
atctctcactctggacccagggctgacatcctaataaaataactgttggattagaaactc 
caaa 

SEQ ID NO: 195. >T08090_T15 # TY Transcript # LN 1969 # Source Gene: T08090 
# Encoded protein: T08090_P25 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggcttgctctgctctttacctggggttgctgtcgaggaccctgtgcaag 
actcggccggtttttctttctccctgatggacagacccaaacatagccgcgcagcatcgt 
gaagggctggggccttcactcctctgtggctctggaagagcccgatttcctcaggagggt 
aaaggaatgttcctgtcccccccggggaaagaaagtgaaaatgaaagcgccctggctgct 
cctggtgtggtcccatttaaaacgaggttgctgaaggccaggatgcgtgctacccggatg 
tgtgcccagaaaccctgcagccaagagcagccgcctggaccaccagcctccccttcatcc 
acctggcccagcccagagcccaggcccacgtctggtggcatgtcgggccccaggcctgtg 
gtgctgagcgggccttcgggagctgggaagagcaccctgctgaagaggctgctccaggag 
cacagcggcatctttggcttcagcgtgtcccataccacgaggaacccgaggcccggcgag 
gagaacggcaaagattactactttgtaaccagggaggtgatgcagcgtgacatagcagcc 
ggcgacttcatcgagcatgccgagttctcggggaacctgtatggcacgagcaaggtggcg 
gtgcaggccgtgcaggccatgaaccgcatctgtgtgctggacgtggacctgcagggtgtg 
cggaacatcaaggccaccgatctgcggcccatctacatctctgtgcagccgccttcactg 
cacgtgctggagcagcggctgcggcagcgcaacactgaaaccgaggagagcctggtgaag 
cggctggctgctgcccaggccgacatggagagcagcaaggagcccggcctgtttgatgtg 
gtcatcattaacgacagcctggaccaggcctacgcagagctgaaggaggcgctctctgag 
gaaatcaagaaagctcaaaggaccggcgcctgaggcttgctgtctgttctcggcaccccg 
ggcccatacaggaccagggcagcagcattgagccacccccttggcaggcgatacggcagc 
tctgtgcccttggccagcatgtggagtggaggagatgctgcccctgtggttggaacatcc 
tggggtgacccccgacccagcctcgctgggctgtcccctgtccctatctctcactctgga 
cccagggctgacatcctaataaaataactgttggattagaaactccaaa 
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SEQ ID NO: 196. >T08090JT16 # TY Transcript ft LN 1657 # Source Gene: T08090 

# Encoded protein: T08090_P25 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggcatgtcgggccccaggcctgtggtgctgagcgggccttcgggagctg 
ggaagagcaccctgctgaagaggctgctccaggagcacagcggcatctttggcttcagcg 
tgtcccataccacgaggaacccgaggcccggcgaggagaacggcaaagattactactttg 
taaccagggaggtgatgcagcgtgacatagcagccggcgacttcatcgagcatgccgagt 
tctcggggaacctgtatggcacgagcaaggtggcggtgcaggccgtgcaggccatgaacc 
gcatctgtgtgctggacgtggacctgcagggtgtgcggaacatcaaggccaccgatctgc 
ggcccatctacatctctgtgcagccgccttcactgcacgtgctggagcagcggctgcggc 
agcgcaacactgaaaccgaggagagcctggtgaagcggctggctgctgcccaggccgaca 
tggagagcagcaaggagcccggcctgtttgatgtggtcatcattaacgacagcctggacc 
aggcctacgcagagctgaaggaggcgctctctgaggaaatcaagaaagctcaaaggaccg 
gcgcctgaggcttgctgtctgttctcggcaccccgggcccatacaggaccagggcagcag 
cattgagccacccccttggcaggcgatacggcagctctgtgcccttggccagcatgtgga 
gtggaggagatgctgcccctgtggttggaacatcctggggtgacccccgacccagcctcg 
ctgggctgtcccctgtccctatctctcactctggacccagggctgacatcctaataaaat 
aactgttggattagaaactccaaaaaaaraacaaraaaaaaaaaaaaaaaaaaaaaaaaa 
aatacaaaaaaaaaaaaaaaaaaaaaaagrgggggcg 

SEQ ID NO: 197. >T08090_T17 # TY Transcript # LN 2032 # Source Gene: T08090 

# Encoded protein: T08090_P25 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggcatgtcgggccccaggcctgtggtgctgagcgggccttcgggagctg 
ggaagagcaccctgctgaagaggctgctccaggagcacagcggcatctttggcttcagcg 
tgtcccataccacgaggaacccgaggcccggcgaggagaacggcaaagattactactttg 
taaccagggaggtgatgcagcgtgacatagcagccggcgactt ca tcgagcatgccgagt 
tctcggggaacctgtatggcacgagcaaggtggcggtgcaggccgtgcaggccatgaacc 
gcatctgtgtgctggacgtggacctgcagggtgtgcggaacatcaaggccaccgatctgc 
ggcccatctacatctctgtgcagccgccttcactgcacgtgctggagcagcggctgcggc 
agcgcaacactgaaaccgaggagagcctggtgaagcggctggctgctgcccaggccgaca 
tggagagcagcaaggagcccggcctgtttgatgtggtcatcattaacgacagcctggacc 
aggcctacgcagagctgaaggaggcgctctctgaggaaatcaagaaagctcaaaggaccg 
gcgcctgaggcttgctgtctgttctcggcaccccgggcccatacaggaccagggcagcag 
cattgagccacccccttggcaggcgatacggcagctctgtgcccttggccagcatgtgga 
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gtggaggagatgctgcccctgtggttggaacatcctggggtgacccccgacccagcctcg 
ctgggctgtcccctgtccctatctctcactctggacccagggctgacatcctaataaaat 
aactgttggattagaaactccaaaaaaaaacgcgaacgagatcgcggcccagcgaggcaa 
cactcgaggcatctttggtccggtgcttgacaaaaaggcattgaaccgcccacagggatc 
atgctagtctccgaaacccaggttatctttgaacaaacaggtaaaaaagcgggggcgaac 
cccgaatgggagcggcgcggtgcacctaccaaggaacgccgacgcgccggcgcggcttgc 
agggagtggcccgccagcgaccctacaagatgcaccgcccctccaaaacaaccggncgct 
gcattcatccgcgtccggggaaaggcccacgatgcgcccatcgtcgaccagttagaaaaa 
ctacacgcgcgctaatacacaagaagacaagctccctctcgtaaaatcgactcaccgctt 
ctctgtcacactgccccggcgtaaccgtccggctatgctcgcacccgggaca 

SEQ ID NO: 198. >T08090_T18 # TY Transcript # LN 1780 # Source Gene: T08090 

# Encoded protein: T08090_P25 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggcatgtcgggccccaggcctgtggtgctgagcgggccttcgggagctg 
ggaagagcaccctgctgaagaggctgctccaggagcacagcggcatctttggcttcagcg 
tgtcccataccacgaggaacccgaggcccggcgaggagaacggcaaagattactactttg 
taaccagggaggtgatgcagcgtgacatagcagccggcgacttcatcgagcatgccgagt 
tctcggggaacctgtatggcacgagcaaggtggcggtgcaggccgtgcaggccatgaacc 
gcatctgtgtgctggacgtggacctgcagggtgtgcggaacatcaaggccaccgatctgc 
ggcccatctacatctctgtgcagccgccttcactgcacgtgctggagcagcggctgcggc 
agcgcaacactgaaaccgaggagagcctggtgaagcggctggctgctgcccaggccgaca 
tggagagcagcaaggagcccggcctgtttgatgtggtcatcattaacgacagcctggacc 
aggcctacgcagagctgaaggaggcgctctctgaggaaatcaagaaagctcaaaggaccg 
gcgcctgaggcttgctgtctgttctcggcaccccgggcccatacaggaccagggcagcag 
cattgagccacccccttggcaggcgatacggcagctctgtgcccttggccagcatgtgga 
gtggaggagatgctgcccctgtggttggaacatcctggggtgacccccgacccagcctcg 
ctgggctgtcccctgtccctatctctcactctggacccagggctgacatcctaataaaat 
aactgttggattagaaactccaaasgggggattkmgaaaaaccgctgccgcgccccgccc 
cagcctccccgggcattccactttcgcgcggccgggcaaaaaggcggtacactgcaccgg 
gcccactgggggtcacaccccctggttccgtcgcgacccactcttgtttttagcccccac 
aggaggcgggcaaatcattgcacggtgtcatcgccgcaat 

SEQ ID NO: 199. >T08090_T19 # TY Transcript # LN 1759 # Source Gene: T08090 

# Encoded protein: T08090_P25 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
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Figur 1 (Cont'd) 



ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggcatgtcgggccccaggcctgtggtgctgagcgggccttcgggagctg 
ggaagagcaccctgctgaagaggctgctccaggagcacagcggcatctttggcttcagcg 
tgtcccataccacgaggaacccgaggcccggcgaggagaacggcaaagattactactttg 
taaccagggaggtgatgcagcgtgacatagcagccggcgacttcatcgagcatgccgagt 
tctcggggaacctgtatggcacgagcaaggtggcggtgcaggccgtgcaggccatgaacc 
gcatctgtgtgctggacgtggacctgcagggtgtgcggaacatcaaggccaccgatctgc 
ggcccatctacatctctgtgcagccgccttcactgcacgtgctggagcagcggctgcggc 
agcgcaacactgaaaccgaggagagcctggtgaagcggctggctgctgcccaggccgaca 
tggagagcagcaaggagcccggcctgtttgatgtggtcatcattaacgacagcctggacc 
aggcctacgcagagctgaaggaggcgctctctgaggaaatcaagaaagctcaaaggaccg 
gcgcctgaggcttgctgtctgttctcggcaccccgggcccatacaggaccagggcagcag 
cattgagccacccccttggcaggcgatacggcagctctgtgcccttggccagcatgtgga 
gtggaggagatgctgcccctgtggttggaacatcctggggtgacccccgacccagcctcg 
ctgggctgtcccctgtccctatctctcactctggacccagggctgacatcctaataaaat 
aactgttggattagaaactccaaaaaaaaaaataacacctcccgggcttttatgtggcga 
ggtggcggcgaacaaaaggcatgtgaataggccgctggggcgtacaaagctgatttctcg 
gacacgtgcttttcttctgggacagaagacgaaaaagatcttcgcgcggctctcccctat 
aacggagcgcggatatatc 

SEQ ID NO: 200. >T08090_T20 # TY Transcript # LN 1637 # Source Gene: T08090 

# Encoded protein: T08090_P25 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggcatgtcgggccccaggcctgtggtgctgagcgggccttcgggagctg 
ggaagagcaccctgctgaagaggctgctccaggagcacagcggcatctttggcttcagcg 
tgtcccataccacgaggaacccgaggcccggcgaggagaacggcaaagattactactttg 
taaccagggaggtgatgcagcgtgacatagcagccggcgacttcatcgagcatgccgagt 
tctcggggaacctgtatggcacgagcaaggtggcggtgcaggccgtgcaggccatgaacc 
gcatctgtgtgctggacgtggacctgcagggtgtgcggaacatcaaggccaccgatctgc 
ggcccatctacatctctgtgcagccgccttcactgcacgtgctggagcagcggctgcggc 
agcgcaacactgaaaccgaggagagcctggtgaagcggctggctgctgcccaggccgaca 
tggagagcagcaaggagcccggcctgttt gat gtggtca teat taacgacagcctggacc 
aggcctacgcagagctgaaggaggcgctctctgaggaaatcaagaaagctcaaaggaccg 
gcgcctgaggcttgctgtctgttctcggcaccccgggcccatacaggaccagggcagcag 
cattgagccacccccttggcaggcgatacggcagctctgtgcccttggccagcatgtgga 
gtggaggagatgctgcccctgtggttggaacatcctggggtgacccccgacccagcctcg 
ctgggctgtcccctgtccctatctctcactctggacccagggctgacatcctaataaaat 
aactgttggattagaaactccaaaaaaaaaaaaaaaaaaaaatacaaaaaaaaaaaaaaa 
aaaaaaaagrgggggcg 

SEQ ID NO: 201 . >T08090JT21 # TY Transcript S LN 1726 # Source Gene: T08090 

# Encoded protein: T08090_P25 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
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Figure 1 (Conf d) 



aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgcc'ctgggccggg 
ccccaccggacggcatgtcgggccccaggcctgtggtgctgagcgggccttcgggagctg 
ggaagagcaccctgctgaagaggctgctccaggagcacagcggcatctttggcttcagcg 
tgtcccataccacgaggaacccgaggcccggcgaggagaacggcaaagattactactttg 
taaccagggaggtgatgcagcgtgacatagcagccggcgacttcatcgagcatgccgagt 
tctcggggaacctgtatggcacgagcaaggtggcggtgcaggccgtgcaggccatgaacc 
gcatctgtgtgctggacgtggacctgcagggtgtgcggaacatcaaggccaccgatctgc 
ggcccatctacatctctgtgcagccgccttcactgcacgtgctggagcagcggctgcggc 
agcgcaacactgaaaccgaggagagcctggtgaagcggctggctgctgcccaggccgaca 
tggagagcagcaaggagcccggcctgtttgatgtggtcatcattaacgacagcctggacc 
aggcctacgcagagctgaaggaggcgctctctgaggaaatcaagaaagctcaaaggaccg 
gcgcctgaggcttgctgtctgttctcggcaccccgggcccatacaggaccagggcagcag 
cattgagccacccccttggcaggcgatacggcagctctgtgcccttggccagcatgtgga 
gtggaggagatgctgcccctgtggttggaacatcctggggtgacccccgacccagcctcg 
ctgggctgtcccctgtccctatctctcactctggacccagggctgacatcctaataaaat 
aactgttggattagaaactccaaacttgggcaaaccttggtcattcccccgcttcctaaa 
atcttccagaggcattctttggccgggtggccggaaaaaaaaaggggaaatgaaaaaggg 
cccccagggggccaataaattgtgttttcctggaccccaggttttt 

SEQ ID NO: 202 . >T08090_T22 # TY Transcript # LN 1612 # Source Gene: T08090 
# Encoded protein: T08090_P25 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggcatgtcgggccccaggcctgtggtgctgagcgggccttcgggagctg 
ggaagagcaccctgctgaagaggctgctccaggagcacagcggcatctttggcttcagcg 
tgtcccataccacgaggaacccgaggcccggcgaggagaacggcaaagattactactttg 
taaccagggaggtgatgcagcgtgacatagcagccggcgacttcatcgagcatgccgagt 
tctcggggaacctgtatggcacgagcaaggtggcggtgcaggccgtgcaggccatgaacc 
gcatctgtgtgctggacgtggacctgcagggtgtgcggaacatcaaggccaccgatctgc 
ggcccatctacatctctgtgcagccgccttcactgcacgtgctggagcagcggctgcggc 
agcgcaacactgaaaccgaggagagcctggtgaagcggctggctgctgcccaggccgaca 
tggagagcagcaaggagcccggcctgtttgatgtggtcatcattaacgacagcctggacc 
aggcctacgcagagctgaaggaggcgctctctgaggaaatcaagaaagctcaaaggaccg 
gcgcctgaggcttgctgtctgttctcggcaccccgggcccatacaggaccagggcagcag 
cattgagccacccccttggcaggcgatacggcagctctgtgcccttggccagcatgtgga 
gtggaggagatgctgcccctgtggttggaacatcctggggtgacccccgacccagcctcg 
ctgggctgtcccctgtccctatctctcactctggacccagggctgacatcctaataaaat 
aactgttggattagaaactccaaaaaaaaaaaaaaaaaaaaaagrgggggcg 
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Figure 1 (Cont'd) 



SEQ ID NO: 203. >T08090_T23 # TY Transcript # LN 1160 # Source Gene: T08090 

# Encoded protein: T08090_P25 

gcacgagcggcacgagctcgtgscgattcggcacggaagaggtggccccggatgctgcgg 
cgcccgctggccgggctggctgcggccgccctgggccgggccccaccggacggcatgtcg 
ggccccaggcctgtggtgctgagcgggccttcgggagctgggaagagcaccctgctgaag 
aggctgctccaggagcacagcggcatctttggcttcagcgtgtcccataccacgaggaac 
ccgaggcccggcgaggagaacggcaaagattactactttgtaaccagggaggtgatgcag 
cgtgacatagcagccggcgacttcatcgagcatgccgagttctcggggaacctgtatggc 
acgagcaaggtggcggtgcaggccgtgcaggccatgaaccgcatctgtgtgctggacgtg 
gacctgcagggtgtgcggaacatcaaggccaccgatctgcggcccatctacatctctgtg 
cagccgccttcactgcacgtgctggagcagcggctgcggcagcgcaacactgaaaccgag 
gagagcctggtgaagcggctggctgctgcccaggccgacatggagagcagcaaggagccc 
ggcctgtttgatgtggtcatcattaacgacagcctggaccaggcctacgcagagctgaag 
gaggcgctctctgaggaaatcaagaaagctcaaaggaccggcgcctgaggcttgctgtct 
gttctcggcaccccgggcccatacaggaccagggcagcagcattgagccacccccttggc 
aggcgatacggcagctctgtgcccttggccagcatgtggagtggaggagatgctgcccct 
gtggttggaacatcctggggtgacccccgacccagcctcgctgggctgtcccctgtccct 
atctctcactctggacccagggctgacatcctaataaaataactgttggattagaaactc 
caaasgggggattkmgaaaaaccgctgccgcgccccgccccagcctccccgggcattcca 
ctttcgcgcggccgggcaaaaaggcggtacactgcaccgggcccactgggggtcacaccc 
cctggttccgtcgcgacccactcttgtttttagcccccacaggaggcgggcaaatcattg 
cacggtgtcatcgccgcaat 

SEQ ID NO:204. >T08090JT24 # TY Transcript # LN 2910 # Source Gene: T08090 

# Encoded protein: T08090_P8 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggtgagtacgacaagcgcgatcgcgagggtgactcgggtcgaggcggca 
gcggtccctgcgctttgcggtcgcccagtccacgccgcctccggatctcctcctagccga 
gacgccctgtgccgcattcaactgtgggccctgagaacgggggagctctggagggcggac 
gggctggcaacgcagcggagcccgggtcccggagggcggggcgctcctcaggagctgggc 
aggccggcagttatcgcagtgtcgagctgctccccgggcgtataggagcgctcggtgtct 
tcccgacacctgggcgcgccgccaaagagagcagctgggagtccctaccgctggcccggc 
gcggtgtgcgctccggttcccacctcaccctgcaagccttggggtcctgggcggatggag 
ccggctattccggggctcccaggattctgcctactccgcggcacggcgcagagacgggga 
cttggggaaacagcaggccttcccgacgcatgggtgtggcatttggggttcggattctcc 
aaagccagagtccgggttggggttcttgaagctgccctgctgccattcttgagcgtcgcc 
ttagaagtatgccccagtaaggactggtgtgggaagcaggtgcctcaaagttgattttga 
ccctgcgtgtggtgggcctcggtttcgtctctgcatgaaagaggtcggtcctgaggcccc 
gggctgggtcctgctctgctgtctttgggctcatggccccttcctgtctcaggcttgctc 
tgctctttacctggggttgctgtcgaggaccctgtgcaagactcggccggtttttctttc 
tccctgatggacagacccaaacatagccgcgcagcatcgtgaagggctggggccttcact 
cctctgtggctctggaagagcccgatttcctcaggaggcatgtcgggccccaggcctgtg 
gtgctgagcgggccttcgggagctgggaagagcaccctgctgaagaggctgctccaggag 
cacagcggcatctttggcttcagcgtgtcccgtgagtccagggctctcgtggaggggtgc 
gtagacctcaaggctgctgagtagtcctagcaccgtgagcaggccaggagcccaaaccca 
acaggcacacccaccctgcagactgtccgaactcttgcacactcccccccacacagaacc 
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Figure 1 (Cont'd) 



tgaggttatcacactcctgctgtcctgcgtgcctgtgtctcccttccctgggtctgttga 
gtactgataactgggccacagtgtttctttctgggagaaccctcgccttgtaggctcctg 
cgccttcccagtggtgtgcttcactggctgcctgcatcctggggctcaagtgctgtcggg 
actgcaagggaaacgctgggtggggcattgggctccgagcagcccccgatgggtgacagg 
tctctctgctagataccacgaggaacccgaggcccggcgaggagaacggcaaagattact 
actttgtaaccagggaggtgatgcagcgtgacatagcagccggcgacttcatcgagcatg 
ccgagttctcggggaacctgtatggcacgagcaaggtggcggtgcaggccgtgcaggcca 
tgaaccgcatctgtgtgctggacgtggacctgcagggtgtgcggaacatcaaggccaccg 
atctgcggcccatctacatctctgtgcagccgccttcactgcacgtgctggagcagcggc 
tgcggcagcgcaacactgaaaccgaggagagcctggtgaagcggctggctgctgcccagg 
ccgacatggagagcagcaaggagcccggcctgtttgatgtggtcatcattaacgacagcc 
tggaccaggcctacgcagagctgaaggaggcgctctctgaggaaatcaagaaagctcaaa 
ggaccggcgcctgaggcttgctgtctgttctcggcaccccgggcccatacaggaccaggg 
cagcagcattgagccacccccttggcaggcgatacggcagctctgtgcccttggccagca 
tgtggagtggaggagatgctgcccctgtggttggaacatcctggggtgacccccgaccca 
gcctcgctgggctgtcccctgtccctatctctcactctggacccagggctgacatcctaa 
taaaataactgttggattagaaactccaaa 

SEQ ID NO: 205. >T08090_T25 # TY Transcript # LN 2316 # Source Gene: T08090 
# Encoded protein: T08090_P8 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggcttgctctgctctttacctggggttgctgtcgaggaccctgtgcaag 
actcggccggtttttctttctccctgatggacagacccaaacatagccgcgcagcatcgt 
gaagggctggggccttcactcctctgtggctctggaagagcccgatttcctcaggagggt 
aaaggaatgttcctgtcccccccggggaaagaaagtgaaaatgaaagcgccctggctgct 
cctggtgtggtcccatttaaaacgaggttgctgaaggccaggatgcgtgctacccggatg 
tgtgcccagaaaccctgcagccaagagcagccgcctggaccaccagcctccccttcatcc 
acctggcccagcccagagcccaggcccacgtctggtggtttgtgtcttacgagaggccag 
gcgggggctgaattctctcggttctcagaaggtcaagccctctgttagcatctcctcgaa 
cagtgggccgggggcaaggtggtgactctgttcagtccttaggatggctggaagccagaa 
ggaggagattatgccgccacagcagggagtgcctttccaggaaagcggctgctggggcag 
catgggagcccagtccagggagcgttctggcagagaccattccggaacatggatctgcgc 
cggcggctcccaggagaccttccatctcggagctcctgtggagaccacgtgcctggcagg 
catgtcgggccccaggcctgtggtgctgagcgggccttcgggagctgggaagagcaccct 
gctgaagaggctgctccaggagcacagcggcatctttggcttcagcgtgtcccgtgagtc 
cagggctctcgtggagggataccacgaggaacccgaggcccggcgaggagaacggcaaag 
attactactttgtaaccagggaggtgatgcagcgtgacatagcagccggcgacttcatcg 
agcatgccgagttctcggggaacctgtatggcacgagcaaggtggcggtgcaggccgtgc 
aggccatgaaccgcatctgtgtgctggacgtggacctgcagggtgtgcggaacatcaagg 
ccaccgatctgcggcccatctacatctctgtgcagccgccttcactgcacgtgctggagc 
agcggctgcggcagcgcaacactgaaaccgaggagagcctggtgaagcggctggctgctg 
cccaggccgacatggagagcagcaaggagcccggcctgtttgatgtggtcatcattaacg 
acagcctggaccaggcctacgcagagctgaaggaggcgctctctgaggaaatcaagaaag 
ctcaaaggaccggcgcctgaggcttgctgtctgttctcggcaccccgggcccatacagga 
ccagggcagcagcattgagccacccccttggcaggcgatacggcagctctgtgcccttgg 
ccagcatgtggagtggaggagatgctgcccctgtggttggaacatcctggggtgaccccc 
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gacccagcctcgctgggctgtcccctgtccctatctctcactctggacccagggctgaca 
tcctaataaaataactgttggattagaaactccaaa 

SEQ ID NO:206. >T08090_T26 # TY Transcript # LN 2150 # Source Gene: T08090 

# Encoded protein: T08090_P8 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggcttgctctgctctttacctggggttgctgtcgaggaccctgtgcaag 
actcggccggtttttctttctccctgatggacagacccaaacatagccgcgcagcatcgt 
gaagggctggggccttcactcctctgtggctctggaagagcccgatttcctcaggaggca 
tgtcgggccccaggcctgtggtgctgagcgggccttcgggagctgggaagagcaccctgc 
tgaagaggctgctccaggagcacagcggcatctttggcttcagcgtgtcccgtgagtcca 
gggctctcgtggaggggtgcgtagacctcaaggctgctgagtagtcctagcaccgtgagc 
aggccaggagcccaaacccaacaggcacacccaccctgcagactgtccgaactcttgcac 
actcccccccacacagaacctgaggttatcacactcctgctgtcctgcgtgcctgtgtct 
cccttccctgggtctgttgagtactgataactgggccacagtgtttctttctgggagaac 
cctcgccttgtaggctcctgcgccttcccagtggtgtgcttcactggctgcctgcatcct 
ggggctcaagtgctgtcgggactgcaagggaaacgctgggtggggcattgggctccgagc 
agcccccgatgggtgacaggtctctctgctagataccacgaggaacccgaggcccggcga 
ggagaacggcaaagattactactttgtaaccagggaggtgatgcagcgtgacatagcagc 
cggcgacttcatcgagcatgccgagttctcggggaacctgtatggcacgagcaaggtggc 
ggtgcaggccgtgcaggccatgaaccgcatctgtgtgctggacgtggacctgcagggtgt 
gcggaacatcaaggccaccgatctgcggcccatctacatctctgtgcagccgccttcact 
gcacgtgctggagcagcggctgcggcagcgcaacactgaaaccgaggagagcctggtgaa 
gcggctggctgctgcccaggccgacatggagagcagcaaggagcccggcctgtttgatgt 
ggtcatcattaacgacagcctggaccaggcctacgcagagctgaaggaggcgctctctga 
ggaaatcaagaaagctcaaaggaccggcgcctgaggcttgctgtctgttctcggcacccc 
gggcccatacaggaccagggcagcagcattgagccacccccttggcaggcgatacggcag 
ctctgtgcccttggccagcatgtggagtggaggagatgctgcccctgtggttggaacatc 
ctggggtgacccccgacccagcctcgctgggctgtcccctgtccctatctctcactctgg 
acccagggctgacatcctaataaaataactgttggattagaaactccaaa 

SEQ ID NO: 207. >T08090_T27 # TY Transcript # LN 1609 # Source Gene: T08090 

# Encoded protein: T08090_P8 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggcatgtcgggccccaggcctgtggtgctgagcgggccttcgggagctg 
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Figure 1 (Cont'd) 



ggaagagcaccctgctgaagaggctgctccaggagcacagcggcatctttggcttcagcg 
tgtcccgtgagtccagggctctcgtggagggataccacgaggaacccgaggcccggcgag 
gagaacggcaaagattactactttgtaaccagggaggtgatgcagcgtgacatagcagcc 
ggcgacttcatcgagcatgccgagttctcggggaacctgtatggcacgagcaaggtggcg 
gtgcaggccgtgcaggccatgaaccgcatctgtgtgctggacgtggacctgcagggtgtg 
cggaacatcaaggccaccgatctgcggcccatctacatctctgtgcagccgccttcactg 
cacgtgctggagcagcggctgcggcagcgcaacactgaaaccgaggagagcctggtgaag 
cggctggctgctgcccaggccgacatggagagcagcaaggagcccggcctgtttgatgtg 
gtcatcattaacgacagcctggaccaggcctacgcagagctgaaggaggcgctctctgag 
gaaatcaagaaagctcaaaggaccggcgcctgaggcttgctgtctgttctcggcaccccg 
ggcccatacaggaccagggcagcagcattgagccacccccttggcaggcgatacggcagc 
tctgtgcccttggccagcatgtggagtggaggagatgctgcccctgtggttggaacatcc 
tggggtgacccccgacccagcctcgctgggctgtcccctgtccctatctctcactctgga 
cccagggctgacatcctaataaaataactgttggattagaaactccaaa 

SEQ ID NO: 208. >T08090_T28 # TY Transcript # LN 931 # Source Gene: T08090 

# Encoded protein: T08090_P9 

accaggcgatcttgctgcctctcatcatcggtgtgatccttcttggcctcctcgctgtgg 
cgcttattgctttctgcatcgtccggacacacccatccggctgaccaagccctctgagca 
tatgcttcaagacccactgcactaaagggtgcgccgtaggagcacagcggcatctttggc 
ttcagcgtgtcccataccacgaggaacccgaggcccggcgaggagaacggcaaagattac 
tactttgtaaccagggaggtgatgcagcgtgacatagcagccggcgacttcatcgagcat 
gccgagttctcggggaacctgtatggcacgagcaaggtggcggtgcaggccgtgcaggcc 
atgaaccgcatctgtgtgctggacgtggacctgcagggtgtgcggaacatcaaggccacc 
gatctgcggcccatctacatctctgtgcagccgccttcactgcacgtgctggagcagcgg 
ctgcggcagcgcaacactgaaaccgaggagagcctggtgaagcggctggctgctgcccag 
gccgacatggagagcagcaaggagcccggcctgtttgatgtggtcatcattaacgacagc 
ctggaccaggcctacgcagagctgaaggaggcgctctctgaggaaatcaagaaagctcaa 
aggaccggcgcctgaggcttgctgtctgttctcggcaccccgggcccatacaggaccagg 
gcagcagcattgagccacccccttggcaggcgatacggcagctctgtgcccttggccagc 
atgtggagtggaggagatgctgcccctgtggttggaacatcctggggtgacccccgaccc 
agcctcgctgggctgtcccctgtccctatctctcactctggacccagggctgacatccta 
ataaaataactgttggattagaaactccaaa 

SEQ ID NO: 209. >T08090_T29 # TY Transcript « LN 1868 # Source Gene: T08090 

# Encoded protein: T08090_P8 

catcacagataggagtcccaagtcctatgagaagttcctggaataggtgtagattcagta 
gatctttacaagaccatatctgcagggcaaggtaccagaggacagaggcggggacaggga 
cacttccattccagacctagcagcccagcactcagcaccatgcatgggagcaaatggctg 
gactcctgggtggggtgggggtctcagagcaggctcccagagggcttggaggtgactcca 
ccaggtggggacggcagctcccaggtagggtgtcatcagagtagacagcattgcttgcta 
gggacccctggggaggctgacagggtcagtgggtttcagttggggggctcccctgctgag 
aacccagtaaagccggccttccattcgtctcccgtgtgcccagagcctggtctgagggcc 
gccctgtgcatgccggcccttccaacgtggcagagctcagggggaagaacacccaggctc 
tcaggagactctcaggccaatgtctccatccctgggtcagccctttcctgccatgaattc 
aggaaggcagaggcagctcagcagatggggactagaggccgcactgctatccacagcctc 
tcttctcacccccaggcatgtcgggccccaggcctgtggtgctgagcgggccttcgggag 
ctgggaagagcaccctgctgaagaggctgctccaggagcacagcggcatctttggcttca 
gcgtgtcccgtgagtccagggctctcgtggaggggtgcgtagacctcaaggctgctgagt 
agtcctagcaccgtgagcaggccaggagcccaaacccaacaggcacacccaccctgcaga 
ctgtccgaactcttgcacactcccccccacacagaacctgaggttatcacactcctgctg 
tcctgcgtgcctgtgtctcccttccctgggtctgttgagtactgataactgggccacagt 
gtttctttctgggagaaccctcgccttgtaggctcctgcgccttcccagtggtgtgcttc 
actggctgcctgcatcctggggctcaagtgctgtcgggactgcaagggaaacgctgggtg 
gggcattgggctccgagcagcccccgatgggtgacaggtctctctgctagataccacgag 
gaacccgaggcccggcgaggagaacggcaaagattactactttgtaaccagggaggtgat 
gcagcgtgacatagcagccggcgacttcatcgagcatgccgagttctcggggaacctgta 
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Figure 1 (Cont'd) 



tggcacgagcaaggtggcggtgcaggccgtgcaggccatgaaccgcatctgtgtgctgga 
cgtggacctgcagggtgtgcggaacatcaaggccaccgatctgcggcccatctacatctc 
tgtgcagccgccttcactgcacgtgctggagcagcggctgcggcagcgcaacactgaaac 
cgaggagagcctggtgaagcggctggctgctgcccaggccgacatggagagcagcaagga 
gcccggcctgtttgatgtggtcatcattaacgacagcctggaccaggcctacgcagagct 
gaaggaggcgctctctgaggaaatcaagaaagctcaaaggaccggcgcctgaggcttgct 
gtctgttctcggcaccccgggcccatacaggaccagggcagcagcattgagccaccccct 
tggcaggcgatacggcagctctgtgcccttggccagcatgtggagtggaggagatgctgc 
ccctgtggttggaacatcctggggtgacccccgacccagcctcgctgggctgtcccctgt 
ccctatctctcactctggacccagggctgacatcctaataaaataactgttggattagaa 
actccaaa 

SEQ ID NO: 210. >T08090_T30 # TY Transcript ft LN 1985 # Source Gene: T0809O 

# Encoded protein: T08090_P8 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggcatgtcgggccccaggcctgtggtgctgagcgggccttcgggagctg 
ggaagagcaccctgctgaagaggctgctccaggagcacagcggcatctttggcttcagcg 
tgtcccgtgagtccagggctctcgtggaggggtgcgtagacctcaaggctgctgagtagt 
cctagcaccgtgagcaggccaggagcccaaacccaacaggcacacccaccctgcagactg 
tccgaactcttgcacactcccccccacacagaacctgaggttatcacactcctgctgtcc 
tgcgtgcctgtgtctcccttccctgggtctgttgagtactgataactgggccacagtgtt 
tctttctgggagaaccctcgccttgtaggctcctgcgccttcccagtggtgtgcttcact 
ggctgcctgcatcctggggctcaagtgctgtcgggactgcaagggaaacgctgggtgggg 
cattgggctccgagcagcccccgatgggtgacaggtctctctgctagataccacgaggaa 
cccgaggcccggcgaggagaacggcaaagattactactttgtaaccagggaggtgatgca 
gcgtgacatagcagccggcgacttcatcgagcatgccgagttctcggggaacctgtatgg 
cacgagcaaggtggcggtgcaggccgtgcaggccatgaaccgcatctgtgtgctggacgt 
ggacctgcagggtgtgcggaacatcaaggccaccgatctgcggcccatctacatctctgt 
gcagccgccttcactgcacgtgctggagcagcggctgcggcagcgcaacactgaaaccga 
ggagagcctggtgaagcggctggctgctgcccaggccgacatggagagcagcaaggagcc 
cggcctgtttgatgtggtcatcattaacgacagcctggaccaggcctacgcagagctgaa 
ggaggcgctctctgaggaaatcaagaaagctcaaaggaccggcgcctgaggcttgctgtc 
tgttctcggcaccccgggcccatacaggaccagggcagcagcattgagccacccccttgg 
caggcgatacggcagctctgtgcccttggccagcatgtggagtggaggagatgctgcccc 
tgtggttggaacatcctggggtgacccccgacccagcctcgctgggctgtcccctgtccc 
tatctctcactctggacccagggctgacatcctaataaaataactgttggattagaaact 
ccaaa 

SEQ ID NO: 211. >T08090_T31 # TY Transcript ft LN 3067 # Source Gene: T08090 

# Encoded protein: T08090_P8 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
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ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggcttgctctgctctttacctggggttgctgtcgaggaccctgtgcaag 
actcggccggtttttctttctccctgatggacagacccaaacatagccgcgcagcatcgt 
gaagggctggggccttcactcctctgtggctctggaagagcccgatttcctcaggagggt 
aaaggaatgttcctgtcccccccggggaaagaaagtgaaaatgaaagcgccctggctgct 
cctggtgtggtcccatttaaaacgaggttgctgaaggccaggatgcgtgctacccggatg 
tgtgcccagaaaccctgcagccaagagcagccgcctggaccaccagcctccccttcatcc 
acctggcccagcccagagcccaggcccacgtctggtggtttgtgtcttacgagaggccag 
gcgggggctgaattctctcggttctcagaaggtcaagccctctgttagcatctcctcgaa 
cagtgggccgggggcaaggtggtgactctgttcagtccttaggatggctggaagccagaa 
ggaggagattatgccgccacagcagggagtgcctttccaggaaagcggctgctggggcag 
catgggagcccagtccagggagcgttctggcagagaccattccggaacatggatctgcgc 
cggcggctcccaggagaccttccatctcggagctcctgtggagaccacgtgcctggcagg 
catgtcgggccccaggcctgtggtgctgagcgggccttcgggagctgggaagagcaccct 
gctgaagaggctgctccaggagcacagcggcatctttggcttcagcgtgtcccataccac 
gaggaacccgaggcccggcgaggagaacggcaaaggtgagtggggtggggccctatggct 
ggagcacccccagtgtgggcagggctgctgggccctgcagctgtgttggctgtgctgccc 
gtctcctgcccccatcaatccctaatctgtgagatgggtccttgcctccaagggccggtg 
aactcaatcagggtgtcagcgccacagcgtggtgtcgccttccttgggtacagtgtgaga 
ggccggccaaggcctggggctgtcttcctcccaccttggaggcggccacagtgctgctgt 
ccccagccctgtcctggactcggcacttatcagcacttttgagctgtcttctggggtcct 
ggtaaaaagggctactctgcctgcctgattcaagacaagggacccccttcccaacagcac 
ccccgccccttgccgtgcaacccagtggtctccagtcaccccaccacatcgtcccctctg 
taacctgacggtctccagttcccccaccaccttcccccagaacctgttggtctccagtcc 
ccaccccatcaccaccaactcccaactccccactggaacccagcagtctcgtatctccat 
cagtgaggacagtgtgagaaatggtgtctggctcaggcacttggcagcacttgaggggtc 
ctcagatgtctcctgcccagcaaggctgactaaagcagtcatgggtgtgggaggggcctg 
caggcatgcctgggttgggggcagctggccctgggcaccctggtgcaggtccagtctgcc 
ctctggatggcccctcctcttccccagattaattactactttgtaaccagggaggtgatg 
cagcgtgacatagcagccggcgacttcatcgagcatgccgagttctcggggaacctgtat 
ggcacgagcaaggtggcggtgcaggccgtgcaggccatgaaccgcatctgtgtgctggac 
gtggacctgcagggtgtgcggaacatcaaggccaccgatctgcggcccatctacatctct 
gtgcagccgccttcactgcacgtgctggagcagcggctgcggcagcgcaacactgaaacc 
gaggagagcctggtgaagcggctggctgctgcccaggccgacatggagagcagcaaggag 
cccggcctgtttgatgtggtcatcattaacgacagcctggaccaggcctacgcagagctg 
aaggaggcgctctctgaggaaatcaagaaagctcaaaggaccggcgcctgaggcttgctg 
tctgttctcggcaccccgggcccatacaggaccagggcagcagcattgagccaccccctt 
ggcaggcgatacggcagctctgtgcccttggccagcatgtggagtggaggagatgctgcc 
cctgtggttggaacatcctggggtgacccccgacccagcctcgctgggctgtcccctgtc 
cctatctctcactctggacccagggctgacatcctaataaaataactgttggattagaaa 
ctccaaa 

SEQ ID NO: 212 . >T08090_T32 # TY Transcript I LN 3686 # Source Gene: T08090 
# Encoded protein: T08090_P10 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
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Figur 1 (Cont'd) 



gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggtgagtacgacaagcgcgatcgcgagggtgactcgggtcgaggcggca 
gcggtccctgcgctttgcggtcgcccagtccacgccgcctccggatctcctcctagccga 
gacgccctgtgccgcattcaactgtgggccctgagaacgggggagctctggagggcggac 
gggctggcaacgcagcggagcccgggtcccggagggcggggcgctcctcaggagctgggc 
aggccggcagttatcgcagtgtcgagctgctccccgggcgtataggagcgctcggtgtct 
tcccgacacctgggcgcgccgccaaagagagcagctgggagtccctaccgctggcccggc 
gcggtgtgcgctccggttcccacctcaccctgcaagccttggggtcctgggcggatggag 
ccggctattccggggctcccaggattctgcctactccgcggcacggcgcagagacgggga 
cttggggaaacagcaggccttcccgacgcatgggtgtggcatttggggttcggattctcc 
aaagccagagtccgggttggggttcttgaagctgccctgctgccattcttgagcgtcgcc 
ttagaagtatgccccagtaaggactggtgtgggaagcaggtgcctcaaagttgattttga 
ccctgcgtgtggtgggcctcggtttcgtctctgcatgaaagaggtcggtcctgaggcccc 
gggctgggtcctgctctgctgtctttgggctcatggccccttcctgtctcaggcttgctc 
tgctctttacctggggttgctgtcgaggaccctgtgcaagactcggccggtttttctttc 
tccctgatggacagacccaaacatagccgcgcagcatcgtgaagggctggggccttcact 
cctctgtggctctggaagagcccgatttcctcaggaggcatgtcgggccccaggcctgtg 
gtgctgagcgggccttcgggagctgggaagagcaccctgctgaagaggctgctccaggag 
cacagcggcatctttggcttcagcgtgtcccgtgagtccagggctctcgtggaggggtgc 
gtagacctcaaggctgctgagtagtcctagcaccgtgagcaggccaggagcccaaaccca 
acaggcacacccaccctgcagactgtccgaactcttgcacactcccccccacacagaacc 
tgaggttatcacactcctgctgtcctgcgtgcctgtgtctcccttccctgggtctgttga 
gtactgataactgggccacagtgtttctttctgggagaaccctcgccttgtaggctcctg 
cgccttcccagtggtgtgcttcactggctgcctgcatcctggggctcaagtgctgtcggg 
actgcaagggaaacgctgggtggggcattgggctccgagcagcccccgatgggtgacagg 
tctctctgctagataccacgaggaacccgaggcccggcgaggagaacggcaaaggtgagt 
ggggtggggccctatggctggagcacccccagtgtgggcagggctgctgggccctgcagc 
tgtgttggctgtgctgcccgtctcctgcccccatcaatccctaatctgtgagatgggtcc 
ttgcctccaagggccggtgaactcaatcagggtgtcagcgccacagcgtggtgtcgcctt 
ccttgggtacagtgtgagaggccggccaaggcctggggctgtcttcctcccaccttggag 
gcggccacagtgctgctgtccccagccctgtcctggactcggcacttatcagcacttttg 
agctgtcttctggggtcctggtaaaaagggctactctgcctgcctgattcaagacaaggg 
acccccttcccaacagcacccccgccccttgccgtgcaacccagtggtctccagtcaccc 
caccacatcgtcccctctgtaacctgacggtctccagttcccccaccaccttcccccaga 
acctgttggtctccagtccccaccccatcaccaccaactcccaactccccactggaaccc 
agcagtctcgtatctccatcagtgaggacagtgtgagaaatggtgtctggctcaggcact 
tggcagcacttgaggggtcctcagatgtctcctgcccagcaaggctgactaaagcagtca 
tgggtgtgggaggggcctgcaggcatgcctgggttgggggcagctggccctgggcaccct 
ggtgcaggtccagtctgccctctggatggcccctcctcttccccagattaattactactt 
tgtaaccagggaggtgatgcagcgtgacatagcagccggcgacttcatcgagcatgccga 
gttctcggggaacctgtatggcacgagcaaggtggcggtgcaggccgtgcaggccatgaa 
ccgcatctgtgtgctggacgtggacctgcagggtgtgcggaacatcaaggccaccgatct 
gcggcccatctacatctctgtgcagccgccttcactgcacgtgctggagcagcggctgcg 
gcagcgcaacactgaaaccgaggagagcctggtgaagcggctggctgctgcccaggccga 
catggagagcagcaaggagcccggcctgtttgatgtggtcatcattaacgacagcctgga 
ccaggcctacgcagagctgaaggaggcgctctctgaggaaatcaagaaagctcaaaggac 
cggcgcctgaggcttgctgtctgttctcggcaccccgggcccatacaggaccagggcagc 
agcattgagccacccccttggcaggcgatacggcagctctgtgcccttggccagcatgtg 
gagtggaggagatgctgcccctgtggttggaacatcctggggtgacccccgacccagcct 
cgctgggctgtcccctgtccctatctctcactctggacccagggctgacatcctaataaa 
ataactgttggattagaaactccaaa 

SEQ ID NO: 213. >T08090_T33 # TY Transcript # LN 3285 # Source Gene: T08090 
# Encoded protein: T08090_P8 
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cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggtgagtacgacaagcgcgatcgcgagggtgactcgggtcgaggcggca 
gcggtccctgcgctttgcggtcgcccagtccacgccgcctccggatctcctcctagccga 
gacgccctgtgccgcattcaactgtgggccctgagaacgggggagctctggagggcggac 
gggctggcaacgcagcggagcccgggtcccggagggcggggcgctcctcaggagctgggc 
aggccggcagttatcgcagtgtcgagctgctccccgggcgtataggagcgctcggtgtct 
tcccgacacctgggcgcgccgccaaagagagcagctgggagtccctaccgctggcccggc 
gcggtgtgcgctccggttcccacctcaccctgcaagccttggggtcctgggcggatggag 
ccggctattccggggctcccaggattctgcctactccgcggcacggcgcagagacgggga 
cttggggaaacagcaggccttcccgacgcatgggtgtggcatttggggttcggattctcc 
aaagctagagtccgggttggggttcttgaagctgccctgctgccattcttgagcgtcgcc 
ttagaagtatgccccagtaaggactggtgtgggaagcaggtgcctcaaagttgattttga 
ccctgcgtgtggtgggcctcggtttcgtctctgcatgaaagaggtcggtcctgaggcccc 
gggctgggtcctgctctgctgtctttgggctcatggccccttcctgtctcaggcttgctc 
tgctctttacctggggttgctgtcgaggaccctgtgcaagactcggccggtttttctttc 
tccctgatggacagacccaaacatagccgcgcagcatcgtgaagggctggggccttcact 
cctctgtggctctggaagagcccgatttcctcaggaggcatgtcgggccccaggcctgtg 
gtgctgagcgggccttcgggagctgggaagagcaccctgctgaagaggctgctccaggag 
cacagcggcatctttggcttcagcgtgtcccataccacgaggaacccgaggcccggcgag 
gagaacggcaaaggtgagtggggtggggccctatggctggagcacccccagtgtgggcag 
ggctgctgggccctgcagctgtgttggctgtgctgcccgtctcctgcccccatcaatccc 
taatctgtgagatgggtccttgcctccaagggccggtgaactcaatcagggtgtcagcgc 
cacagcgtggtgtcgccttccttgggtacagtgtgagaggccggccaaggcctggggctg 
tcttcctcccaccttggaggcggccacagtgctgctgtccccagccctgtcctggactcg 
gcacttatcagcacttttgagctgtcttctggggtcctggtaaaaagggctactctgcct 
gcctgattcaagacaagggacccccttcccaacagcacccccgccccttgccgtgcaacc 
cagtggtctccagtcaccccaccacatcgtcccctctgtaacctgacggtctccagttcc 
cccaccaccttcccccagaacctgttggtctccagtccccaccccatcaccaccaactcc 
caactccccactggaacccagcagtctcgtatctccatcagtgaggacagtgtgagaaat 
ggtgtctggctcaggcacttggcagcacttgaggggtcctcagatgtctcctgcccagca 
aggctgactaaagcagtcatgggtgtgggaggggcctgcaggcatgcctgggttgggggc 
agctggccctgggcaccctggtgcaggtccagtctgccctctggatggcccctcctcttc 
cccagattaattactactttgtaaccagggaggtgatgcagcgtgacatagcagccggcg 
acttcatcgagcatgccgagttctcggggaacctgtatggcacgagcaaggtggcggtgc 
aggccgtgcaggccatgaaccgcatctgtgtgctggacgtggacctgcagggtgtgcgga 
acatcaaggccaccgatctgcggcccatctacatctctgtgcagccgccttcactgcacg 
tgctggagcagcggctgcggcagcgcaacactgaaaccgaggagagcctggtgaagcggc 
tggctgctgcccaggccgacatggagagcagcaaggagcccggcctgtttgatgtggtca 
tcattaacgacagcctggaccaggcctacgcagagctgaaggaggcgctctctgaggaaa 
tcaagaaagctcaaaggaccggcgcctgaggcttgctgtctgttctcggcaccccgggcc 
catacaggaccagggcagcagcattgagccacccccttggcaggcgatacggcagctctg 
tgcccttggccagcatgtggagtggaggagatgctgcccctgtggttggaacatcctggg 
gtgacccccgacccagcctcgctgggctgtcccctgtccctatctctcactctggaccca 
gggctgacatcctaataaaataactgttggattagaaactccaaa 
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SEQ ID NO: 214. >T08090_T34 # TY Transcript # LN 1891 # Source Gene: T08090 
ft Encoded protein: T08090_P25 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggcttgctctgctctttacctggggttgctgtcgaggaccctgtgcaag 
actcggccggtttttctttctccctgatggacagacccaaacatagccgcgcagcatcgt 
gaagggctggggccttcactcctctgtggctctggaagagcccgatttcctcaggaggca 
tgtcgggccccaggcctgtggtgctgagcgggccttcgggagctgggaagagcaccctgc 
tgaagaggctgctccaggagcacagcggcatctttggcttcagcgtgtcccataccacga 
ggaacccgaggcccggcgaggagaacggcaaagattactactttgtaaccagggaggtga 
tgcagcgtgacatagcagccggcgacttcatcgagcatgccgagttctcggggaacctgt 
atggcacgagcaaggtggcggtgcaggccgtgcaggccatgaaccgcatctgtgtgctgg 
acgtggacctgcagggtgtgcggaacatcaaggccaccgatctgcggcccatctacatct 
ctgtgcagccgccttcactgcacgtgctggagcagcggctgcggcagcgcaacactgaaa 
ccgaggagagcctggtgaagcggctggctgctgcccaggccgacatggagagcagcaagg 
agcccggcctgtttgatgtggtcatcattaacgacagcctggaccaggcctacgcagagc 
tgaaggaggcgctctctgaggaaatcaagaaagctcaaaggaccggcgcctgaggcttgc 
tgtctgttctcggcaccccgggcccatacaggaccagggcagcagcattgagccaccccc 
ttggcaggcgatacggcagctctgtgcccttggccagcatgtggagtggaggagatgctg 
cccctgtggttggaacatcctggggtgacccccgacccagcctcgctgggctgtcccctg 
tccctatctctcactctggacccagggctgacatcctaataaaataactgttggattaga 
aactccaaacttgggcaaaccttggtcattcccccgcttcctaaaatcttccagaggcat 
tctttggccgggtggccggaaaaaaaaaggggaaatgaaaaagggcccccagggggccaa 
taaattgtgttttcctggaccccaggttttt 

SEQ ID NO: 215. >T08090_T35 # TY Transcript # LN 682 # Source Gene: T08090 

# Encoded protein: T08090_P11 

gcacgagcggcacgagctcgtgscgattcggcacgcgacttcatcgagcatgccgagttc 
tcggggaacctgtatggcacgagcaaggtggcggtgcaggccgtgcaggccatgaaccgc 
atctgtgtgctggacgtggacctgcagggtgtgcggaacatcaaggccaccgatctgcgg 
cccatctacatctctgtgcagccgccttcactgcacgtgctggagcagcggctgcggcag 
cgcaacactgaaaccgaggagagcctggtgaagcggctggctgctgcccaggccgacatg 
gagagcagcaaggagcccggcctgtttgatgtggtcatcattaacgacagcctggaccag 
gcctacgcagagctgaaggaggcgctctctgaggaaatcaagaaagctcaaaggaccggc 
gcctgaggcttgctgtctgttctcggcaccccgggcccatacaggaccagggcagcagca 
ttgagccacccccttggcaggcgatacggcagctctgtgcccttggccagcatgtggagt 
ggaggagatgctgcccctgtggttggaacatcctggggtgacccccgacccagcctcgct 
gggctgtcccctgtccctatctctcactctggacccagggctgacatcctaataaaataa 
ctgttggattagaaactccaaa 

SEQ ID NO: 216. >T08090_T36 # TY Transcript * LN 1645 # Source Gene: T08090 

# Encoded protein: T08090_P25 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
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tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggcatgtcgggccccaggcctgtggtgctgagcgggccttcgggagctg 
ggaagagcaccctgctgaagaggctgctccaggagcacagcggcatctttggcttcagcg 
tgtcccataccacgaggaacccgaggcccggcgaggagaacggcaaagattactactttg 
taaccagggaggtgatgcagcgtgacatagcagccggcgacttcatcgagcatgccgagt 
tctcggggaacctgtatggcacgagcaaggtggcggtgcaggccgtgcaggccatgaacc 
gcatctgtgtgctggacgtggacctgcagggtgtgcggaacatcaaggccaccgatctgc 
ggcccatctacatctctgtgcagccgccttcactgcacgtgctggagcagcggctgcggc 
agcgcaacactgaaaccgaggagagcctggtgaagcggctggctgctgcccaggccgaca 
tggagagcagcaaggagcccggcctgtttgatgtggtcatcattaacgacagcctggacc 
aggcctacgcagagctgaaggaggcgctctctgaggaaatcaagaaagctcaaaggaccg 
gcgcctgaggcttgctgtctgttctcggcaccccgggcccatacaggaccagggcagcag 
cattgagccacccccttggcaggcgatacggcagctctggggggggtttttttgggggcg 
ccccnccccccccccccctttttttttggggggccccccccaaggganccccnaatgtgg 
agtggaggagatgctgcccctgtggttggaacatcctggggtgacccccgacccagcctc 
gctgggctgtcccctgtccctatctctcactctggacccagggctgacatcctaataaaa 
taactgttggattagaaactccaaa 

SEQ ID NO: 217. >T08090_T37 # TY Transcript # LN 1819 # Source Gene: T08090 
# Encoded protein: T08090_P13 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggcttgctctgctctttacctggggttgctgtcgaggaccctgtgcaag 
actcggccggtttttctttctccctgatggacagacccaaacatagccgcgcagcatcgt 
gaagggctggggccttcactcctctgtggctctggaagagcccgatttcctcaggaggca 
tgtcgggccccaggcctgtggtgctgagcgggccttcgggagctgggaagagcaccctgc 
tgaagaggctgctccaggagcacagcggcatctttggcttcagcgtgtcccataccacga 
ggaacccgaggcccggcgaggagaacggcaaagattactactttgtaaccagggaggtga 
tgcagcgtgacatagcagccggcgacttcatcgagcatgccgagttctcggggaacctgt 
atggcacgagcaaggtggcggtgcaggccgtgcaggccatgaaccgcatctgtgtgctgg 
acgtggacctgcagggtgtgcggaacatcaaggccaccgatctgcggcccatctacatct 
ctgtgcagccgccttcactgcacgtgctggtgtgtgctgggcagggttgggggctggggg 
ccagggcatgccaggctctgattgccaccccctttttaggagcagcggctgcggcagcgc 
aacactgaaaccgaggagagcctggtgaagcggctggctgctgcccaggccgacatggag 
agcagcaaggagcccggcctgtttgatgtggtcatcattaacgacagcctggaccaggcc 
tacgcagagctgaaggaggcgctctctgaggaaatcaagaaagctcaaaggaccggcgcc 
tgaggcttgctgtctgttctcggcaccccgggcccatacaggaccagggcagcagcattg 
agccacccccttggcaggcgatacggcagctctgtgcccttggccagcatgtggagtgga 
ggagatgctgcccctgtggttggaacatcctggggtgacccccgacccagcctcgctggg 
ctgtcccctgtccctatctctcactctggacccagggctgacatcctaataaaataactg 
ttggattagaaactccaaa 
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SEQ ID NO: 218. >T08090_T38 » TY Transcript # LN 1652 # Source Gene: T08090 

# Encoded protein: T08090_P25 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggcatgtcgggccccaggcctgtggtgctgagcgggccttcgggagctg 
ggaagagcaccctgctgaagaggctgctccaggagcacagcggcatctttggcttcagcg 
tgtcccataccacgaggaacccgaggcccggcgaggagaacggcaaagattactactttg 
taaccagggaggtgatgcagcgtgacatagcagccggcgacttcatcgagcatgccgagt 
tctcggggaacctgtatggcacgagcaaggtggcggtgcaggccgtgcaggccatgaacc 
gcatctgtgtgctggacgtggacctgcagggtgtgcggaacatcaaggccaccgatctgc 
ggcccatctacatctctgtgcagccgccttcactgcacgtgctggagcagcggctgcggc 
agcgcaacactgaaaccgaggagagcctggtgaagcggctggctgctgcccaggccgaca 
tggagagcagcaaggagcccggcctgtttgatgtggtcatcattaacgacagcctggacc 
aggcctacgcagagctgaaggaggcgctctctgaggaaatcaagaaagctcaaaggaccg 
gcgcctgaggcttgctgtcgaggaccctgtgcaagactcggccggtttttctttctccct 
gatggacagacccaaacatagccgcgcagcatcgtgaagggctggggccttcactcctct 
gtggctctggaagagcccgatttcctcaggaggcatgtcgggccccaggcctgtggtgct 
gagcgggccttcgggagctgggaagagcaccctgctgaagaggctgctccaggagcacag 
cggcatctttggcttcagcgtgtcccgtgagtccagggctctcgtggagggataccacga 
ggaacccgaggcccggcgaggagaacggcaaa 

SEQ ID NO: 219. >T08090_T39 # TY Transcript # LN 1792 # Source Gene: T08090 

# Encoded protein: T08090_P15 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggcatgtcgggccccaggcctgtggtgctgagcgggccttcgggagctg 
ggaagagcaccctgctgaagaggctgctccaggagcacagcggcatctttggcttcagcg 
tgtcccataccacgaggaacccgaggcccggcgaggagaacggcaaagattactactttg 
taaccagggaggtgatgcagcgtgacatagcagccggcgacttcatcgagcatgccgagt 
tctcggggaacctgtatggcacgagcaaggtggcggtgcaggccgtgcaggccatgaacc 
gcatctgtgtgctggacgtggacctgcagggtgtgcggaacatcaaggccaccgatctgc 
ggcccatctacatctctgtgcagccgccttcactgcacgtgctggagcagcggctgcggc 
agcgcaacactgaaaccgaggagagcctggtgaagcggctggctgctgcccaggccgaca 
tggagagcagcaaggagcccggcctgtttgatgtggtcatcattaacgacagcctggacc 
aggcctacgcagagctgaaggaggcgctctctgaggtgggcccatccttgtgcctacctg 
ggcaaggcccaaggggaggcctgggggccaggcctttgttgtccatgaggccactgaggt 
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tagatgggacagtcctacccaagcactggcatgagacaccgaggtccacggtggagggag 
agcaggaagcccagcccttcctggataccagccctcccaactccctttcttcctcactgg 
caggaaatcaagaaagctcaaaggaccggcgcctgaggcttgctgtctgttctcggcacc 
ccgggcccatacaggaccagggcagcagcattgagccacccccttggcaggcgatacggc 
agctctgtgcccttggccagcatgtggagtggaggagatgctgcccctgtggttggaaca 
tcctggggtgacccccgacccagcctcgctgggctgtcccctgtccctatctctcactct 
ggacccagggctgacatcctaataaaataactgttggattagaaactccaaa 

SEQ ID NO: 220. >T08090_T40 # TY Transcript * LN 1667 # Source Gene: T08090 
# Encoded protein: T08090_P14 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggcatgtcgggccccaggcctgtggtgctgagcgggccttcgggagctg 
ggaagagcaccctgctgaagaggctgctccaggagcacagcggcatctttggcttcagcg 
tgtcccataccacgaggaacccgaggcccggcgaggagaacggcaaagattactactttg 
taaccagggaggtgatgcagcgtgacatagcagccggcgacttcatcgagcatgccgagt 
tctcggggaacctgtatggcacgagcaaggtggcggtgcaggccgtgcaggccatgaacc 
gcatctgtgtgctggacgtggacctgcagggtgtgcggaacatcaaggccaccgatctgc 
ggcccatctacatctctgtgcagccgccttcactgcacgtgctggagcagcggctgcggc 
agcgcaacactgaaaccgaggagagcctggtgaagcggctggctgctgcccaggccgaca 
tggagagcagcaaggagcccggcctgtttgatgtggtcatcattaacgacagcctggacc 
aggcctacgcagagctgaaggaggcgctctctgaggtgggcccatccttgtgcctacctg 
ggcaaggcccaaggggaggcctgggggccaggcctttgttgtccatgaggccactgagga 
aatcaagaaagctcaaaggaccggcgcctgaggcttgctgtctgttctcggcaccccggg 
cccatacaggaccagggcagcagcattgagccacccccttggcaggcgatacggcagctc 
tgtgcccttggccagcatgtggagtggaggagatgctgcccctgtggttggaacatcctg 
gggtgacccccgacccagcctcgctgggctgtcccctgtccctatctctcactctggacc 
cagggctgacatcctaataaaataactgttggattagaaactccaaa 

SEQ ID NO:221. >T08090_T41 # TY Transcript # LN 1498 # Source Gene: T08090 
< Encoded protein: T08090_P14 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggcatgtcgggccccaggcctgtggtgctgagcgggccttcgggagctg 
ggaagagcaccctgctgaagaggctgctccaggagcacagcggcatctttggcttcagcg 
tgtcccataccacgaggaacccgaggcccggcgaggagaacggcaaagattactactttg 
taaccagggaggtgatgcagcgtgacatagcagccggcgacttcatcgagcatgccgagt 
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tctcggggaacctgtatggcacgagcaaggtggcggtgcaggccgtgcaggccatgaacc 
gcatctgtgtgctggacgtggacctgcagggtgtgcggaacatcaaggccaccgatctgc 
ggcccatctacatctctgtgcagccgccttcactgcacgtgctggagcagcggctgcggc 
agcgcaacactgaaaccgaggagagcctggtgaagcggctggctgctgcccaggccgaca 
tggagagcagaaatcaagaaagctcaaaggaccggcgcctgaggcttgctgtctgttctc 
ggcaccccgggcccatacaggaccagggcagcagcattgagccacccccttggcaggcga 
tacggcagctctgtgcccttggccagcatgtggagtggaggagatgctgcccctgtggtt 
ggaacatcctggggtgacccccgacccagcctcgctgggctgtcccctgtccctatctct 
cactctggacccagggctgacatcctaataaaataactgttggattagaaactccaaa 

SEQ ID NO:222. >T08090_T42 # TY Transcript # LN 949 # Source Gene: T08090 

# Encoded protein: T08090_P16 

cccccacacccccgccccccccccccccccacctcaccccccacacccccgcccccccac 
cacccccaggccancgaagccctggcgtcgcgcaattcgcgcaccagaagaggtggcccc 
ggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccgggccccaccgg 
acggcatgtcgggccccaggcctgtggtgctgagcgggccttcgggagctgggaagagca 
ccctgctgaagaggctgctccaggagcacagcggcatctttggcttcagcgtgtcccata 
ccacgaggaacccgaggcccggcgaggagaacggcaaagattactactttgtaaccaggg 
aggtgatgcagcgtgacatagcagccggcgacttcatcgagcatgccgagttctcgggga 
acctgtatggcacgagcaaggtggcggtgcaggccgtgcaggccatgaaccgcatctgtg 
tgctggacgtggacctgcagggtgtgcggaacatcaaggccaccgatctgcggcccatct 
acatctctgtgcagccgccttcactgcacgtgctggagcagcggctgcggcagcgcaaca 
ctgaaaccgaggagagcctggtgaagcggctggctgctgcccaggccgacatggagagca 
gaaatcaagaaagctcaaaggaccggcgcctgaggcttgctgtctgttctcggcaccccg 
ggcccatacaggaccagggcagcagcattgagccacccccttggcaggcgatacggcagc 
tctgtgcccttggccagcatgtggagtggaggagatgctgcccctgtggttggaacatcc 
tggggtgacccccgacccagcctcgctgggctgtcccctgtccctatctctcactctgga 
cccagggctgacatcctaataaaataactgttggattagaaactccaaa 

SEQ ID NO:223. >T08090_T43 # TY Transcript # LN 1445 # Source Gene: T08090 

# Encoded protein: T08090_P17 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggcatgtcgggccccaggcctgtggtgctgagcgggccttcgggagctg 
ggaagagcaccctgctgaagaggctgctccaggagcacagcggcatctttggcttcagcg 
tgtcccataccacgaggaacccgaggcccggcgaggagaacggcaaagattactactttg 
taaccagggaggtgatgcagcgtgacatagcagccggcgacttcatcgagcatgccgagt 
tctcggggaacctgtatggcacgaggagcagcggctgcggcagcgcaacactgaaaccga 
ggagagcctggtgaagcggctggctgctgcccaggccgacatggagagcagcaaggagcc 
cggcctgtttgatgtggtcatcattaacgacagcctggaccaggcctacgcagagctgaa 
ggaggcgctctctgaggaaatcaagaaagctcaaaggaccggcgcctgaggcttgctgtc 
tgttctcggcaccccgggcccatacaggaccagggcagcagcattgagccacccccttgg 
caggcgatacggcagctctgtgcccttggccagcatgtggagtggaggagatgctgcccc 
tgtggttggaacatcctggggtgacccccgacccagcctcgctgggctgtcccctgtccc 
tatctctcactctggacccagggctgacatcctaataaaataactgttggattagaaact 
ccaaa 
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SEQ ID NO: 224 . >T08090_T44 # TY Transcript # LN 858 # Source Gene: T08090 

# Encoded protein: T08090_P18 

ctgttccctgtaggcggggcagggcgtgggggtagcaggtttgagatgctgtcgggtgct 
gggtccaggccaggcctaggctcagctgtgggaggagaacgctgggcccgggagggcctg 
ggtgtccctgaagctcctgtaggcctcagagagccctggcacccctgctgacctggcacc 
tctccccagcaaggtggcggtgcaggccgtgcaggccatgaaccgcatctgtgtgctgga 
cgtggacctgcagggtgtgcggaacatcaaggccaccgatctgcggcccatctacatctc 
tgtgcagccgccttcactgcacgtgctggtgtgtgctgggcagggttgggggctgggggc 
cagggcatgccaggctctgattgccaccccctttttaggagcagcggctgcggcagcgca 
acactgaaaccgaggagagcctggtgaagcggctggctgctgcccaggccgacatggaga 
gcagcaaggagcccggcctgtttgatgtggtcatcattaacgacagcctggaccaggcct 
acgcagagctgaaggaggcgctctctgaggaaatcaagaaagctcaaaggaccggcgcct 
gaggcttgctgtctgttctcggcaccccgggcccatacaggaccagggcagcagcattga 
gccacccccttggcaggcgatacggcagctctgtgcccttggccagcatgtggagtggag 
gagatgctgcccctgtggttggaacatcctggggtgacccccgacccagcctcgctgggc 
tgtcccctgtccctatctctcactctggacccagggctgacatcctaataaaataactgt 
tggattagaaactccaaa 

SEQ ID NO: 225 . >T08090_T45 # TY Transcript # LN 1696 # Source Gene: T08090 

# Encoded protein: T08090_P19 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggcttgctctgctctttacctggggttgctgtcgaggaccctgtgcaag 
actcggccggtttttctttctccctgatggacagacccaaacatagccgcgcagcatcgt 
gaagggctggggccttcactcctctgtggctctggaagagcccgatttcctcaggaggca 
tgtcgggccccaggcctgtggtgctgagcgggccttcgggagctgggaagagcaccctgc 
tgaagaggctgctccaggagcacagcggcatctttggcttcagcgtgtcccataccacga 
ggaacccgaggcccggcgaggagaacggcaaagattactactttgtaaccagggaggtga 
tgcagcgtgacatagcagccggcgacttcatcgagcatgccgagttctcggggaacctgt 
atggcacgagcaaggtggcggtgcaggccgtgcaggccatgaaccgcatctgtgtgctgg 
acgtggacctgcagggtgtgcggaacatcaaggccaccgatctgcggcccatctacatct 
ctgtgcagccgccttcactgcacgtgctggctggctgctgcccaggccgacatggagagc 
agcaaggagcccggcctgtttgatgtggtcatcattaacgacagcctggaccaggcctac 
gcagagctgaaggaggcgctctctgaggaaatcaagaaagctcaaaggaccggcgcctga 
ggcttgctgtctgttctcggcaccccgggcccatacaggaccagggcagcagcattgagc 
cacccccttggcaggcgatacggcagctctgtgcccttggccagcatgtggagtggagga 
gatgctgcccctgtggttggaacatcctggggtgacccccgacccagcctcgctgggctg 
tcccctgtccctatctctcactctggacccagggctgacatcctaataaaataactgttg 
gattagaaactccaaa 

SEQ ID NO:226. >T08090_T46 # TY Transcript # LN 1541 # Source Gene: T08090 

# Encoded protein: T08090_P14 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
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tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggcatgtcgggccccaggcctgtggtgctgagcgggccttcgggagctg 
ggaagagcaccctgctgaagaggctgctccaggagcacagcggcatctttggcttcagcg 
tgtcccataccacgaggaacccgaggcccggcgaggagaacggcaaagattactactttg 
taaccagggaggtgatgcagcgtgacatagcagccggcgacttcatcgagcatgccgagt 
tctcggggaacctgtatggcacgagcaaggtggcggtgcaggccgtgcaggccatgaacc 
gcatctgtgtgctggacgtggacctgcagggtgtgcggaacatcaaggccaccgatctgc 
ggcccatctacatctctgtgcagccgccttcactgcacgtgctggagcagcggctgcggc 
agcgcaacactgaaaccgaggagagcctggtgaagcggctggctgctgcccaggccgaca 
tggagagcacctggaccaggcctacgcagagctgaaggaggcgctctctgaggaaatcaa 
gaaagctcaaaggaccggcgcctgaggcttgctgtctgttctcggcaccccgggcccata 
caggaccagggcagcagcattgagccacccccttggcaggcgatacggcagctctgtgcc 
cttggccagcatgtggagtggaggagatgctgcccctgtggttggaacatcctggggtga 
cccccgacccagcctcgctgggctgtcccctgtccctatctctcactctggacccagggc 
tgacatcctaataaaataactgttggattagaaactccaaa 

SEQ ID NO: 227. >T08090_T47 # TY Transcript # LN 2558 # Source Gene: T08090 
# Encoded protein: T08090_P20 

catcacagataggagtcccaagtcctatgagaagttcctggaataggtgtagattcagta 
gatctttacaagaccatatctgcagggcaaggtaccagaggacagaggcggggacaggga 
cacttccattccagacctagcagcccagcactcagcaccatgcatgggagcaaatggctg 
gactcctgggtggggtgggggtctcagagcaggctcccagagggcttggaggtgactcca 
ccaggtggggacggcagctcccaggtagggtgtcatcagagtagacagcattgcttgcta 
gggacccctggggaggctgacagggtcagtgggtttcagttggggggctcccctgctgag 
aacccagtaaagccggccttccattcgtctcccgtgtgcccagagcctggtctgagggcc 
gccctgtgcatgccggcccttccaacgtggcagagctcagggggaagaacacccaggctc 
tcaggagactctcaggccaatgtctccatccctgggtcagccctttcctgccatgaattc 
aggaaggcagaggcagctcagcagatggggactagaggccgcactgctatccacagcctc 
tcttctcacccccaggcatgtcgggccccaggcctgtggtgctgagcgggccttcgggag 
ctgggaagagcaccctgctgaagaggctgctccaggagcacagcggcatctttggcttca 
gcgtgtcccgtgagtccagggctctcgtggaggggtgcgtagacctcaaggctgctgagt 
agtcctagcaccgtgagcaggccaggagcccaaacccaacaggcacacccaccctgcaga 
ctgtccgaactcttgcacactcccccccacacagaacctgaggttatcacactcctgctg 
tcctgcgtgcctgtgtctcccttccctgggtctgttgagtactgataactgggccacagt 
gtttctttctgggagaaccctcgccttgtaggctcctgcgccttcccagtggtgtgcttc 
actggctgcctgcatcctggggctcaagtgctgtcgggactgcaagggaaacgctgggtg 
gggcattgggctccgagcagcccccgatgggtgacaggtctctctgctagataccacgag 
gaacccgaggcccggcgaggagaacggcaaaggtgagtggggtggggccctatggctgga 
gcacccccagtgtgggcagggctgctgggccctgcagctgtgttggctgtgctgcccgtc 
tcctgcccccatcaatccctaatctgtgagatgggtccttgcctccaagggccggtgaac 
tcaatcagggtgtcagcgccacagcgtggtgtcgccttccttgggtacagtgtgagaggc 
cggccaaggcctggggctgtcttcctcccaccttggaggcggccacagtgctgctgtccc 
cagccctgtcctggactcggcacttatcagcacttttgagctgtcttctggggtcctggt 
aaaaagggctactctgcctgcctgattcaagacaagggacccccttcccaacagcacccc 
cgccccttgccgtgcaacccagtggtctccagtcaccccaccacatcgtcccctctgtaa 
cctgacggtctccagttcccccaccaccttcccccagaacctgttggtctccagtcccca 
ccccatcaccaccaactcccaactccccactggaacccagcagtctcgtatctccatcag 
tgaggacagtgtgagaaatggtgtctggctcaggcacttggcagcacttgaggggtcctc 
agatgtctcctgcccagcaaggctgactaaagcagtcatgggtgtgggaggggcctgcag 
gcatgcctgggttgggggcagctggccctgggcaccctggtgcaggtccagtctgccctc 
tggatggcccctcctcttccccagattaattactactttgtaaccagggaggtgatgcag 
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cgtgacatagcagccggcgacttcatcgagcatgccgagttctcggggaacctgtatggc 
acgagcaaggtggcggtgcaggccgtgcaggccatgaaccgcatctgtgtgctggacgtg 
gacctgcagggtgtgcggaacatcaaggccaccgatctgcggcccatctacatctctgtg 
cagccgccttcactgcacgtgctggagcagcggctgcggcagcgcaacactgaaaccgag 
gagagcctggtgaagcggctggctgctgcccaggccgacatggagagcagaaatcaagaa 
agctcaaaggaccggcgcctgaggcttgctgtctgttctcggcaccccgggcccatacag 
gaccagggcagcagcattgagccacccccttggcaggcgatacggcagctctgtgccctt 
ggccagcatgtggagtggaggagatgctgcccctgtggttggaacatcctggggtgaccc 
ccgacccagcctcgctgggctgtcccctgtccctatctctcactctggacccagggctga 
catcctaataaaataactgttggattagaaactccaaa 

SEQ ID NO: 228. >T08090_T48 # TY Transcript # LN 1718 # Source Gene: T08090 

# Encoded protein: T08090_P21 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacgttaggatggctggaagccagaaggaggagattatgccgccacagcagg 
gagtgcctttccaggaaagcggctgctggggcagcatgggagcccagtccagggagcgtt 
ctggcagagaccattccggaacatggatctgcgccggcggctcccaggagaccttccatc 
tcggagctcctgtggagaccacgtgcctggcagcctctcttctcacccccaggcatgtcg 
ggccccaggcctgtggtgctgagcgggccttcgggagctgggaagagcaccctgctgaag 
aggctgctccaggagcacagcggcatctttggcttcagcgtgtcccataccacgaggaac 
ccgaggcccggcgaggagaacggcaaagat tact act ttgtaaccagggaggtgatgcag 
cgtgacatagcagccggcgacttcatcgagcatgccgagttctcggggaacctgtatggc 
acgagcaaggtggcggtgcaggccgtgcaggccatgaaccgcatctgtgtgctggacgtg 
gacctgcagggtgtgcggaacatcaaggccaccgatctgcggcccatctacatctctgtg 
cagccgccttcactgcacgtgctggagcagcggctgcggcagcgcaacactgaaaccgag 
gagagcctggtgaagcggctggctgctgcccaggccgacatggagagcagaaatcaagaa 
agctcaaaggaccggcgcctgaggcttgctgtctgttctcggcaccccgggcccatacag 
gaccagggcagcagcattgagccacccccttggcaggcgatacggcagctctgtgccctt 
ggccagcatgtggagtggaggagatgctgcccctgtggttggaacatcctggggtgaccc 
ccgacccagcctcgctgggctgtcccctgtccctatctctcactctggacccagggctga 
catcctaataaaataactgttggattagaaactccaaa 

SEQ ID NO: 229. >T08090_T49 # TY Transcript < LN 1115 « Source Gene: T08O90 

# Encoded protein: T08090_P7 

gctgagagcaagatgggtcaccagcagctgtactggagccacccgcgaaaattcggccag 
ggttctcgctcttgtcgtgtctgttcaaaccggcacggtctgatccggaaatatggcctc 
aatatgtgccgccagtgtttccgtcagtacgcgaaggatatcggtttcattaagttggac 
taaatgctcttccttcagaggattatccggggcatctactcaatgaaaaaccatgataat 
tctttgtatataaaataaacatttgaaaaaaccaaaaaaaaaaaaaaaaaaaacgacttc 
atcgagcatgccgagttctcggggaacctgtatggcacgagcaaggtggcggtgcaggcc 
gtgcaggccatgaaccgcatctgtgtgctggacgtggacctgcagggtgtgcggaacatc 
aaggccaccgatctgcggcccatctacatctctgtgcagccgccttcactgcacgtgctg 
gagcagcggctgcggcagcgcaacactgaaaccgaggagagcctggtgaagcggctggct 
gctgcccaggccgacatggagagcagcaaggagcccggcctgtttgatgtggtcatcatt 
aacgacagcctggaccaggcctacgcagagctgaaggaggcgctctctgaggaaatcaag 
aaagctcaaaggaccggcgcctgaggcttgctgtctgttctcggcaccccgggcccatac 
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aggaccagggcagcagcattgagccacccccttggcaggcgatacggcagctctgtgccc 
ttggccagcatgtggagtggaggagatgctgcccctgtggttggaacatcctggggtgac 
ccccgacccagcctcgctgggctgtcccctgtccctatctctcactctggacccagggct 
gacatcctaataaaataactgttggattagaaactccaaaaaaaaaaataacacctcccg 
ggcttttatgtggcgaggtggcggcgaacaaaaggcatgtgaataggccgctggggcgta 
caaagctgatttctcggacacgtgcttttcttctgggacagaagacgaaaaagatcttcg 
cgcggctctcccctataacggagcgcggatatatc 

SEQ ID NO: 230 . >T08090_T50 # TY Transcript # LN 1554 # Source Gene: T08090 

# Encoded protein: T08090_P1 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggcttgctctgctctttacctggggttgctgtcgaggaccctgtgcaag 
actcggccggtttttctttctccctgatggacagacccaaacatagccgcgcagcatcgt 
gaagggctggggccttcactcctctgtggctctggaagagcccgatttcctcaggaggca 
tgtcgggccccaggcctgtggtgctgagcgggccttcgggagctgggaagagcaccctgc 
tgaagaggctgctccaggagcacagcggcatctttggcttcagcgtgtcccataccacga 
ggaacccgaggcccggcgaggagaacggcaaagattactactttgtaaccagggaggtga 
tgcagcgtgacatagcagccggcgacttcatcgagcatgccgagttctcggggaacctgt 
atggcacgagcaaggtggcggtgcaggccgtgcaggccatgaaccgcatctgtgtgctgg 
acgtggacctgcagggtgtgcggaacatcaaggccaccgatctgcggcccatctacatct 
ctgtgcagccgccttcactgcacgtgctggagcagcggctgcggcagcgcaacactgaaa 
ccgaggagagcctggtgaagcgggcagcagcattgagccacccccttggcaggcgatacg 
gcagctctgtgcccttggccagcatgtggagtggaggagatgctgcccctgtggttggaa 
catcctggggtgacccccgacccagcctcgctgggctgtcccctgtccctatctctcact 
ctggacccagggctgacatcctaataaaataactgttggattagaaactccaaa 

SEQ ID NO: 231. >T08090_T51 # TY Transcript # LN 1686 # Source Gene: T08090 

# Encoded protein: T08090_P22 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacgcctctcttctcacccccaggcatgtcgggccccaggcctgtggtgctg 
agcgggccttcgggagctgggaagagcaccctgctgaagaggctgctccaggagcacagc 
ggcatctttggcttcagcgtgtcccataccacgaggaacccgaggcccggcgaggagaac 
ggcaaagattactactttgtaaccagggaggtgatgcagcgtgacatagcagccggcgac 
ttcatcgagcatgccgagttctcggggaacctgtatggcacgagcaaggtggcggtgcag 
gccgtgcaggccatgaaccgcatctgtgtgctggacgtggacctgcagggtgtgcggaac 



131/365 



Figure 1 (Cont'd) 



atcaaggccaccgatctgcggcccatctacatctctgtgcagccgccttcactgcacgtg 
ctggagcagcggctgcggcagcgcaacactgaaaccgaggagagcctggtgaagcggctg 
gctgctgcccaggccgacatggagagcagcaaggagcccggcctgtttgatgtggtcatc 
attaacgacagcctggaccaggcctacgcagagctgaaggaggcgctctctgaggtgggc 
ccatccttgtgcctacctgggcaaggcccaaggggaggcctgggggccaggcctttgttg 
tccatgaggccactgaggaaatcaagaaagctcaaaggaccggcgcctgaggcttgctgt 
ctgttctcggcaccccgggcccatacaggaccagggcagcagcattgagccacccccttg 
gcaggcgatacggcagctctgtgcccttggccagcatgtggagtggaggagatgctgccc 
ctgtggttggaacatcctggggtgacccccgacccagcctcgctgggctgtcccctgtcc 
ctatctctcactctggacccagggctgacatcctaataaaataactgttggattagaaac 
tccaaa 

SEQ ID NO: 232 . >T08090_T52 # TY Transcript # LN 2230 # Source Gene: T08090 

# Encoded protein: T08090_P23 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggcttgctctgctctttacctggggttgctgtcgaggaccctgtgcaag 
actcggccggtttttctttctccctgatggacagacccaaacatagccgcgcagcatcgt 
gaagggctggggccttcactcctctgtggctctggaagagcccgatttcctcaggagggt 
aaaggaatgttcctgtcccccccggggaaagaaagtgaaaatgaaagcgccctggctgct 
cctggtgtggtcccatttaaaacgaggttgctgaaggccaggatgcgtgctacccggatg 
tgtgcccagaaaccctgcagccaagagcagccgcctggaccaccagcctccccttcatcc 
acctggcccagcccagagcccaggcccacgtctggtggtttgtgtcttacgagaggccag 
gcgggggctgaattctctcggttctcagaaggtcaagccctctgttagcatctcctcgaa 
cagtgggccgggggcaaggtggtgactctgttcagtccttaggatggctggaagccagaa 
ggaggagattatgccgccacagcagggagtgcctttccaggaaagcggctgctggggcag 
catgggagcccagtccagggagcgttctggcagagaccattccggaacatggatctgcgc 
cggcggctcccaggagaccttccatctcggagctcctgtggagaccacgtgcctggcagg 
catgtcgggccccaggcctgtggtgctgagcgggccttcgggagctgggaagagcaccct 
gctgaagaggctgctccaggagcacagcggcatctttggcttcagcgtgtcccgtgagtc 
cagggctctcgtggagggataccacgaggaacccgaggcccggcgaggagaacggcaaag 
attactactttgtaaccagggaggtgatgcagcgtgacatagcagccggcgacttcatcg 
agcatgccgagttctcggggaacctgtatggcacgagcaaggtggcggtgcaggccgtgc 
aggccatgaaccgcatctgtgtgctggacgtggacctgcagggtgtgcggaacatcaagg 
ccaccgatctgcggcccatctacatctctgtgcagccgccttcactgcacgtgctggagc 
agcggctgcggcagcgcaacactgaaaccgaggagagcctggtgaagcggctggctgctg 
cccaggccgacatggagagcagaaatcaagaaagctcaaaggaccggcgcctgaggcttg 
ctgtctgttctcggcaccccgggcccatacaggaccagggcagcagcattgagccacccc 
cttggcaggcgatacggcagctctgtgcccttggccagcatgtggagtggaggagatgct 
gcccctgtggttggaacatcctggggtgacccccgacccagcctcgctgggctgtcccct 
gtccctatctctcactctggacccagggctgacatcctaataaaataactgttggattag 
aaactccaaa 

SEQ ID NO: 233. >T08090_T53 # TY Transcript # LN 655 # Source Gene: T08090 

# Encoded protein: 

tgtgtgtgtagaggatagtgtagcccctaaccagagtcctgatgggtgctggtgtccaga 
cccaagttctggggctccagagagagcaggagtggtgcctgaggactgaggcccaggggg 
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cggcccttccctaccctgcacaggcccggctgggctggaaagctgtcccacagccgcagt 
gaggacagccgcaggccagtgggctgctctgggggtcgtgtgggacctggggtggggctg 
catgggctcactgtgccctgaccccaggccccacccacaggcaaggagcccggcctgttt 
gatgtggtcatcattaacgacagcctggaccaggcctacgcagagctgaaggaggcgctc 
tctgaggaaatcaagaaagctcaaaggaccggcgcctgaggcttgctgtctgttctcggc 
accccgggcccatacaggaccagggcagcagcattgagccacccccttggcaggcgatac 
ggcagctctgtgcccttggccagcatgtggagtggaggagatgctgcccctgtggttgga 
acatcctggggtgacccccgacccagcctcgctgggctgtcccctgtccctatctctcac 
tctggacccagggctgacatcctaataaaataactgttggattagaaactccaaa 

SEQ ID NO: 234. >T08090_T54 # TY Transcript # LN 1815 # Source Gene: T08090 

# Encoded protein: T08090_P13 

cacaaaaaagtttttaaattttttggtacagaaggggtttccctatgatccccaggctgg 
tctccaattcctggcctcaagcggtcctcccggctcggactcccaaagagctgggatcac 
agacgtgaaccacgcgcccggccacattccccaattaatggtttttggagagtggccggt 
ttggaacgggctagcggagtcgttcgttagttaggtccctgcagtgtagatggaggacgg 
aaacggcgctgtgacataagcgggaggaggcgggctgaggcagcgctgtgacgtagcaga 
tgaaagccgggcccgcgctgcactgaggtacttctctgagtgacaagcgggacaagtacg 
ccgtgggcggggccggcgggtcgtgatgttagttcgcgctgtgtggaggcgggccggcgt 
ggcgctgtgacgcagcgggagaagacgggctgggagggcgctgtgctgaacgcccctgcg 
gtcgcgacaggccagcgcaggtggaggcggggctagcgaggcactgaggtaagtacttcc 
ctaaggggcggggaagaacgagtgagacatgggcggggcagtcgcctcatgacgtcagtc 
tcgcgcgcgggcggaggtgggccggtgcggcgctgtcacgtaggttcagtgggcggaaga 
ggtggccccggatgctgcggcgcccgctggccgggctggctgcggccgccctgggccggg 
ccccaccggacggcttgctctgctctttacctggggttgctgtcgaggaccctgtgcaag 
actcggccggtttttctttctccctgatggacagacccaaacatagccgcgcagcatcgt 
gaagggctggggccttcactcctctgtggctctggaagagcccgatttcctcaggaggca 
tgtcgggccccaggcctgtggtgctgagcgggccttcgggagctgggaagagcaccctgc 
tgaagaggctgctccaggagcacagcggcatctttggcttcagcgtgtcccataccacga 
ggaacccgaggcccggcgaggagaacggcaaagattactactttgtaaccagggaggtga 
tgcagcgtgacatagcagccggcgacttcatcgagcatgccgagttctcggggaacctgt 
atggcacgagcaaggtggcggtgcaggccgtgcaggccatgaaccgcatctgtgtgctgg 
acgtggacctgcagggtgtgcggaacatcaaggccaccgatctgcggcccatctacatct 
ctgtgcagccgccttcactgcacgtgctggtgtgtgctgggcagggttgggggctggggg 
ccagggcatgccaggctctgattgccaccccctttttaggagcagcggctgcggcagcgc 
aacactgaaaccgaggagagcctggtgaagcggctggctgctgcccaggccgacatggag 
agcagtgagtgtgccgtgggatcaccagggaatgccaggaggggagtcagggttctgagg 
tctgtggcaccagggaccctgtgggtccccagacctcctgacacctggagtccctgtgag 
ggtcctcagacctctcaactacctcccaacacctagagtccccgtgagggtccccagaac 
ccacccccagtcaccaagggtctcattgagggtcctcagatttccctctgttacccagag 
tctccgtgagggcccccagaccccccatcgcccagggtcccatgagatgtccccaacctt 
ctagccccgggggtgtcatgtgcatcctcttacagctgttgcctcttctctgggtctgac 
tgcagcccacaagaa 

SEQ ID NO: 235. >T08090_T55 # TY Transcript # LN 989 # Source Gene: T08090 

# Encoded protein: T08090_P24 

gtccggaattctccggatctggtcctgcgtaccctgacctcccgtccttccctttctaat 
tgagaggcccgcctcttcctactatacctcttcctgtgtgcctcaactgaggcctcgtgc 
ctgaccattctaggctactcctctcccttgcctgggcctgtgtgaacccgtctcgctgga 
ttttagagccttgggacggcgtgcatagacatatatcccttcagagatacgtacaagctc 
ctgcagggtgtgcggaacatcaaggccaccgatctgcggcccatctacatctctgtgcag 
ccgccttcactgcacgtgctggagcagcggctgcggcagcgcaacactgaaaccgaggag 
agcctggtgaagcggctggctgctgcccaggccgacatggagagcagcaaggagcccggc 
ctgtttgatgtggtcatcattaacgacagcctggaccaggcctacgcagagctgaaggag 
gcgctctctgaggtgggcccatccttgtgcctacctgggcaaggcccaaggggaggcctg 
ggggccaggcctttgttgtccatgaggccactgaggttagatgggacagtcctacccaag 
cactggcatgagacaccgaggtccacggtggagggagagcaggaagcccagcccttcctg 
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gataccagccctcccaactccctttcttcctcactggcaggaaatcaagaaagctcaaag 
gaccggcgcctgaggcttgctgtctgttctcggcaccccgggcccatacaggaccagggc 
agcagcattgagccacccccttggcaggcgatacggcagctctgtgcccttggccagcat 
gtggagtggaggagatgctgcccctgtggttggaacatcctggggtgacccccgacccag 
cctcgctgggctgtcccctgtccctatctctcactctggacccagggctgacatcctaat 
aaaataactgttggattagaaactccaaa 

SEQ ID NO: 236. >T08090_P25 # TY Protein « CC #LN 197 # Source Gene: 

T08090 # Encoding Transcript: 1 

MSGPRPWLSGPSGAGKSTLLKRLLQEHSGIFGFSVSHTTRNPRPGEENGKDYYFVTREV 
MQRDIAAGDFIEHAEFSGNLYGTSKVAVQAVQAMNRICVLDVDLQGVRNIKATDLRPIYI 
SVQPPSLHVLEQRLRQRNTETEESLVKRLAAAQADMESSKEPGLFDWIINDSLDQAYAE 
LKEALSEEIKKAQRTGA 

SEQ ID NO: 237. >T08090_P8 # TY Protein # CC #LN 79 # Source Gene: T08090 

# Encoding Transcript: 24 

MQRDIAAGDFIEHAEFSGNLYGTSKVAVQAVQAIEPHLCAGRGPAGCAEHQGHRSAAHLH 
LCAAAFTARAGAAAAAAQH 

SEQ ID NO: 238 . >T08090_P9 # TY Protein # CC #LN 166 # Source Gene: T08090 

# Encoding Transcript : 28 

QAILLPLIIGVILLGLLAVALIAFCIVLDTPIRLTKPSEHMLQDPLHYKGAPEEHSGIFG 
FSVSHTTRNPRPGEENGKDYYFVTREVMQRDIAAGDFIEHAEFSGNLYGTSKVAVQAVQA 
I EPHLCAGRGPAGCAEHQGHRSAAHLH LCAAAFTARAGAAAAAAQH 

SEQ ID NO:239. >T08090_P10 # TY Protein # CC #LN 96 # Source Gene: T08090 

# Encoding Transcript: 32 

MAPPLPQINYYFVTREVMQRDIAAGDFIEHAEFSGNLYGTSKVAVQAVQAIEPHLCAGRG 
PAGCAEHQGHRSAAHLHLCAAAFTARAGAAAAAAQH 

SEQ ID NO:240. >T08090_P11 # TY Protein # CC #LN 83 # Source Gene: T08090 

# Encoding Transcript: 35 

ARAARARGDSARDFIEHAEFSGNLYGTSKVAVQAVQAIEPHLCAGRGPAGCAEHQGHRSA 
AHLHLCAAAFTARAGAAAAAAQH 

SEQ ID NO: 241. >T08090_P13 # TY Protein > CC #LN 169 * Source Gene: 

T08090 # Encoding Transcript: 37 

MSGPRPWLSGPSGAGKSTLLKRLLQEHSGIFGFSVSHTTRNPRPGEENGKDYYFVTREV 
MQRDIAAGDFIEHAEFSGNLYGTSKVAVQAVQAIEPHLCAGRGPAGCAEHQGHRSAAHLH 
LCAAAFTARAGVCWAGLGAGGQGMPGSDCHPLFRSSGCGSATLKPRRAW 

SEQ ID NO: 242. >T08090_P15 # TY Protein f CC #LN 214 # Source Gene: 

T08090 # Encoding Transcript: 39 

MPSSRGTCMARARWRCRPCRPLNRICVLDVDLQGVRNIKATDLRPIYISVQPPSLHVLEQ 
RLRQRNTETEESLVKRLAAAQADMESSKEPGLFDVVIINDSLDQAYAELKEALSEVGPSL 
CLPGQGPRGGLGARPLLSMRPLRLDGTVLPKHWHETPRSTVEGEQEAQPFLDTSPPNSLS 
SSLAGNQESSKDRAPEACCLFSAPRAHTGPGQQH 

SEQ ID NO:243. >T08090_P14 # TY Protein # CC #LN 160 « Source Gene: 

T08090 # Encoding Transcript: 40 

MLRRPLAGLAAAALGRAPPDGMSGPRPWLSGPSGAGKSTLLKRLLQEHSGIFGFSVSHT 
TRNPRPGEENGKDYYFVTREVMQRDIAAGDFIEHAEFSGNLYGTSKVAVQAVQAIEPHLC 
AGRGPAGCAEHQGHRSAAHLH LCAAAFTARAGAAAAAAQH 

SEQ ID NO: 244. >T08090_P16 # TY Protein « CC #LN 200 # Source Gene: 

T08090 # Encoding Transcript: 42 

PTPPPPPPPHLTPHTPAPPPPPGHRSPGVAQFAHQKRWPRMLRRPLAGLAAAALGRAPPD 
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GMSGPRPWLSGPSGAGKSTLLKRLLQEHSGIFGFSVSHTTRKPRPGEENGKDYYFVTRE 
VMQRDIAAGDFIEHAEFSGNLYGTSKVAVQAVQAIEPHLCAGRGPAGCAEHQGHRSAAHL 
HLCAAAFTARAGAAAAAAQH 

SEQ ID NO: 24 5. >T08090_P17 # TY Protein # CC #LN 120 # Source Gene: 

T08090 # Encoding Transcript: 43 

MLRRPLAGLAAAALGRAPPDGMSGPRPWLSGPSGAGKSTLLKRLLQEHSGIFGFSVSHT 
TRNPRPGEENGKDYYFVTREVMQRDIAAGDFIEHAEFSGNLYGTRSSGCGSATLKPRRAW 

SEQ ID NO: 24 6. >T08090_P18 # TY Protein # CC #LN 142 # Source Gene: 

T08090 # Encoding Transcript: 44 

LFPVGGAGRGGSRFEMLSGAGSRPGLGSAVGGERWAREGLGVPEAPVGLQRALAPLLTWH 
LSPARWRCRPCRPLNRICVLDVDLQGVRNIKATDLRPIYISVQPPSLHVLVCAGQGWGLG 
ARACQALI AT P FLGAAAAAAQH 

SEQ ID NO: 247. >T08090_P19 # TY Protein # CC #LN 163 # Source Gene: 

T08090 # Encoding Transcript: 45 

MSGPRPWLSGPSGAGKSTLLKRLLQEHSGIFGFSVSHTTRNPRPGEENGKDYYFVTREV 
MQRDIAAGDFIEHAEFSGNLYGTSKVAVQAVQAIEPHLCAGRGPAGCAEHQGHRSAAHLH 
LCAAAFTARAGWLLPRPTWRAARSPACLMWSSLTTAWTRPTQS 

SEQ ID NO: 248. >T0809(TP20 # TY Protein # CC #LN 161 # Source Gene: 

T08090 # Encoding Transcript: 4 7 

MHGSKWLDSWVGWGSQSRLPEGLEVTPPGGDGSSQVGCHQSRQHCLLGTPGEADRVSGFQ 
LGGSPAENPVKPAFHSSPVCPEPGLRAALCMPALPTWQSSGGRTPRLSGDSQANVSIPGS 
ALSCHEFRKAEAAQQMGTRGRTAIHSLSSHPQACRAPGLWC 

SEQ ID NO: 249. >T08090_P21 # TY Protein # CC #LN 261 # Source Gene: 

T08090 # Encoding Transcript: 48 

MTSVSRAGGGGPVRRCHVGSVGGRGGPGCCGARWPGWLRPPWAGPHRTLGWLEARRRRLC 
RHSRECLSRKAAAGAAWEPSPGSVLAETIPEHGSAPAAPRRPSISELLWRPRAWQPLFSP 
PGMSGPRPWLSGPSGAGKSTLLKRLLQEHSGIFGFSVSHTTRNPRPGEENGKDYYFVTR 
EVMQRDIAAGDFIEHAEFSGNLYGTSKVAVQAVQAIEPHLCAGRGPAGCAEHQGHRSAAH 
LHLCAAAFTARAGAAAAAAQH 

SEQ ID NO: 250. >T08090_P7 # TY Protein # CC #LN 202 # Source Gene: T08090 

# Encoding Transcript: 49 

MPSSRGTCMARARWRCRPCRPLNRICVLDVDLQGVRNIKATDLRPIYISVQPPSLHVLEQ 
RLRQRNTETEESLVKRLAAAQADMESSKEPGLFDWIINDSLDQAYAELKEALSEEIKKA 
QRTVRLRLAVCSRHPGPIQDQGSSIEPPPLAGDTAALCPLASMWSGGDAAPVVGTSWGDP 
LNPASLGCPLSLSLTLDPGLTS 

SEQ ID N0:251. >T08090_P1 # TY Protein # CC #LN 139 # Source Gene: T08090 

# Encoding Transcript: 50 

MSGPRPWLSGPSGAGKSTLLKRLLQEHSGIFGFSVSHTTRNPRPGEENGKDYYFVTREV 
MQRDIAAGDFIEHAEFSGNLYGTSKVAVQAVQAIEPHLCAGRGPAGCAEHQGHRSAAHLH 
LCAAAFTARAGAAAAAAQH 

SEQ ID NO: 252. >T08090_P22 # TY Protein # CC #LN 194 # Source Gene: 

T08090 # Encoding Transcript: 51 

MTSVSRAGGGGPVRRCHVGSVGGRGGPGCCGARWPGWLRPPWAGPHRTPLFSPPGMSGPR 
PWLSGPSGAGKSTLLKRLLQEHSGIFGFSVSHTTRNPRPGEENGKDYYFVTREVMQRDI 
AAGDFIEHAEFSGNLYGTSKVAVQAVQAIEPHLCAGRGPAGCAEHQGHRSAAHLHLCAAA 
FTARAGAAAAAAQH 

SEQ ID NO: 253 . >T08090_P23 # TY Protein # CC #LN 137 # Source Gene: 

T08090 # Encoding Transcript: 52 
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MAGSQKEEIMPPQQGVPFQESGCWGSMGAQSRERSGRDHSGTWICAGGSQETFHLGAPVE 
TTCLAGMSGPRPWLSGPSGAGKSTLLKRLLQEHSGIFGFSVSRESRALVEGYHEEPEAR 
RGERQRLLLCNQGGDAA 

SEQ ID NO: 254. >T08090_P24 « TY Protein # CC #LN 193 # Source Gene: 

T08090 # Encoding Transcript: 55 

MYPFRDTYKLLQGVRNIKATDLRPIYISVQPPSLHVLEQRLRQRNTETEESLVKRLAAAQ 
ADMESSKEPGLFDVVIINDSLDQAYAELKEALSEVGPSLCLPGQGPRGGLGARPLLSMRP 
LRLDGTVLPKHWHETPRSTVEGEQEAQPFLDTSPPNSLSSSLAGNQESSKDRAPEACCLF 
SAPRAHTGPGQQH 



SEQ ID NO:255. >T11445 # TY Consensus # Length 12715 # Number of exons 41 

ttccccctccggcgcacgtcgccgcttactcacccccagacgatggtaccggtccggaat 

tctgcagatagggccagcgcaccggtcccttgncccagccgagcccgcccgcgcgccggg 

yasgagcaatgctaggcgatggggaatccgaagacgcacaggagtccaagggttggctga 

ctgcagaccctcatcataagtctagagccggagaagatggggagacgccggcctgcgccc 

tgggcgcgcgggcccgcgcaggcggtggcggcggcggcggaaccgagctgacgggcgtgc 

ggccgctgcgccgcaaactcgtgtgggacgcaccgctccagccgcccgcgggccagcgca 

ccggtcccccagcggcagccgagcccgcccgcgcgccggtgagtcgcctggggctggggc 

cgggccggtggggccccgcggcccggagaagggcgggggccggccgcgcggggctgagcg 

tgtcggcggccgggcgggcgggggcggcgtgtccgcgtgtcccggggtgtgcgcgtcccc 

ggcgggtgtgcgtgtgcaccccggagcgtctggtggcgttgttggggtgagtgcgtgcgg 

ggacgcgggcatggctccccgctacccacttcccgagtggggacgacgggcggaggagtg 

tgccgggcgccgcagggagtacgtgcgggtctcagggagtgtgtgtgtgcacgtgtgtgc 

attccaaagtgatttgtgtgcaagcacatgcgtgagcctagaagggcgaaagtrtgcctg 

tgtgggtgtgagtataacgagtgtgcagggccgtgcgagcgtgtttattactgaggttga 

ggggccgccccctttgcgcccaccaggcctttgcacagtgatggccccgagccagggagc 

cgtggagagagaagcggcctggcatgggctgggggtcggggctgggcctgatccggtgaa 

ccaggggccagragggcagactcagagcctggctgggcctgacagaggcaaaggtcttag 

agccaggctgcggcagggccactggcaggaccctctctgcatacacaggctgggcgagcg 

gaggctgtggtgggcccctcggccgggcacccgaggacgtgtgttttcctaaatatcctc 

caggctctcctgtctttgttcttgtctctctccagccaggcctggggggctgaccctaga 

cttgcctgtgtatgtgggggcctctctgcctgccctgctaagtagacggatcttttgatt 

tatttatacctctcctaggctccaagtgtgaatcccaggctggagtacagtggcgcaatc 

tcagctycctgcaccctctgcctcctgggttcaagtgagcctcctgcctcagcctcctcc 

tgagtagctgggattataggcaggtgccaccacgcccagataatttttgtatttttagta 

gagatggggtttcactgtgttggccagccaggctggtctcgaactcctgacctcaggtga 

tccgcccgccttggcctccaaagtgctggaattacaggcatgagccaccatgcccggcca 

tgtccccatttactgaggatatactgtgtgctaggtctggtgctgtattttacccagtcc 

actcaccaacactaagatggaagctgtcacgatgcccatttacagatgaggaaactgggg 

cacagagagatgagagaatgcgccaaagaccatgtgctaatacgcagcagagctggattt 

gaacccaggcttgtctgagcagrgcccatggtctccccactgtactaggaaggtagcagg 

cactgtgggctggatgcctggcctgtgctgaggttcagtgtcrttttcctgatcttacag 

agagggaggttaagtaacttgggcagggtcataaacccacaaatggcagagtcgggattc 

gaacccatgtctctgattccagagttcctcaccttaaaccacatctggctgcctttgtcg 

tgctttgctcctgttgcatgctgcaaagccaagtaggttaagacaggctaggttcctgca 

gggctgggttgcccgagacctgggagaggtcttccagaaggcactgctggccttcctcgc 

ccagggccagagctactggagagaaagggctgctgttcacattttttcttttctttaagt 

gccccagatacttttcataagggaagtgtcaaaacaattgtcaaaaggagaagaatattt 

ccttcctcttctcaggctagaccaagcaagggagttggagggtggagagtggagggacct 

ggtgaatcatccaagattaaatacgggcaggcactgccaggcttgtgtggagatgaggct 
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gggacactgttcgcaggaagagtccggtgttctggccccctgatgtcaccttcacgggcc 
tgactcacagtcctaaatatctgacagcgagatcgcttgtaggtaggcttcttgtcactt 
gtcatttaagtcccctgcctgcctttttttccctgctcatttccatttcgatcacggctt 
ctactacatgtgcttgaaatgaatgttgtgtgtgaggacctgtgtggtttgggccatgtt 
gtgcccccctgccagtgtccttcggtgttgaaaggaaaaggtatttctctcccttccatg 
tatgttctgcaaattgacgaacatttgcaaattgctttgagatgcgttaatttaaggtgt 
gaaataatagatcttgtgcgaaggctcagccacatttctggcattgatggatttttctcc 
cgtctggcctgtgatttggggaggtaaaaatagtccttttttccacatttttatcttttt 
ttttttttgctgacatcattcttaggaagtggtgtgcacatgccgtggaactttacatgg 
aacatggagctctggggagcaggagtgctcttggagtgaggggtggtctgcggtggccca 
ggcaccccctggctgtccgtgaaatggctttgaaatgagcaggcactgttgagggcctca 
gtggctttttggtatttggctgtcttttcacttatgactcttgggtaatggcaagataac 
ttttggggaaaatgttagtttaaaaagaaaaaaaatcaggccagcactttcactcattcc 
tgtctgagctggtcccttcctttggccaaagctgaactaaaccaatggccttctcagact 
cgacttccagacagaggagccgacattggggcaaatcagttggtggtgtaatttcctgtg 
atgttaactttgggaacttgaacttggagtgggagctagcattttgacatgtgtccctct 
aacaggacttgcttgtgtggaaacagacccccacatacatggctgcgccattgtctccaa 
ttctggatttttccacagatgacttctcgggtgatctggtggtaaaggcacaggtcttct 
agccaggccttcttggcctccagcaagggcaggatcacagcaccccaccagacctgactg 
gccaccaccgctgtagctgtggctgggcgagtgggggggccatgggaattgaaagatggg 
gtggccgagatctgaaatgcattcagttgcagtggcctggatctgcacggagccgttctt 
tctttctcttagacatctgcrtggtggtgacctcatagaattctaagaaaaaaggggctt 
tatgtaggaattccatagagtgtgaaggaatttggacaggggaagggggagggaagttgc 
cattcagagcctgcagtgcctgcattttccccgaattgtttaaccctcatgcttcagaat 
taggctgaggcttgcggggtgggtcatgttgacctgggtgaacagagatccctttaagaa 
gaacttctccatgttccagaggcgcgttcttactgcaggtgagtggcagtatgggaatta 
gtccacaggccccttctcgaatgcctgccctctcttgttccttgtcctcaacgtctttga 
aacttgggcttgttgggaagacacctgcaaaaggatggatgcacatgaccttcagctcta 
atgaatcaagctgctgatgaggaattcactgggctcccaatccagagagcttcgcacaca 
cccactggggttgagacgagcactgggtttatttattgtggacttttggagtctgaagga 
ctcttgccacccatctgtttcacggaacagaacctgaggctcagagggagcaaggcgctc 
ccccacagccgcattgagagcctgagcctgggcacccgatggagtgaatgaggctggagt 
cccagacctgcctctcatgagcacgtcgcccactgagcctcagcgtcatcatctggaaaa 
cagggataatattatgacctcagaagcttcgggggaggaagtaagtgaartaatgcattt 
cagatgctcagcgccttttaagtgcttgatgctcattccccaagattacatgagagacat 
ggaaaatctttaatgaccaaggacccacccagggtcactcagccgggacccttggtccgt 
ggcccagagtgtctccagtgccccwgcattgaggccctaaacaaggccagaagcaggtgc 
cggggacccctctggattccaccagagcaccttcctaggatcatggctcccaaaacggaa 
gggaaggagacagcgcagtttgcaaagaggcaggatttaagcaccagggtggccctgtgg 
cgcctcaggaaaatgtttgcctgtcagtatctgctctcgttcccacctgtccccacaaag 
cgaggccataagtcctcggcgtggcattggagggtctctgaaggccctgagagctgtgtc 
agccacggtgtgttatgaagcaaggcagatgttttgttaatwatttacacagcgtcggcc 
ccgcagaggactgcgctgacaggagcggctgtcacaggcctggccgtggggcagaagtga 
gcagccgtcttcccctggcagtcctcctgaaaaggtctgcatggcaaggcctgagggagt 
cctgcacattttatgcccccgccccccaaagccatttgggtttcccttaaactggcttgt 
tttcctgagccggtggagagatccttgtcctccggaagtggctatcgctctggggcggct 
tctctgccagctcgtcacaccctagacccagctgtagtctgtgtggtgggagagggtgtc 
accaggctctggaggtccactcctctgtagtcacctcatgcaaggagggcttcacagggg 
cccagcctctactccctcatccggaaaacgggccagtaacaccaggcaccagccccgtga 
tcctcaggcacccttgggggtgatctgccttagaaattcaactttaggattagaattctg 
ctaagaggtaccatgtgacaaaaaaggtagtgtaaaaatcacaaagaccaggacaggctc 
atgcctataatcccagcactttgagaggccgaggcgggcagatcacttgaggtcaggagt 
ttgagaccagcccagccaacatggcaaaaccccatcgctactaaaaatacgaaaaattag 
ctgggcgtggtggtggacacctttaatcccagctactcgggagcctgaggagcctgaggc 
gtgagaatcacttgaacctgggagacagaggctgcagtgagctgagattgtgccactgca 
ctccagcctaggcaacagagcgagactctgtatcaaaaatggcgacactgaggcccagag 
agacatcatgaccagcctcaggtcccacatgcagcctgctgctctgtagcctaaatcaga 
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tcactgtcccagctttcaggtcaaggctctggttgtgacacctgggcctggactgcgggt 
gaaaagttttgatgcaaaaacacagaccgtccattgtcctatgggctaagtggactccct 
tctgctaaccagagatgcttgtgctggcccagcccagagaggcgacccagctccctggag 
gcccaggcacctcatgtctgccctcaacccagagtggagcagaccttccctggagcttct 
caggagccacttcaaaaggagaagaagacacatgaattagagacagcacgggggagcagg 
ctgtggagctgggagtgacggggtgagtccaggaaggctgcctggaggagatgcccagga 
gagaagtttgctgggaggcagctcatttccggcaggaggagcagagcctgccaaggcctc 
gagtattttgagagagcaggtttaccccagaacagtctgtggaaggtgtggggcagggag 
gagagctgggccgccctgaggacacagagggagaagggtgctgggagccagcctgcaagt 
tctggggagcctcagggtgcctttgagggcacaacttgggaggccaagctcagcgggccg 
ctgttttctgttgcagttcgtgccctcgtgaggctggcatgcaggatggcaggacagccc 
ggccacatgccccatggagggagttccaacaacctctgccacaccctggggcctgtgcat 
cctcctgacccacagaggcatcccaacacgctgtcttttcgctgctcgctggcggacttc 
cagatcgaaaagaagataggccgaggacagttcagcgaggtgtacaaggccacctgcctg 
ctggacaggaagacagtggctctgaagaaggtgcagatctttgagatgatggacgccaag 
gcgaggcaggactgtgtcaaggagatcggcctcttgaagcaactgaaccacccaaatatc 
atcaagtatttggactcgtttatcgaagacaapgagctgaacattgtgctggagttggct 
gacgcaggggacctctcgcagatgatcaagtactttaagaagcagaagcggctcatcccg 
gagaggacagtatggaagtactttgtgcagctgtgcagcgccgtggagcacatgcattca 
cgccgggtgatgcaccgagacatcaagcctgccaacgtgttcatcacagccacgggcgtc 
gtgaagctcggtgaccttggtctgggccgcttcttcagctctgagaccaccgcagcccac 
tccctagtggggacgccctactacatgtcaccggagaggatccatgagaacggctacaac 
ttcaagtccgacatctggtccttgggctgtctgctgtacgagatggcagccctccagagc 
cccttctatggagataagatgaatctcttctccctgtgccagaagatcgagcagtgtgac 
taccccccactccccggggagcactactccgagaagttacgagaactggtcagcatgtgc 
atctgccctgacccccaccagagacctgacatcggatacgtgcaccaggtggccaagcag 
atgcacatctggatgtccagcacctgagcgtggatgcaccgtgccttatcaaagccagca 
ccactttgccttacttgagtcgtcttctcttcgagtggccacctggtagcctagaacagc 
taagaccacagggttcagcaggttccccaaaaggctgcccagccttacagcagatgctga 
aggcagagcagctgagggaggggcgctggccacatgtcactgatggtcagattccaaagt 
cctttctttatactgttgtggacaatctcagctgggtcaataagggcaggtggttcagcg 
agccacggcagccccctgtatctggattgtaatgtgaatctttagggtaattcctccagt 
gacctgtcaaggcttatgctaacaggagacttgcaggagaccgtgtgatttgtgtagtga 
gcctttgaaaatggttagtaccgggttcagtttagttcttagtatcttttcaatcaagct 
gtgtgcttaatttactctgttgtaaagggataaagtggaaatcatttttttccgtggagt 
ggtgattctgctaacatttttatctacgttttataacttggtgagtgacgatgagagccc 
tgcacctggccagagtgtcacaggcaaaaggcatcgggaagcaggagcatcttcttggca 
gccaggctgggccatcttctcctggacacctgctgtgtaccaggaacttcgtcacctcct 
tgaatgctggcggttcatttcatgatcagtgttaagcattttcganggggggtgggggca 
agtgttttagccgtttggagcctttatttcctcttttggacatgganttttattgaggac 
ctgagttttttaccttcttgttggcggtacatacanattgggaaagannngtggcaaggg 
agcgaagatnatataaggttgagtagccaaaatgnggatgctattgcaccttgcattata 
cggaangncagggacctcngaaggcgagaatnaagtttttttttttttttntggaggatt 
tttganntgtgggccttatgtttttttcccattctgnaggttcggggaagncgatntgtt 
nanggggtactgtagaatattctaccttggnagtttatttttggattttgtgggggggat 
tttaagggatccgtttaaggggctgagcgggctgtatggacctccatgggaaggaagcat 
gggatatagaaaagcgaagggctgtcctttacaaattctggttctgcaacttcctagcgt 
gactttgggcttgggcaagtttcttagccgttctgagccttcatttcctcatctgtacaa 
tgagattaatagtacctatcatctaccttcaggattgctgacagacagaatttgaaataa 
aatatgcaagttagctaatacaaaaagtagatgatccaaaaatggtagccactcaccctt 
cacaaactgaagtccatggaccacggaagtcgagaattaatgtacacctgtatcatgtgt 
aggaaaccagaaatgtgttccttatttcttgttcccaaacaggattaactgtgaagacta 
atttataaatgtgaacctaagaaaactccacctctgaaggaaatcatttgaattttgttt 
ttgtacgtaaagttaaccttccaattgtctgagctgtcgtcactgacttcatgacagtct 
ggccctccagacaagagcagcgctggcatcgggcaggtgattcctgacacctgctgcctg 
caggcattcactgaccaggcctttcctggaggaaacacccagggccgggcggctgctgtt 
tccacacgtggactcggatctgctgtgacaccgtcagcccgacagtctctccatatgcag 
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cctttcctctgtacttttctccatggttgaaataaaacagggtgactgggagttacttag 
aattcatgaagattttaaaatggctttggagattttgcttttaaaccagtagattcaaaa 
cttaaacagcgtctgcagcacaatttcttgaggaaccttgaaaaacacaacttcccaggc 
cccattcagtaatcccaggatttctttaagctccccaaataattttgaaactcatcagcc 
gagtttctgccctcatgaggtaattccatcgttctccccagcctgcccctggcagctgta 
acacaggagctggcctgagagcagattcaccctggaatgttctctccacagaacaatcaa 
gtccctgtcgcctgcctagtgcttaccactgaagatttttctgattccagaccaactttt 
tgccaacattctgcttccagctctctgagcccctgccgtgtctccccaacactgccagcc 
ccagcacgcaatcaacctactttgtgcatgccacccgctttccacactgtgagaacaatc 
tgcccaactggaccctctggaggcgcacgatctcagccactcaccaggcctgagtgtttg 
tgaaatgatcatgtcctacttattacaaaaccgtaaccccaaaacattccttttatttct 
gtgaaaccggccaaagtgaggtccacccaccttcacacagctctggcggtgcacctgctc 
accttctcttggttctcagaactgagctgggcttgagaacacagctttggctttgccatt 
tttttcctacttggctgctgaggtggagggtgtgctgcacttatcaccccatttcaaaac 
caaaccaaacctgaggccaccccaaacaaattcagccagcaaaaagggtaggtatcgatg 
gtcacctgaagcctcaagggagtccactctgacttctgacagcagacagaacctatctgt 
cctcaggaggggcgcatggctaacctacgctctcctctgagcctaggccagtgctcaagg 
aggatcagcaccctgaagggccaggccaatggccttagcagccccctactgcccccaaaa 
acaccacatcaaaacagataagaaaatcattgtttataatagttatcgaaggtcgaggtt 
tttttggtttaattttgcatttacttaaaactctgtgtttgatgccaacagggttcaagt 
tgcacaggaaggtgcttcggctaaacacggatagtcttccttgatctccactgtctgctg 
tgtgggttccgccagcaaacctcccagcgagtggtcaccttcctatggagtcattaaacc 
tctgaatgtcagctggtgggattagccagcagcttccacacctgagcccccgtggcaggg 
acagaggcatttggatcaagatcactaatgaatacagaattcatatgccgcttgtcaggg 
ctcctgagctcagagtcgcacagatgcaacctaaataagacggaggacagaacaatcgtt 
gacaaagattaaaaagttttggcagttgggaaaaaaagagcacctcattattcaaatatg 
atagtcttgcttatttttttttatttttgttcaagttattcatgattcacctcaaatgga 
caaatgccattcaccaatatttaaagacttcttggtctcattactaaatctgaagagggc 
tgtgctgtctgtacatcagctatatggagaaaacagctctcctgtaaaacagctctcttg 
taaataatatttgaaaattttaagggaatataacaatcacaagctatttagggtggaggg 
cagtttaaaaatgaaaaaaaaaatctagaattaaaagaaaaaaatgaccttccacagcca 
agtgattttattacatttactaacatctctatctctgacttccttaaacttttctgttcc 
tgggttccctccaatcacagagctgccaaagagctttacacaccttgtctgacacttgct 
tggctttctgccgtcttcacaagacctcatcagagaaaagaaacagcccaagcagcccac 
acctctgctcacctggaataagatgtctttcccagtcttgttaaagaattacaggaataa 
gaaagtgttaaagaattaaaggaataagatgtctttccctccctgttgtgaaagaatgac 
aaagaggcagatatagcccagtggctgaacccatcggagggctctaacgtcaggctgacc 
agggttcagactctgagtcttccactgagtgtgtgacctctggccagttatttgacctcc 
ctacggcccaattttcttctctacaaaatggagacaacaaagtactgacctcaaggaatt 
gctgaaaagtaaatgaaataatataccactgtctatgaccaccacatatgcctgtctcca 
atattacgctttcagataagaaggcctgtatctatctatatcatgctataaattcaccaa 
gcctgatgccagcacaaataatcacatggggacgccctactacatgtcaccggaaaggaa 
ccttgggaaacggcttaacacttccaggccgaaatctggctccttggggctggcctgttg 
aacgaaaatgggaaacctcccagaaaacccttcttatggagaaaagaataagatctcttt 
ctcccccgcgcgcaaaaagatccaaacagcggggagccacccgccccctccccagaggaa 
gaccctgagtataaaagacttacccagaaactggttaccaacgggaactttctcccctcg 
gcacccccgcaccaggagggagcccttggaagagaggatcccatgaaaacacggctacca 
attttccatatccggaccccctgcgctccttcggcctgctcctgccttcatacagaatgg 
cccgcaacctccccggaccccgccctttcctcccggacgaacccgattgcacaccttcct 
ttcccctccatgtctggcctaaacttatcttcaaatacagccattgctgccaaggagcac 
aacgaaaacacaacaacagagactcaaggggcatcaacagcgcggggacgccgagacaac 
cagacgccacgggacgggcaacacggggcaaaacgcggcacccggacccagcattcccgc 
ggcccacaggggaaaaatgcgcaggagccccaaatagggaacccgcacaaactcgggaaa 
gcacgggtacggaaggacaagacgcgacgacactnaaaaaaaaaaaaaaaaaattcgggg 
cctctttttcgggggccgccacacggggtttggaagggccaggccctagcggttccggcc 
cactttttggccaaggggatctcgccggcccctaaagggcgataattcgaagcgtcgaag 
cttggcyggstgrwaccaarratggcaasscccycccaraarcccccatccagatggaaa 
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aaaaagagaaggatccggttttcaccccctgggaccagaaaaaagaacctatagccaatc 
acgaatttgagtaaggcaaacctataggccagcggggaagaaagatatagattac 

SEQ ID NO:256. >T11445_T1 # TY Transcript # LN 3629 # Source Gene: T11445 
# Encoded protein: T114 4 5_P13 

gggcgcgcgggcccgcgcaggcggtggcggcggcggcggaaccgagctgacgggcgtgcg 
gccgctgcgccgcaaactcgtgtgggacgcaccgctccagccgcccgcgggccagcgcac 
cggtcccccagcggcagccgagcccgcccgcgcgccgttcgtgccctcgtgaggctggca 
tgcaggatggcaggacagcccggccacatgccccatggagggagttccaacaacctctgc 
cacaccctggggcctgtgcatcctcctgacccacagaggcatcccaacacgctgtctttt 
cgctgctcgctggcggacttccagatcgaaaagaagataggccgaggacagttcagcgag 
gtgtacaaggccacctgcctgctggacaggaagacagtggctctgaagaaggtgcagatc 
tttgagatgatggacgccaaggcgaggcaggactgtgtcaaggagatcggcctcttgaag 
caactgaaccacccaaatatcatcaagtatttggactcgtttatcgaagacaacgagctg 
aacattgtgctggagttggctgacgcaggggacctctcgcagatgatcaagtactttaag 
aagcagaagcggctcatcccggagaggacagtatggaagtactttgtgcagctgtgcagc 
gccgtggagcacatgcattcacgccgggtgatgcaccgagacatcaagcctgccaacgtg 
ttcatcacagccacgggcgtcgtgaagctcggtgaccttggtctgggccgcttcttcagc 
tctgagaccaccgcagcccactccctagtggggacgccctactacatgtcaccggagagg 
atccatgagaacggctacaacttcaagtccgacatctggtccttgggctgtctgctgtac 
gagatggcagccctccagagccccttctatggagataagatgaatctcttctccctgtgc 
cagaagatcgagcagtgtgactaccccccactccccggggagcactactccgagaagtta 
cgagaactggtcagcatgtgcatctgccctgacccccaccagagacctgacatcggatac 
gtgcaccaggtggccaagcagatgcacatctggatgtccagcacctgagcgtggatgcac 
cgtgccttatcaaagccagcaccactttgccttacttgagtcgtcttctcttcgagtggc 
cacctggtagcctagaacagctaagaccacagggttcagcaggttccccaaaaggctgcc 
cagccttacagcagatgctgaaggcagagcagctgagggaggggcgctggccacatgtca 
ctgatggtcagattccaaagtcctttctttatactgttgtggacaatctcagctgggtca 
ataagggcaggtggttcagcgagccacggcagccccctgtatctggattgtaatgtgaat 
ctttagggtaattcctccagtgacctgtcaaggcttatgctaacaggagacttgcaggag 
accgtgtgatttgtgtagtgagcctttgaaaatggttagtaccgggttcagtttagttct 
tagtatcttttcaatcaagctgtgtgcttaatttactctgttgtaaagggataaagtgga 
aatcatttttttccgtggagtggtgattctgctaacatttttatctacgttttataactt 
ggtgagtgacgatgagagccctgcacctggccagagtgtcacaggcaaaaggcatcggga 
agcaggagcatcttcttggcagccaggctgggccatcttctcctggacacctgctgtgta 
ccaggaacttcgtcacctccttgaatgctggcggttcatttcatgatcagtgttaagcat 
tttcctccatgggaaggaagcatgggatatagaaaagcgaagggctgtcctttacaaatt 
ctggttctgcaacttcctagcgtgactttgggcttgggcaagtttcttagccgttctgag 
ccttcatttcctcatctgtacaatgagattaatagtacctatcatctaccttcaggattg 
ctgacagacagaatttgaaataaaatatgcaagttagctaatacaaaaagtagatgatcc 
aaaaatggtagccactcacccttcacaaactgaagtccatggaccacggaagtcgagaat 
taatgtacacctgtatcatgtgtaggaaaccagaaatgtgttccttatttcttgttccca 
aacaggattaactgtgaagactaatttataaatgtgaacctaagaaaactccacctctga 
aggaaatcatttgaattttgtttttgtacgtaaagttaaccttccaattgtctgagctgt 
cgtcactgacttcatgacagtctggccctccagacaagagcagcgctggcatcgggcagg 
tgattcctgacacctgctgcctgcaggcattcactgaccaggcctttcctggaggaaaca 
cccagggccgggcggctgctgtttccacacgtggactcggatctgctgtgacaccgtcag 
cccgacagtctctccatatgcagcctttcctctgtacttttctccatggttgaaataaaa 
cagggtgactgggagttacttagaattcatgaagattttaaaatggctttggagattttg 
cttttaaaccagtagattcaaaacttaaacagcgtctgcagcacaatttcttgaggaacc 
ttgaaaaacacaacttcccaggccccattcagtaatcccaggatttctttaagctcccca 
aataattttgaaactcatcagccgagtttctgccctcatgaggtaattccatcgttctcc 
ccagcctgcccctggcagctgtaacacaggagctggcctgagagcagattcaccctggaa 
tgttctctccacagaacaatcaagtccctgtcgcctgcctagtgcttaccactgaagatt 
tttctgattccagaccaactttttgccaacattctgcttccagctctctgagcccctgcc 
gtgtctccccaacactgccagccccagcacgcaatcaacctactttgtgcatgccacccg 
ctttccacactgtgagaacaatctgcccaactggaccctctggaggcgcacgatctcagc 
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cactcaccaggcctgagtgtttgtgaaatgatcatgtcctacttattacaaaaccgtaac 
cccaaaacattccttttatttctgtgaaaccggccaaagtgaggtccacccaccttcaca 
cagctctggcggtgcacctgctcaccttctcttggttctcagaactgagctgggcttgag 
aacacagctttggctttgccatttttttcctacttggctgctgaggtggagggtgtgctg 
cacttatcaccccatttcaaaaccaaaccaaacctgaggccaccccaaacaaattcagcc 
agcaaaaagggtaggtatcgatggtcacctgaagcctcaagggagtccactctgacttct 
gacagcagacagaacctatctgtcctcaggaggggcgcatggctaacctacgctctcctc 
tgagcctaggccagtgctcaaggaggatcagcaccctgaagggccaggccaatggcctta 
gcagccccctactgcccccaaaaacacca 

SEQ ID NO:257. >T11445_T2 # TY Transcript # LN 2594 # Source Gene: T11445 
# Encoded protein: T11445_P13 

gggcgcgcgggcccgcgcaggcggtggcggcggcggcggaaccgagctgacgggcgtgcg 
gccgctgcgccgcaaactcgtgtgggacgcaccgctccagccgcccgcgggccagcgcac 
cggtcccccagcggcagccgagcccgcccgcgcgccgttcgtgccctcgtgaggctggca 
tgcaggatggcaggacagcccggccacatgccccatggagggagttccaacaacctctgc 
cacaccctggggcctgtgcatcctcctgacccacagaggcatcccaacacgctgtctttt 
cgctgctcgctggcggacttccagatcgaaaagaagataggccgaggacagttcagcgag 
gtgtacaaggccacctgcctgctggacaggaagacagtggctctgaagaaggtgcagatc 
tttgagatgatggacgccaaggcgaggcaggactgtgtcaaggagatcggcctcttgaag 
caactgaaccacccaaatatcatcaagtatttggactcgtttatcgaagacaacgagctg 
aacattgtgctggagttggctgacgcaggggacctctcgcagatgatcaagtactttaag 
aagcagaagcggctcatcccggagaggacagtatggaagtactttgtgcagctgtgcagc 
gccgtggagcacatgcattcacgccgggtgatgcaccgagacatcaagcctgccaacgtg 
ttcatcacagccacgggcgtcgtgaagctcggtgaccttggtctgggccgcttcttcagc 
tctgagaccaccgcagcccactccctagtggggacgccctactacatgtcaccggagagg 
atccatgagaacggctacaacttcaagtccgacatctggtccttgggctgtctgctgtac 
gagatggcagccctccagagccccttctatggagataagatgaatctcttctccctgtgc 
cagaagatcgagcagtgtgactaccccccactccccggggagcactactccgagaagtta 
cgagaactggtcagcatgtgcatctgccctgacccccaccagagacctgacatcggatac 
gtgcaccaggtggccaagcagatgcacatctggatgtccagcacctgagcgtggatgcac 
cgtgccttatcaaagccagcaccactttgccttacttgagtcgtcttctcttcgagtggc 
cacctggtagcctagaacagctaagaccacagggttcagcaggttccccaaaaggctgcc 
cagccttacagcagatgctgaaggcagagcagctgagggaggggcgctggccacatgtca 
ctgatggtcagattccaaagtcctttctttatactgttgtggacaatctcagctgggtca 
ataagggcaggtggttcagcgagccacggcagccccctgtatctggattgtaatgtgaat 
ctttagggtaattcctccagtgacctgtcaaggcttatgctaacaggagacttgcaggag 
accgtgtgatttgtgtagtgagcctttgaaaatggttagtaccgggttcagtttagttct 
tagtatcttttcaatcaagctgtgtgcttaatttactctgttgtaaagggataaagtgga 
aatcatttttttccgtggagtggtgattctgctaacatttttatctacgttttataactt 
ggtgagtgacgatgagagccctgcacctggccagagtgtcacaggcaaaaggcatcggga 
agcaggagcatcttcttggcagccaggctgggccatcttctcctggacacctgctgtgta 
ccaggaacttcgtcacctccttgaatgctggcggttcatttcatgatcagtgttaagcat 
tttcctccatgggaaggaagcatgggatatagaaaagcgaagggctgtcctttacaaatt 
ctggttctgcaacttcctagcgtgactttgggcttgggcaagtttcttagccgttctgag 
ccttcatttcctcatctgtacaatgagattaatagtacctatcatctaccttcaggattg 
ctgacagacagaatttgaaataaaatatgcaagttagctaatacaaaaagtagatgatcc 
aaaaatggtagccactcacccttcacaaactgaagtccatggaccacggaagtcgagaat 
taatgtacacctgtatcatgtgtaggaaaccagaaatgtgttccttatttcttgttccca 
aacaggattaactgtgaagactaatttataaatgtgaacctaagaaaactccacctctga 
aggaaatcatttgaattttgtttttgtacgtaaagttaaccttccaattgtctgagctgt 
cgtcactgacttcatgacagtctggccctccagacaagagcagcgctggcatcgggcagg 
tgattcctgacacctgctgcctgcaggcattcactgaccaggcctttcctggaggaaaca 
cccagggccgggcggctgctgtttccacacgtggactcggatctgctgtgacaccgtcag 
cccgacagtctctccatatgcagcctttcctctgtacttttctccatggttgaaataaaa 
cagggtgactggga 
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SEQ ID NO:258. >T11445_T3 # TY Transcript # LN 2265 # Source Gene: T11445 
§ Encoded protein: T11445_P13 

gggcgcgcgggcccgcgcaggcggtggcggcggcggcggaaccgagctgacgggcgtgcg 
gccgctgcgccgcaaactcgtgtgggacgcaccgctccagccgcccgcgggccagcgcac 
cggtcccccagcggcagccgagcccgcccgcgcgccgttcgtgccctcgtgaggctggca 
tgcaggatggcaggacagcccggccacatgccccatggagggagttccaacaacctctgc 
cacaccctggggcctgtgcatcctcctgacccacagaggcatcccaacacgctgtctttt 
cgctgctcgctggcggacttccagatcgaaaagaagataggccgaggacagttcagcgag 
gtgtacaaggccacctgcctgctggacaggaagacagtggctctgaagaaggtgcagatc 
tttgagatgatggacgccaaggcgaggcaggactgtgtcaaggagatcggcctcttgaag 
caactgaaccacccaaatatcatcaagtatttggactcgtttatcgaagacaacgagctg 
aacattgtgctggagttggctgacgcaggggacctctcgcagatgatcaagtactttaag 
aagcagaagcggctcatcccggagaggacagtatggaagtactttgtgcagctgtgcagc 
gccgtggagcacatgcattcacgccgggtgatgcaccgagacatcaagcctgccaacgtg 
ttcatcacagccacgggcgtcgtgaagctcggtgaccttggtctgggccgcttcttcagc 
tctgagaccaccgcagcccactccctagtggggacgccctactacatgtcaccggagagg 
atccatgagaacggctacaacttcaagtccgacatctggtccttgggctgtctgctgtac 
gagatggcagccctccagagccccttctatggagataagatgaatctcttctccctgtgc 
cagaagatcgagcagtgtgactaccccccactccccggggagcactactccgagaagtta 
cgagaactggtcagcatgtgcatctgccctgacccccaccagagacctgacatcggatac 
gtgcaccaggtggccaagcagatgcacatctggatgtccagcacctgagcgtggatgcac 
cgtgccttatcaaagccagcaccactttgccttacttgagtcgtcttctcttcgagtggc 
cacctggtagcctagaacagctaagaccacagggttcagcaggttccccaaaaggctgcc 
cagccttacagcagatgctgaaggcagagcagctgagggaggggcgctggccacatgtca 
ctgatggtcagattccaaagtcctttctttatactgttgtggacaatctcagctgggtca 
ataagggcaggtggttcagcgagccacggcagccccctgtatctggattgtaatgtgaat 
ctttagggtaattcctccagtgacctgtcaaggcttatgctaacaggagacttgcaggag 
accgtgtgatttgtgtagtgagcctttgaaaatggttagtaccgggttcagtttagttct 
tagtatcttttcaatcaagctgtgtgcttaatttactctgttgtaaagggataaagtgga 
aatcatttttttccgtggagtggtgattctgctaacatttttatctacgttttataactt 
ggtgagtgacgatgagagccctgcacctggccagagtgtcacaggcaaaaggcatcggga 
agcaggagcatcttcttggcagccaggctgggccatcttctcctggacacctgctgtgta 
ccaggaacttcgtcacctccttgaatgctggcggttcatttcatgatcagtgttaagcat 
tttcctccatgggaaggaagcatgggatatagaaaagcgaagggctgtcctttacaaatt 
ctggttctgcaacttcctagcgtgactttgggcttgggcaagtttcttagccgttctgag 
ccttcatttcctcatctgtacaatgagattaatagtacctatcatctaccttcaggattg 
ctgacagacagaatttgaaataaaatatgcaagttagctaatacaaaaagtagatgatcc 
aaaaatggtagccactcacccttcacaaactgaagtccatggaccacggaagtcgagaat 
taatgtacacctgtatcatgtgtaggaaaccagaaatgtgttccttatttcttgttccca 
aacaggattaactgtgaagactaatttataaatgtgaacctaaga 

SEQ ID NO:259. >T11445_T4 # TY Transcript > LN 2962 # Source Gene: T11445 
# Encoded protein: T11445_P13 

gggcgcgcgggcccgcgcaggcggtggcggcggcggcggaaccgagctgacgggcgtgcg 
gccgctgcgccgcaaactcgtgtgggacgcaccgctccagccgcccgcgggccagcgcac 
cggtcccccagcggcagccgagcccgcccgcgcgccgttcgtgccctcgtgaggctggca 
tgcaggatggcaggacagcccggccacatgccccatggagggagttccaacaacctctgc 
cacaccctggggcctgtgcatcctcctgacccacagaggcatcccaacacgctgtctttt 
cgctgctcgctggcggacttccagatcgaaaagaagataggccgaggacagttcagcgag 
gtgtacaaggccacctgcctgctggacaggaagacagtggctctgaagaaggtgcagatc 
tttgagatgatggacgccaaggcgaggcaggactgtgtcaaggagatcggcctcttgaag 
caactgaaccacccaaatatcatcaagtatttggactcgtttatcgaagacaacgagctg 
aacattgtgctggagttggctgacgcaggggacctctcgcagatgatcaagtactttaag 
aagcagaagcggctcatcccggagaggacagtatggaagtactttgtgcagctgtgcagc 
gccgtggagcacatgcattcacgccgggtgatgcaccgagacatcaagcctgccaacgtg 
ttcatcacagccacgggcgtcgtgaagctcggtgaccttggtctgggccgcttcttcagc 
tctgagaccaccgcagcccactccctagtggggacgccctactacatgtcaccggagagg 
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atccatgagaacggctacaacttcaagtccgacatctggtccttgggctgtctgctgtac 
gagatggcagccctccagagccccttctatggagataagatgaatctcttctccctgtgc 
cagaagatcgagcagtgtgactaccccccactccccggggagcactactccgagaagtta 
cgagaactggtcagcatgtgcatctgccctgacccccaccagagacctgacatcggatac 
gtgcaccaggtggccaagcagatgcacatctggatgtccagcacctgagcgtggatgcac 
cgtgccttatcaaagccagcaccactttgccttacttgagtcgtcttctcttcgagtggc 
cacctggtagcctagaacagctaagaccacagggttcagcaggttccccaaaaggctgcc 
cagccttacagcagatgctgaaggcagagcagctgagggaggggcgctggccacatgtca 
ctgatggtcagattccaaagtcctttctttatactgttgtggacaatctcagctgggtca 
ataagggcaggtggttcagcgagccacggcagccccctgtatctggattgtaatgtgaat 
ctttagggtaattcctccagtgacctgtcaaggcttatgctaacaggagacttgcaggag 
accgtgtgatttgtgtagtgagcctttgaaaatggttagtaccgggttcagtttagttct 
tagtatcttttcaatcaagctgtgtgcttaatttactctgttgtaaagggataaagtgga 
aatcatttttttccgtggagtggtgattctgctaacatttttatctacgttttataactt 
ggtgagtgacgatgagagccctgcacctggccagagtgtcacaggcaaaaggcatcggga 
agcaggagcatcttcttggcagccaggctgggccatcttctcctggacacctgctgtgta 
ccaggaacttcgtcacctccttgaatgctggcggttcatttcatgatcagtgttaagcat 
tttcctccatatgcagcctttcctctgtacttttctccatggttgaaataaaacagggtg 
actgggagttacttagaattcatgaagattttaaaatggctttggagattttgcttttaa 
accagtagattcaaaacttaaacagcgtctgcagcacaatttcttgaggaaccttgaaaa 
acacaacttcccaggccccattcagtaatcccaggatttctttaagctccccaaataatt 
ttgaaactcatcagccgagtttctgccctcatgaggtaattccatcgttctccccagcct 
gcccctggcagctgtaacacaggagctggcctgagagcagattcaccctggaatgttctc 
tccacagaacaatcaagtccctgtcgcctgcctagtgcttaccactgaagatttttctga 
ttccagaccaactttttgccaacattctgcttccagctctctgagcccctgccgtgtctc 
cccaacactgccagccccagcacgcaatcaacctactttgtgcatgccacccgctttcca 
cactgtgagaacaatctgcccaactggaccctctggaggcgcacgatctcagccactcac 
caggcctgagtgtttgtgaaatgatcatgtcctacttattacaaaaccgtaaccccaaaa 
cattccttttatttctgtgaaaccggccaaagtgaggtccacccaccttcacacagctct 
ggcggtgcacctgctcaccttctcttggttctcagaactgagctgggcttgagaacacag 
ctttggctttgccatttttttcctacttggctgctgaggtggagggtgtgctgcacttat 
caccccatttcaaaaccaaaccaaacctgaggccaccccaaacaaattcagccagcaaaa 
agggtaggtatcgatggtcacctgaagcctcaagggagtccactctgacttctgacagca 
gacagaacctatctgtcctcaggaggggcgcatggctaacctacgctctcctctgagcct 
aggccagtgctcaaggaggatcagcaccctgaagggccaggccaatggccttagcagccc 
cctactgcccccaaaaacacca 

SEQ ID NO:260. >T11445_T5 # TY Transcript # LN 3582 # Source Gene: T11445 
# Encoded protein: T114 45_P1 

ccagcccagagaggcgacccagctccctggaggcccaggcacctcatgtctgccctcaac 
ccagagtggagcagaccttccctggagcttctcaggagccacttcaaaagttcgtgccct 
cgtgaggctggcatgcaggatggcaggacagcccggccacatgccccatggagggagttc 
caacaacctctgccacaccctggggcctgtgcatcctcctgacccacagaggcatcccaa 
cacgctgtcttttcgctgctcgctggcggacttccagatcgaaaagaagataggccgagg 
acagttcagcgaggtgtacaaggccacctgcctgctggacaggaagacagtggctctgaa 
gaaggtgcagatctttgagatgatggacgccaaggcgaggcaggactgtgtcaaggagat 
cggcctcttgaagcaactgaaccacccaaatatcatcaagtatttggactcgtttatcga 
agacaacgagctgaacattgtgctggagttggctgacgcaggggacctctcgcagatgat 
caagtactttaagaagcagaagcggctcatcccggagaggacagtatggaagtactttgt 
gcagctgtgcagcgccgtggagcacatgcattcacgccgggtgatgcaccgagacatcaa 
gcctgccaacgtgttcatcacagccacgggcgtcgtgaagctcggtgaccttggtctggg 
ccgcttcttcagctctgagaccaccgcagcccactccctagtggggacgccctactacat 
gtcaccggagaggatccatgagaacggctacaacttcaagtccgacatctggtccttggg 
ctgtctgctgtacgagatggcagccctccagagccccttctatggagataagatgaatct 
cttctccctgtgccagaagatcgagcagtgtgactaccccccactccccggggagcacta 
ctccgagaagttacgagaactggtcagcatgtgcatctgccctgacccccaccagagacc 
tgacatcggatacgtgcaccaggtggccaagcagatgcacatctggatgtccagcacctg 
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agcgtggatgcaccgtgccttatcaaagccagcaccactttgccttacttgagtcgtctt 
ctcttcgagtggccacctggtagcctagaacagctaagaccacagggttcagcaggttcc 
ccaaaaggctgcccagccttacagcagatgctgaaggcagagcagctgagggaggggcgc 
tggccacatgtcactgatggtcagattccaaagtcctttctttatactgttgtggacaat 
ctcagctgggtcaataagggcaggtggttcagcgagccacggcagccccctgtatctgga 
ttgtaatgtgaatctttagggtaattcctccagtgacctgtcaaggcttatgctaacagg 
agacttgcaggagaccgtgtgatttgtgtagtgagcctttgaaaatggttagtaccgggt 
tcagtttagttcttagtatcttttcaatcaagctgtgtgcttaatttactctgttgtaaa 
gggataaagtggaaatcatttttttccgtggagtggtgattctgctaacatttttatcta 
cgttttataacttggtgagtgacgatgagagccctgcacctggccagagtgtcacaggca 
aaaggcatcgggaagcaggagcatcttcttggcagccaggctgggccatcttctcctgga 
cacctgctgtgtaccaggaacttcgtcacctccttgaatgctggcggttcatttcatgat 
cagtgttaagcattttcctccatgggaaggaagcatgggatatagaaaagcgaagggctg 
tcctttacaaattctggttctgcaacttcctagcgtgactttgggcttgggcaagtttct 
tagccgttctgagccttcatttcctcatctgtacaatgagattaatagtacctatcatct 
accttcaggattgctgacagacagaatttgaaataaaatatgcaagttagctaatacaaa 
aagtagatgatccaaaaatggtagccactcacccttcacaaactgaagtccatggaccac 
ggaagtcgagaattaatgtacacctgtatcatgtgtaggaaaccagaaatgtgttcctta 
tttcttgttcccaaacaggattaactgtgaagactaatttataaatgtgaacctaagaaa 
actccacctctgaaggaaatcatttgaattttgtttttgtacgtaaagttaaccttccaa 
ttgtctgagctgtcgtcactgacttcatgacagtctggccctccagacaagagcagcgct 
ggcatcgggcaggtgattcctgacacctgctgcctgcaggcattcactgaccaggccttt 
cctggaggaaacacccagggccgggcggctgctgtttccacacgtggactcggatctgct 
gtgacaccgtcagcccgacagtctctccatatgcagcctttcctctgtacttttctccat 
ggttgaaataaaacagggtgactgggagttacttagaattcatgaagattttaaaatggc 
tttggagattttgcttttaaaccagtagattcaaaacttaaacagcgtctgcagcacaat 
ttcttgaggaaccttgaaaaacacaacttcccaggccccattcagtaatcccaggatttc 
tttaagctccccaaataattttgaaactcatcagccgagtttctgccctcatgaggtaat 
tccatcgttctccccagcctgcccctggcagctgtaacacaggagctggcctgagagcag 
attcaccctggaatgttctctccacagaacaatcaagtccctgtcgcctgcctagtgctt 
accactgaagatttttctgattccagaccaactttttgccaacattctgcttccagctct 
ctgagcccctgccgtgtctccccaacactgccagccccagcacgcaatcaacctactttg 
tgcatgccacccgctttccacactgtgagaacaatctgcccaactggaccctctggaggc 
gcacgatctcagccactcaccaggcctgagtgtttgtgaaatgatcatgtcctacttatt 
acaaaaccgtaaccccaaaacattccttttatttctgtgaaaccggccaaagtgaggtcc 
acccaccttcacacagctctggcggtgcacctgctcaccttctcttggttctcagaactg 
agctgggcttgagaacacagctttggctttgccatttttttcctacttggctgctgaggt 
ggagggtgtgctgcacttatcaccccatttcaaaaccaaaccaaacctgaggccacccca 
aacaaattcagccagcaaaaagggtaggtatcgatggtcacctgaagcctcaagggagtc 
cactctgacttctgacagcagacagaacctatctgtcctcaggaggggcgcatggctaac 
ctacgctctcctctgagcctaggccagtgctcaaggaggatcagcaccctgaagggccag 
gccaatggccttagcagccccctactgcccccaaaaacacca 

SEQ ID NO:261. >T11445_T6 # TY Transcript # LN 3595 # Source Gene: T11445 
# Encoded protein: T11445_P14 

ggyasgagcaatgctaggcgatggggaatccgaagacgcacaggagtccaagggttggct 
gactgcagaccctcatcataagtctagagccggagaagatggggagacgccggcctgcgc 
cctttcgtgccctcgtgaggctggcatgcaggatggcaggacagcccggccacatgcccc 
atggagggagttccaacaacctctgccacaccctggggcctgtgcatcctcctgacccac 
agaggcatcccaacacgctgtcttttcgctgctcgctggcggacttccagatcgaaaaga 
agataggccgaggacagttcagcgaggtgtacaaggccacctgcctgctggacaggaaga 
cagtggctctgaagaaggtgcagatctttgagatgatggacgccaaggcgaggcaggact 
gtgtcaaggagatcggcctcttgaagcaactgaaccacccaaatatcatcaagtatttgg 
actcgtttatcgaagacaacgagctgaacattgtgctggagttggctgacgcaggggacc 
tctcgcagatgatcaagtactttaagaagcagaagcggctcatcccggagaggacagtat 
ggaagtactttgtgcagctgtgcagcgccgtggagcacatgcattcacgccgggtgatgc 
accgagacatcaagcctgccaacgtgttcatcacagccacgggcgtcgtgaagctcggtg 
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accttggtctgggccgcttcttcagctctgagaccaccgcagcccactccctagtgggga 
cgccctactacatgtcaccggagaggatccatgagaacggctacaacttcaagtccgaca 
tctggtccttgggctgtctgctgtacgagatggcagccctccagagccccttctatggag 
ataagatgaatctcttctccctgtgccagaagatcgagcagtgtgactaccccccactcc 
ccggggagcactactccgagaagttacgagaactggtcagcatgtgcatctgccctgacc 
cccaccagagacctgacatcggatacgtgcaccaggtggccaagcagatgcacatctgga 
tgtccagcacctgagcgtggatgcaccgtgccttatcaaagccagcaccactttgcctta 
cttgagtcgtcttctcttcgagtggccacctggtagcctagaacagctaagaccacaggg 
ttcagcaggttccccaaaaggctgcccagccttacagcagatgctgaaggcagagcagct 
gagggaggggcgctggccacatgtcactgatggtcagattccaaagtcctttctttatac 
tgttgtggacaatctcagctgggtcaataagggcaggtggttcagcgagccacggcagcc 
ccctgtatctggattgtaatgtgaatctttagggtaattcctccagtgacctgtcaaggc 
ttatgctaacaggagacttgcaggagaccgtgtgatttgtgtagtgagcctttgaaaatg 
gttagtaccgggttcagtttagttcttagtatcttttcaatcaagctgtgtgcttaattt 
actctgttgtaaagggataaagtggaaatcatttttttccgtggagtggtgattctgcta 
acatttttatctacgttttataacttggtgagtgacgatgagagccctgcacctggccag 
agtgtcacaggcaaaaggcatcgggaagcaggagcatcttcttggcagccaggctgggcc 
atcttctcctggacacctgctgtgtaccaggaacttcgtcacctccttgaatgctggcgg 
ttcatttcatgatcagtgttaagcattttcctccatgggaaggaagcatgggatatagaa 
aagcgaagggctgtcctttacaaattctggttctgcaacttcctagcgtgactttgggct 
tgggcaagtttcttagccgttctgagccttcatttcctcatctgtacaatgagattaata 
gtacctatcatctaccttcaggattgctgacagacagaatttgaaataaaatatgcaagt 
tagctaatacaaaaagtagatgatccaaaaatggtagccactcacccttcacaaactgaa 
gtccatggaccacggaagtcgagaattaatgtacacctgtatcatgtgtaggaaaccaga 
aatgtgttccttatttcttgttcccaaacaggattaactgtgaagactaatttataaatg 
tgaacctaagaaaactccacctctgaaggaaatcatttgaattttgtttttgtacgtaaa 
gttaaccttccaattgtctgagctgtcgtcactgacttcatgacagtctggccctccaga 
caagagcagcgctggcatcgggcaggtgattcctgacacctgctgcctgcaggcattcac 
tgaccaggcctttcctggaggaaacacccagggccgggcggctgctgtttccacacgtgg 
actcggatctgctgtgacaccgtcagcccgacagtctctccatatgcagcctttcctctg 
tacttttctccatggttgaaataaaacagggtgactgggagttacttagaattcatgaag 
attttaaaatggctttggagattttgcttttaaaccagtagattcaaaacttaaacagcg 
tctgcagcacaatttcttgaggaaccttgaaaaacacaacttcccaggccccattcagta 
atcccaggatttctttaagctccccaaataattttgaaactcatcagccgagtttctgcc 
ctcatgaggtaattccatcgttctccccagcctgcccctggcagctgtaacacaggagct 
ggcctgagagcagattcaccctggaatgttctctccacagaacaatcaagtccctgtcgc 
ctgcctagtgcttaccactgaagatttttctgattccagaccaactttttgccaacattc 
tgcttccagctctctgagcccctgccgtgtctccccaacactgccagccccagcacgcaa 
tcaacctactttgtgcatgccacccgctttccacactgtgagaacaatctgcccaactgg 
accctctggaggcgcacgatctcagccactcaccaggcctgagtgtttgtgaaatgatca 
tgtcctacttattacaaaaccgtaaccccaaaacattccttttatttctgtgaaaccggc 
caaagtgaggtccacccaccttcacacagctctggcggtgcacctgctcaccttctcttg 
gttctcagaactgagctgggcttgagaacacagctttggctttgccatttttttcctact 
tggctgctgaggtggagggtgtgctgcacttatcaccccatttcaaaaccaaaccaaacc 
tgaggccaccccaaacaaattcagccagcaaaaagggtaggtatcgatggtcacctgaag 
cctcaagggagtccactctgacttctgacagcagacagaacctatctgtcctcaggaggg 
gcgcatggctaacctacgctctcctctgagcctaggccagtgctcaaggaggatcagcac 
cctgaagggccaggccaatggccttagcagccccctactgcccccaaaaacacca 

SEQ ID NO:262. >T11445JT7 # TY Transcript tt LN 3645 # Source Gene: T11445 
# Encoded protein: T11445_P1 

tattttgagagagcaggtttaccccagaacagtctgtggaaggtgtggggcagggaggag 
agctgggccgccctgaggacacagagggagaagggtgctgggagccagcctgcaagttct 
ggggagcctcagggtgcctttgagggcacaacttgggaggccaagctcagcggttcgtgc 
cctcgtgaggctggcatgcaggatggcaggacagcccggccacatgccccatggagggag 
ttccaacaacctctgccacaccctggggcctgtgcatcctcctgacccacagaggcatcc 
caacacgctgtcttttcgctgctcgctggcggacttccagatcgaaaagaagataggccg 
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aggacagttcagcgaggtgtacaaggccacctgcctgctggacaggaagacagtggctct 
gaagaaggtgcagatctttgagatgatggacgccaaggcgaggcaggactgtgtcaagga 
gatcggcctcttgaagcaactgaaccacccaaatatcatcaagtatttggactcgtttat 
cgaagacaacgagctgaacattgtgctggagttggctgacgcaggggacctctcgcagat 
gatcaagtactttaagaagcagaagcggctcatcccggagaggacagtatggaagtactt 
tgtgcagctgtgcagcgccgtggagcacatgcattcacgccgggtgatgcaccgagacat 
caagcctgccaacgtgttcatcacagccacgggcgtcgtgaagctcggtgaccttggtct 
gggccgcttcttcagctctgagaccaccgcagcccactccctagtggggacgccctacta 
catgtcaccggagaggatccatgagaacggctacaacttcaagtccgacatctggtcctt 
gggctgtctgctgtacgagatggcagccctccagagccccttctatggagataagatgaa 
tctcttctccctgtgccagaagatcgagcagtgtgactaccccccactccccggggagca 
ctactccgagaagttacgagaactggtcagcatgtgcatctgccctgacccccaccagag 
acctgacatcggatacgtgcaccaggtggccaagcagatgcacatctggatgtccagcac 
ctgagcgtggatgcaccgtgccttatcaaagccagcaccactttgccttacttgagtcgt 
cttctcttcgagtggccacctggtagcctagaacagctaagaccacagggttcagcaggt 
tccccaaaaggctgcccagccttacagcagatgctgaaggcagagcagctgagggagggg 
cgctggccacatgtcactgatggtcagattccaaagtcctttctttatactgttgtggac 
aatctcagctgggtcaataagggcaggtggttcagcgagccacggcagccccctgtatct 
ggattgtaatgtgaatctttagggtaattcctccagtgacctgtcaaggcttatgctaac 
aggagacttgcaggagaccgtgtgatttgtgtagtgagcctttgaaaatggttagtaccg 
ggttcagtttagttcttagtatcttttcaatcaagctgtgtgcttaatttactctgttgt 
aaagggataaagtggaaatcatttttttccgtggagtggtgattctgctaacatttttat 
ctacgttttataacttggtgagtgacgatgagagccctgcacctggccagagtgtcacag 
gcaaaaggcatcgggaagcaggagcatcttcttggcagccaggctgggccatcttctcct 
ggacacctgctgtgtaccaggaacttcgtcacctccttgaatgctggcggttcatttcat 
gatcagtgttaagcattttcctccatgggaaggaagcatgggatatagaaaagcgaaggg 
ctgtcctttacaaattctggttctgcaacttcctagcgtgactttgggcttgggcaagtt 
tcttagccgttctgagccttcatttcctcatctgtacaatgagattaatagtacctatca 
tctaccttcaggattgctgacagacagaatttgaaataaaatatgcaagttagctaatac 
aaaaagtagatgatccaaaaatggtagccactcacccttcacaaactgaagtccatggac 
cacggaagtcgagaattaatgtacacctgtatcatgtgtaggaaaccagaaatgtgttcc 
ttatttcttgttcccaaacaggattaactgtgaagactaatttataaatgtgaacctaag 
aaaactccacctctgaaggaaatcatttgaattttgtttttgtacgtaaagttaaccttc 
caattgtctgagctgtcgtcactgacttcatgacagtctggccctccagacaagagcagc 
gctggcatcgggcaggtgattcctgacacctgctgcctgcaggcattcactgaccaggcc 
tttcctggaggaaacacccagggccgggcggctgctgtttccacacgtggactcggatct 
gctgtgacaccgtcagcccgacagtctctccatatgcagcctttcctctgtacttttctc 
catggttgaaataaaacagggtgactgggagttacttagaattcatgaagattttaaaat 
ggctttggagattttgcttttaaaccagtagattcaaaacttaaacagcgtctgcagcac 
aatttcttgaggaaccttgaaaaacacaacttcccaggccccattcagtaatcccaggat 
ttctttaagctccccaaataattttgaaactcatcagccgagtttctgccctcatgaggt 
aattccatcgttctccccagcctgcccctggcagctgtaacacaggagctggcctgagag 
cagattcaccctggaatgttctctccacagaacaatcaagtccctgtcgcctgcctagtg 
cttaccactgaagatttttctgattccagaccaactttttgccaacattctgcttccagc 
tctctgagcccctgccgtgtctccccaacactgccagccccagcacgcaatcaacctact 
ttgtgcatgccacccgctttccacactgtgagaacaatctgcccaactggaccctctgga 
ggcgcacgatctcagccactcaccaggcctgagtgtttgtgaaatgatcatgtcctactt 
attacaaaaccgtaaccccaaaacattccttttatttctgtgaaaccggccaaagtgagg 
tccacccaccttcacacagctctggcggtgcacctgctcaccttctcttggttctcagaa 
ctgagctgggcttgagaacacagctttggctttgccatttttttcctacttggctgctga 
ggtggagggtgtgctgcacttatcaccccatttcaaaaccaaaccaaacctgaggccacc 
ccaaacaaattcagccagcaaaaagggtaggtatcgatggtcacctgaagcctcaaggga 
gtccactctgacttctgacagcagacagaacctatctgtcctcaggaggggcgcatggct 
aacctacgctctcctctgagcctaggccagtgctcaaggaggatcagcaccctgaagggc 
caggccaatggccttagcagccccctactgcccccaaaaacacca 



146/365 
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SEQ ID NO:263. >T11445_T8 # TY Transcript # LN 3590 # Source Gene: T11445 
# Encoded protein: T114 45_P3 

ttccccctccggcgcacgtcgccgcttactcacccccagacgatggtaccggtccggaat 
tctgcagatagggccagcgcaccggtcccttgncccagccgagcccgcccgcgcgccgtt 
cgtgccctcgtgaggctggcatgcaggatggcaggacagcccggccacatgccccatgga 
gggagttccaacaacctctgccacaccctggggcctgtgcatcctcctgacccacagagg 
catcccaacacgctgtcttttcgctgctcgctggcggacttccagatcgaaaagaagata 
ggccgaggacagttcagcgaggtgtacaaggccacctgcctgctggacaggaagacagtg 
gctctgaagaaggtgcagatctttgagatgatggacgccaaggcgaggcaggactgtgtc 
aaggagatcggcctcttgaagcaactgaaccacccaaatatcatcaagtatttggactcg 
tttatcgaagacaacgagctgaacattgtgctggagttggctgacgcaggggacctctcg 
cagatgatcaagtactttaagaagcagaagcggctcatcccggagaggacagtatggaag 
tactttgtgcagctgtgcagcgccgtggagcacatgcattcacgccgggtgatgcaccga 
gacatcaagcctgccaacgtgttcatcacagccacgggcgtcgtgaagctcggtgacctt 
ggtctgggccgcttcttcagctctgagaccaccgcagcccactccctagtggggacgccc 
tactacatgtcaccggagaggatccatgagaacggctacaacttcaagtccgacatctgg 
tccttgggctgtctgctgtacgagatggcagccctccagagccccttctatggagataag 
atgaatctcttctccctgtgccagaagatcgagcagtgtgactaccccccactccccggg 
gagcactactccgagaagttacgagaactggtcagcatgtgcatctgccctgacccccac 
cagagacctgacatcggatacgtgcaccaggtggccaagcagatgcacatctggatgtcc 
agcacctgagcgtggatgcaccgtgccttatcaaagccagcaccactttgccttacttga 
gtcgtcttctcttcgagtggccacctggtagcctagaacagctaagaccacagggttcag 
caggttccccaaaaggctgcccagccttacagcagatgctgaaggcagagcagctgaggg 
aggggcgctggccacatgtcactgatggtcagattccaaagtcctttctttatactgttg 
tggacaatctcagctgggtcaataagggcaggtggttcagcgagccacggcagccccctg 
tatctggattgtaatgtgaatctttagggtaattcctccagtgacctgtcaaggcttatg 
ctaacaggagacttgcaggagaccgtgtgatttgtgtagtgagcctttgaaaatggttag 
taccgggttcagtttagttcttagtatcttttcaatcaagctgtgtgcttaatttactct 
gttgtaaagggataaagtggaaatcatttttttccgtggagtggtgattctgctaacatt 
tttatctacgttttataacttggtgagtgacgatgagagccctgcacctggccagagtgt 
cacaggcaaaaggcatcgggaagcaggagcatcttcttggcagccaggctgggccatctt 
ctcctggacacctgctgtgtaccaggaacttcgtcacctccttgaatgctggcggttcat 
ttcatgatcagtgttaagcattttcctccatgggaaggaagcatgggatatagaaaagcg 
aagggctgtcctttacaaattctggttctgcaacttcctagcgtgactttgggcttgggc 
aagtttcttagccgttctgagccttcatttcctcatctgtacaatgagattaatagtacc 
tatcatctaccttcaggattgctgacagacagaatttgaaataaaatatgcaagttagct 
aatacaaaaagtagatgatccaaaaatggtagccactcacccttcacaaactgaagtcca 
tggaccacggaagtcgagaattaatgtacacctgtatcatgtgtaggaaaccagaaatgt 
gttccttatttcttgttcccaaacaggattaactgtgaagactaatttataaatgtgaac 
ctaagaaaactccacctctgaaggaaatcatttgaattttgtttttgtacgtaaagttaa 
ccttccaattgtctgagctgtcgtcactgacttcatgacagtctggccctccagacaaga 
gcagcgctggcatcgggcaggtgattcctgacacctgctgcctgcaggcattcactgacc 
aggcctttcctggaggaaacacccagggccgggcggctgctgtttccacacgtggactcg 
gatctgctgtgacaccgtcagcccgacagtctctccatatgcagcctttcctctgtactt 
ttctccatggttgaaataaaacagggtgactgggagttacttagaattcatgaagatttt 
aaaatggctttggagattttgcttttaaaccagtagattcaaaacttaaacagcgtctgc 
agcacaatttcttgaggaaccttgaaaaacacaacttcccaggccccattcagtaatccc 
aggatttctttaagctccccaaataattttgaaactcatcagccgagtttctgccctcat 
gaggtaattccatcgttctccccagcctgcccctggcagctgtaacacaggagctggcct 
gagagcagattcaccctggaatgttctctccacagaacaatcaagtccctgtcgcctgcc 
tagtgcttaccactgaagatttttctgattccagaccaactttttgccaacattctgctt 
ccagctctctgagcccctgccgtgtctccccaacactgccagccccagcacgcaatcaac 
ctactttgtgcatgccacccgctttccacactgtgagaacaatctgcccaactggaccct 
ctggaggcgcacgatctcagccactcaccaggcctgagtgtttgtgaaatgatcatgtcc 
tacttattacaaaaccgtaaccccaaaacattccttttatttctgtgaaaccggccaaag 
tgaggtccacccaccttcacacagctctggcggtgcacctgctcaccttctcttggttct 
cagaactgagctgggcttgagaacacagctttggctttgccatttttttcctacttggct 
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gctgaggtggagggtgtgctgcacttatcaccccatttcaaaaccaaaccaaacctgagg 
ccaccccaaacaaattcagccagcaaaaagggtaggtatcgatggtcacctgaagcctca 
agggagtccactctgacttctgacagcagacagaacctatctgtcctcaggaggggcgca 
tggctaacctacgctctcctctgagcctaggccagtgctcaaggaggatcagcaccctga 
agggccaggccaatggccttagcagccccctactgcccccaaaaacacca 

SEQ ID NO:264. >T11445JT9 # TY Transcript # LN 3649 # Source Gene: T11445 
# Encoded protein: T11445_P1 

gggcgcgcgggcccgcgcaggcggtggcggcggcggcggaaccgagctgacgggcgtgcg 
gccgctgcgccgcaaactcgtgtgggacgcaccgctccagccgcccgcgggccagcgcac 
cggtcccccagcggcagccgagcccgcccgcgcgccggccgctgttttctgttgcagttc 
gtgccctcgtgaggctggcatgcaggatggcaggacagcccggccacatgccccatggag 
ggagttccaacaacctctgccacaccctggggcctgtgcatcctcctgacccacagaggc 
atcccaacacgctgtcttttcgctgctcgctggcggacttccagatcgaaaagaagatag 
gccgaggacagttcagcgaggtgtacaaggccacctgcctgctggacaggaagacagtgg 
ctctgaagaaggtgcagatctttgagatgatggacgccaaggcgaggcaggactgtgtca 
aggagatcggcctcttgaagcaactgaaccacccaaatatcatcaagtatttggactcgt 
ttatcgaagacaacgagctgaacattgtgctggagttggctgacgcaggggacctctcgc 
agatgatcaagtactttaagaagcagaagcggctcatcccggagaggacagtatggaagt 
actttgtgcagctgtgcagcgccgtggagcacatgcattcacgccgggtgatgcaccgag 
acatcaagcctgccaacgtgttcatcacagccacgggcgtcgtgaagctcggtgaccttg 
gtctgggccgcttcttcagctctgagaccaccgcagcccactccctagtggggacgccct 
actacatgtcaccggagaggatccatgagaacggctacaacttcaagtccgacatctggt 
ccttgggctgtctgctgtacgagatggcagccctccagagccccttctatggagataaga 
tgaatctcttctccctgtgccagaagatcgagcagtgtgactaccccccactccccgggg 
agcactactccgagaagttacgagaactggtcagcatgtgcatctgccctgacccccacc 
agagacctgacatcggatacgtgcaccaggtggccaagcagatgcacatctggatgtcca 
gcacctgagcgtggatgcaccgtgccttatcaaagccagcaccactttgccttacttgag 
tcgtcttctcttcgagtggccacctggtagcctagaacagctaagaccacagggttcagc 
aggttccccaaaaggctgcccagccttacagcagatgctgaaggcagagcagctgaggga 
ggggcgctggccacatgtcactgatggtcagattccaaagtcctttctttatactgttgt 
ggacaatctcagctgggtcaataagggcaggtggttcagcgagccacggcagccccctgt 
atctggattgtaatgtgaatctttagggtaattcctccagtgacctgtcaaggcttatgc 
taacaggagacttgcaggagaccgtgtgatttgtgtagtgagcctttgaaaatggttagt 
accgggttcagtttagttcttagtatcttttcaatcaagctgtgtgcttaatttactctg 
ttgtaaagggataaagtggaaatcatttttttccgtggagtggtgattctgctaacattt 
ttatctacgttttataacttggtgagtgacgatgagagccctgcacctggccagagtgtc 
acaggcaaaaggcatcgggaagcaggagcatcttcttggcagccaggctgggccatcttc 
tcctggacacctgctgtgtaccaggaacttcgtcacctccttgaatgctggcggttcatt 
tcatgatcagtgttaagcattttcctccatgggaaggaagcatgggatatagaaaagcga 
agggctgtcctttacaaattctggttctgcaacttcctagcgtgactttgggcttgggca 
agtttcttagccgttctgagccttcatttcctcatctgtacaatgagattaatagtacct 
atcatctaccttcaggattgctgacagacagaatttgaaataaaatatgcaagttagcta 
atacaaaaagtagatgatccaaaaatggtagccactcacccttcacaaactgaagtccat 
ggaccacggaagtcgagaattaatgtacacctgtatcatgtgtaggaaaccagaaatgtg 
ttccttatttcttgttcccaaacaggattaactgtgaagactaatttataaatgtgaacc 
taagaaaactccacctctgaaggaaatcatttgaattttgtttttgtacgtaaagttaac 
cttccaattgtctgagctgtcgtcactgacttcatgacagtctggccctccagacaagag 
cagcgctggcatcgggcaggtgattcctgacacctgctgcctgcaggcattcactgacca 
ggcctttcctggaggaaacacccagggccgggcggctgctgtttccacacgtggactcgg 
atctgctgtgacaccgtcagcccgacagtctctccatatgcagcctttcctctgtacttt 
tctccatggttgaaataaaacagggtgactgggagttacttagaattcatgaagatttta 
aaatggctttggagattttgcttttaaaccagtagattcaaaacttaaacagcgtctgca 
gcacaatttcttgaggaaccttgaaaaacacaacttcccaggccccattcagtaatccca 
ggatttctttaagctccccaaataattttgaaactcatcagccgagtttctgccctcatg 
aggtaattccatcgttctccccagcctgcccctggcagctgtaacacaggagctggcctg 
agagcagattcaccctggaatgttctctccacagaacaatcaagtccctgtcgcctgcct 
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agtgcttaccactgaagatttttctgattccagaccaactttttgccaacattctgcttc 
cagctctctgagcccctgccgtgtctccccaacactgccagccccagcacgcaatcaacc 
tactttgtgcatgccacccgctttccacactgtgagaacaatctgcccaactggaccctc 
tggaggcgcacgatctcagccactcaccaggcctgagtgtttgtgaaatgatcatgtcct 
acttattacaaaaccgtaaccccaaaacattccttttatttctgtgaaaccggccaaagt 
gaggtccacccaccttcacacagctctggcggtgcacctgctcaccttctcttggttctc 
agaactgagctgggcttgagaacacagctttggctttgccatttttttcctacttggctg 
ctgaggtggagggtgtgctgcacttatcaccccatttcaaaaccaaaccaaacctgaggc 
caccccaaacaaattcagccagcaaaaagggtaggtatcgatggtcacctgaagcctcaa 
gggagtccactctgacttctgacagcagacagaacctatctgtcctcaggaggggcgcat 
ggctaacctacgctctcctctgagcctaggccagtgctcaaggaggatcagcaccctgaa 
gggccaggccaatggccttagcagccccctactgcccccaaaaacacca 

SEQ ID NO:265. >T11445JT10 # TY Transcript # LN 3794 # Source Gene: T11445 
# Encoded protein: T11445_P14 

gggcgcgcgggcccgcgcaggcggtggcggcggcggcggaaccgagctgacgggcgtgcg 
gccgctgcgccgcaaactcgtgtgggacgcaccgctccagccgcccgcgggccagcgcac 
cggtcccccagcggcagccgagcccgcccgcgcgccggagaagaagacacatgaattaga 
gacagcacgggggagcaggctgtggagctgggagtgacggggtgagtccaggaaggctgc 
ctggaggagatgcccaggagagaagtttgctgggaggcagctcatttccggcaggaggag 
cagagcctgccaaggcctcgagttcgtgccctcgtgaggctggcatgcaggatggcagga 
cagcccggccacatgccccatggagggagttccaacaacctctgccacaccctggggcct 
gtgcatcctcctgacccacagaggcatcccaacacgctgtcttttcgctgctcgctggcg 
gacttccagatcgaaaagaagataggccgaggacagttcagcgaggtgtacaaggccacc 
tgcctgctggacaggaagacagtggctctgaagaaggtgcagatctttgagatgatggac 
gccaaggcgaggcaggactgtgtcaaggagatcggcctcttgaagcaactgaaccaccca 
aatatcatcaagtatttggactcgtttatcgaagacaacgagctgaacattgtgctggag 
ttggctgacgcaggggacctctcgcagatgatcaagtactttaagaagcagaagcggctc 
atcccggagaggacagtatggaagtactttgtgcagctgtgcagcgccgtggagcacatg 
cattcacgccgggtgatgcaccgagacatcaagcctgccaacgtgttcatcacagccacg 
ggcgtcgtgaagctcggtgaccttggtctgggccgcttcttcagctctgagaccaccgca 
gcccactccctagtggggacgccctactacatgtcaccggagaggatccatgagaacggc 
tacaacttcaagtccgacatctggtccttgggctgtctgctgtacgagatggcagccctc 
cagagccccttctatggagataagatgaatctcttctccctgtgccagaagatcgagcag 
tgtgactaccccccactccccggggagcactactccgagaagttacgagaactggtcagc 
atgtgcatctgccctgacccccaccagagacctgacatcggatacgtgcaccaggtggcc 
aagcagatgcacatctggatgtccagcacctgagcgtggatgcaccgtgccttatcaaag 
ccagcaccactttgccttacttgagtcgtcttctcttcgagtggccacctggtagcctag 
aacagctaagaccacagggttcagcaggttccccaaaaggctgcccagccttacagcaga 
tgctgaaggcagagcagctgagggaggggcgctggccacatgtcactgatggtcagattc 
caaagtcctttctttatactgttgtggacaatctcagctgggtcaataagggcaggtggt 
tcagcgagccacggcagccccctgtatctggattgtaatgtgaatctttagggtaattcc 
tccagtgacctgtcaaggcttatgctaacaggagacttgcaggagaccgtgtgatttgtg 
tagtgagcctttgaaaatggttagtaccgggttcagtttagttcttagtatcttttcaat 
caagctgtgtgcttaatttactctgttgtaaagggataaagtggaaatcatttttttccg 
tggagtggtgattctgctaacatttttatctacgttttataacttggtgagtgacgatga 
gagccctgcacctggccagagtgtcacaggcaaaaggcatcgggaagcaggagcatcttc 
ttggcagccaggctgggccatcttctcctggacacctgctgtgtaccaggaacttcgtca 
cctccttgaatgctggcggttcatttcatgatcagtgttaagcattttcctccatgggaa 
ggaagcatgggatatagaaaagcgaagggctgtcctttacaaattctggttctgcaactt 
cctagcgtgactttgggcttgggcaagtttcttagccgttctgagccttcatttcctcat 
ctgtacaatgagattaatagtacctatcatctaccttcaggattgctgacagacagaatt 
tgaaataaaatatgcaagttagctaatacaaaaagtagatgatccaaaaatggtagccac 
tcacccttcacaaactgaagtccatggaccacggaagtcgagaattaatgtacacctgta 
tcatgtgtaggaaaccagaaatgtgttccttatttcttgttcccaaacaggattaactgt 
gaagactaatttataaatgtgaacctaagaaaactccacctctgaaggaaatcatttgaa 
ttttgtttttgtacgtaaagttaaccttccaattgtctgagctgtcgtcactgacttcat 
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Figure 1 (Confd) 



gacagtctggccctccagacaagagcagcgctggcatcgggcaggtgattcctgacacct 
gctgcctgcaggcattcactgaccaggcctttcctggaggaaacacccagggccgggcgg 
ctgctgtttccacacgtggactcggatctgctgtgacaccgtcagcccgacagtctctcc 
atatgcagcctttcctctgtacttttctccatggttgaaataaaacagggtgactgggag 
ttacttagaattcatgaagattttaaaatggctttggagattttgcttttaaaccagtag 
attcaaaacttaaacagcgtctgcagcacaatttcttgaggaaccttgaaaaacacaact 
tcccaggccccattcagtaatcccaggatttctttaagctccccaaataattttgaaact 
catcagccgagtttctgccctcatgaggtaattccatcgttctccccagcctgcccctgg 
cagctgtaacacaggagctggcctgagagcagattcaccctggaatgttctctccacaga 
acaatcaagtccctgtcgcctgcctagtgcttaccactgaagatttttctgattccagac 
caactttttgccaacattctgcttccagctctctgagcccctgccgtgtctccccaacac 
tgccagccccagcacgcaatcaacctactttgtgcatgccacccgctttccacactgtga 
gaacaatctgcccaactggaccctctggaggcgcacgatctcagccactcaccaggcctg 
agtgtttgtgaaatgatcatgtcctacttattacaaaaccgtaaccccaaaacattcctt 
ttatttctgtgaaaccggccaaagtgaggtccacccaccttcacacagctctggcggtgc 
acctgctcaccttctcttggttctcagaactgagctgggcttgagaacacagctttggct 
ttgccatttttttcctacttggctgctgaggtggagggtgtgctgcacttatcaccccat 
ttcaaaaccaaaccaaacctgaggccaccccaaacaaattcagccagcaaaaagggtagg 
tatcgatggtcacctgaagcctcaagggagtccactctgacttctgacagcagacagaac 
ctatctgtcctcaggaggggcgcatggctaacctacgctctcctctgagcctaggccagt 
gctcaaggaggatcagcaccctgaagggccaggccaatggccttagcagccccctactgc 
ccccaaaaacacca 

SEQ ID NO:266. >T11445_T11 # TY Transcript # LN 1629 # Source Gene: T11445 

# Encoded protein: T11445__P13 

gggcgcgcgggcccgcgcaggcggtggcggcggcggcggaaccgagctgacgggcgtgcg 
gccgctgcgccgcaaactcgtgtgggacgcaccgctccagccgcccgcgggccagcgcac 
cggtcccccagcggcagccgagcccgcccgcgcgccgttcgtgccctcgtgaggctggca 
tgcaggatggcaggacagcccggccacatgccccatggagggagttccaacaacctctgc 
cacaccctggggcctgtgcatcctcctgacccacagaggcatcccaacacgctgtctttt 
cgctgctcgctggcggacttccagatcgaaaagaagataggccgaggacagttcagcgag 
gtgtacaaggccacctgcctgctggacaggaagacagtggctctgaagaaggtgcagatc 
tttgagatgatggacgccaaggcgaggcaggactgtgtcaaggagatcggcctcttgaag 
caactgaaccacccaaatatcatcaagtatttggactcgtttatcgaagacaacgagctg 
aacattgtgctggagttggctgacgcaggggacctctcgcagatgatcaagtactttaag 
aagcagaagcggctcatcccggagaggacagtatggaagtactttgtgcagctgtgcagc 
gccgtggagcacatgcattcacgccgggtgatgcaccgagacatcaagcctgccaacgtg 
ttcatcacagccacgggcgtcgtgaagctcggtgaccttggtctgggccgcttcttcagc 
tctgagaccaccgcagcccactccctagtggggacgccctactacatgtcaccggagagg 
atccatgagaacggctacaacttcaagtccgacatctggtccttgggctgtctgctgtac 
gagatggcagccctccagagccccttctatggagataagatgaatctcttctccctgtgc 
cagaagatcgagcagtgtgactaccccccactccccggggagcactactccgagaagtta 
cgagaactggtcagcatgtgcatctgccctgacccccaccagagacctgacatcggatac 
gtgcaccaggtggccaagcagatgcacatctggatgtccagcacctgagcgtggatgcac 
cgtgccttatcaaagccagcaccactttgccttacttgagtcgtcttctcttcgagtggc 
cacctggtagcctagaacagctaagaccacagggttcagcaggttccccaaaaggctgcc 
cagccttacagcagatgctgaaggcagagcagctgagggaggggcgctggccacatgtca 
ctgatggtcagattccaaagtcctttctttatactgttgtggacaatctcagctgggtca 
ataagggcaggtggttcagcgagccacggcagccccctgtatctggattgtaatgtgaat 
ctttagggtaattcctccagtgacctgtcaaggcttatgctaacaggagacttgcaggag 
accgtgtgatttgtgtagtgagcctttgaaaatggttagtaccgggttcagtttagttct 
tagtatcttttcaatcaagctgtgtgcttaatttactctgttgtaaagggataaagtgga 
aatcatttt 

SEQ ID NO:267. >T11445_T12 < TY Transcript # LN 3393 # Source Gene: T11445 

# Encoded protein: T114 4 5_P5 

ggccgcgcggggctgagcgtgtcggcggccgggcgggcgggggcggcgtgtccgcgtgtc 
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Figure 1 (Cont'd) 



ccggggtgtgcgcgtccccggcgggtgtgcgtgtgcaccccggagcgtctggtggcgttg 
ttgggttcgtgccctcgtgaggctggcatgcaggatggcaggacagcccggccacatgcc 
ccatggagggagttccaacaacctctgccacaccctggggcctgtgcatcctcctgaccc 
acagcaactgaaccacccaaatatcatcaagtatttggactcgtttatcgaagacaacga 
gctgaacattgtgctggagttggctgacgcaggggacctctcgcagatgatcaagtactt 
taagaagcagaagcggctcatcccggagaggacagtatggaagtactttgtgcagctgtg 
cagcgccgtggagcacatgcattcacgccgggtgatgcaccgagacatcaagcctgccaa 
cgtgttcatcacagccacgggcgtcgtgaagctcggtgaccttggtctgggccgcttctt 
cagctctgagaccaccgcagcccactccctagtggggacgccctactacatgtcaccgga 
gaggatccatgagaacggctacaacttcaagtccgacatctggtccttgggctgtctgct 
gtacgagatggcagccctccagagccccttctatggagataagatgaatctcttctccct 
gtgccagaagatcgagcagtgtgactaccccccactccccggggagcactactccgagaa 
gttacgagaactggtcagcatgtgcatctgccctgacccccaccagagacctgacatcgg 
atacgtgcaccaggtggccaagcagatgcacatctggatgtccagcacctgagcgtggat 
gcaccgtgccttatcaaagccagcaccactttgccttacttgagtcgtcttctcttcgag 
tggccacctggtagcctagaacagctaagaccacagggttcagcaggttccccaaaaggc 
tgcccagccttacagcagatgctgaaggcagagcagctgagggaggggcgctggccacat 
gtcactgatggtcagattccaaagtcctttctttatactgttgtggacaatctcagctgg 
gtcaataagggcaggtggttcagcgagccacggcagccccctgtatctggattgtaatgt 
gaatctttagggtaattcctccagtgacctgtcaaggcttatgctaacaggagacttgca 
ggagaccgtgtgatttgtgtagtgagcctttgaaaatggttagtaccgggttcagtttag 
ttcttagtatcttttcaatcaagctgtgtgcttaatttactctgttgtaaagggataaag 
tggaaatcatttttttccgtggagtggtgattctgctaacatttttatctacgttttata 
acttggtgagtgacgatgagagccctgcacctggccagagtgtcacaggcaaaaggcatc 
gggaagcaggagcatcttcttggcagccaggctgggccatcttctcctggacacctgctg 
tgtaccaggaacttcgtcacctccttgaatgctggcggttcatttcatgatcagtgttaa 
gcattttcctccatgggaaggaagcatgggatatagaaaagcgaagggctgtcctttaca 
aattctggttctgcaacttcctagcgtgactttgggcttgggcaagtttcttagccgttc 
tgagccttcatttcctcatctgtacaatgagattaatagtacctatcatctaccttcagg 
attgctgacagacagaatttgaaataaaatatgcaagttagctaatacaaaaagtagatg 
atccaaaaatggtagccactcacccttcacaaactgaagtccatggaccacggaagtcga 
gaattaatgtacacctgtatcatgtgtaggaaaccagaaatgtgttccttatttcttgtt 
cccaaacaggattaactgtgaagactaatttataaatgtgaacctaagaaaactccacct 
ctgaaggaaatcatttgaattttgtttttgtacgtaaagttaaccttccaattgtctgag 
ctgtcgtcactgacttcatgacagtctggccctccagacaagagcagcgctggcatcggg 
caggtgattcctgacacctgctgcctgcaggcattcactgaccaggcctttcctggagga 
aacacccagggccgggcggctgctgtttccacacgtggactcggatctgctgtgacaccg 
tcagcccgacagtctctccatatgcagcctttcctctgtacttttctccatggttgaaat 
aaaacagggtgactgggagttacttagaattcatgaagattttaaaatggctttggagat 
tttgcttttaaaccagtagattcaaaacttaaacagcgtctgcagcacaatttcttgagg 
aaccttgaaaaacacaacttcccaggccccattcagtaatcccaggatttctttaagctc 
cccaaataattttgaaactcatcagccgagtttctgccctcatgaggtaattccatcgtt 
ctccccagcctgcccctggcagctgtaacacaggagctggcctgagagcagattcaccct 
ggaatgttctctccacagaacaatcaagtccctgtcgcctgcctagtgcttaccactgaa 
gatttttctgattccagaccaactttttgccaacattctgcttccagGtctctgagcccc 
tgccgtgtctccccaacactgccagccccagcacgcaatcaacctactttgtgcatgcca 
cccgctttccacactgtgagaacaatctgcccaactggaccctctggaggcgcacgatct 
cagccactcaccaggcctgagtgtttgtgaaatgatcatgtcctacttattacaaaaccg 
taaccccaaaacattccttttatttctgtgaaaccggccaaagtgaggtccacccacctt 
cacacagctctggcggtgcacctgctcaccttctcttggttctcagaactgagctgggct 
tgagaacacagctttggctttgccatttttttcctacttggctgctgaggtggagggtgt 
gctgcacttatcaccccatttcaaaaccaaaccaaacctgaggccaccccaaacaaattc 
agccagcaaaaagggtaggtatcgatggtcacctgaagcctcaagggagtccactctgac 
ttctgacagcagacagaacctatctgtcctcaggaggggcgcatggctaacctacgctct 
cctctgagcctaggccagtgctcaaggaggatcagcaccctgaagggccaggccaatggc 
cttagcagccccctactgcccccaaaaacacca 
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Figur 1 (Cont'd) 



SEQ ID NO:268. >T11445_T13 # TY Transcript # LN 3518 # Source Gene: T11445 
# Encoded protein: T11445_P6 

gggcgcgcgggcccgcgcaggcggtggcggcggcggcggaaccgagctgacgggcgtgcg 
gccgctgcgccgcaaactcgtgtgggacgcaccgctccagccgcccgcgggccagcgcac 
cggtcccccagcggcagccgagcccgcccgcgcgccgttcgtgccctcgtgaggctggca 
tgcaggatggcaggacagcccggccacatgccccatggagggagttccaacaacctctgc 
cacaccctggggcctgtgcatcctcctgacccacagaggcatcccaacacgctgtctttt 
cgctgctcgctggcggacttccagatcgaaaagaagataggccgaggacagttcagcgag 
gtgtacaaggccacctgcctgctggacaggaagacagtggctctgaagaaggtgcagatc 
tttgagatgatggacgccaaggcgaggcaggactgtgtcaaggagatcggcctcttgaag 
tactttaagaagcagaagcggctcatcccggagaggacagtatggaagtactttgtgcag 
ctgtgcagcgccgtggagcacatgcattcacgccgggtgatgcaccgagacatcaagcct 
gccaacgtgttcatcacagccacgggcgtcgtgaagctcggtgaccttggtctgggccgc 
ttcttcagctctgagaccaccgcagcccactccctagtggggacgccctactacatgtca 
ccggagaggatccatgagaacggctacaacttcaagtccgacatctggtccttgggctgt 
ctgctgtacgagatggcagccctccagagccccttctatggagataagatgaatctcttc 
tccctgtgccagaagatcgagcagtgtgactaccccccactccccggggagcactactcc 
gagaagttacgagaactggtcagcatgtgcatctgccctgacccccaccagagacctgac 
atcggatacgtgcaccaggtggccaagcagatgcacatctggatgtccagcacctgagcg 
tggat^gcaccgtgccttatcaaagccagcaccactttgccttacttgagtcgtcttctct 
tcgagtggccacctggtagcctagaacagctaagaccacagggttcagcaggttccccaa 
aaggctgcccagccttacagcagatgctgaaggcagagcagctgagggaggggcgctggc 
cacatgtcactgatggtcagattccaaagtcctttctttatactgttgtggacaatctca 
gctgggtcaataagggcaggtggttcagcgagccacggcagccccctgtatctggattgt 
aatgtgaatctttagggtaattcctccagtgacctgtcaaggcttatgctaacaggagac 
ttgcaggagaccgtgtgatttgtgtagtgagcctttgaaaatggttagtaccgggttcag 
tttagttcttagtatcttttcaatcaagctgtgtgcttaatttactctgttgtaaaggga 
taaagtggaaatcatttttttccgtggagtggtgattctgctaacatttttatctacgtt 
ttataacttggtgagtgacgatgagagccctgcacctggccagagtgtcacaggcaaaag 
gcatcgggaagcaggagcatcttcttggcagccaggctgggccatcttctcctggacacc 
tgctgtgtaccaggaacttcgtcacctccttgaatgctggcggttcatttcatgatcagt 
gttaagcattttcctccatgggaaggaagcatgggatatagaaaagcgaagggctgtcct 
ttacaaattctggttctgcaacttcctagcgtgactttgggcttgggcaagtttcttagc 
cgttctgagccttcatttcctcatctgtacaatgagattaatagtacctatcatctacct 
tcaggattgctgacagacagaatttgaaataaaatatgcaagttagctaatacaaaaagt 
agatgatccaaaaatggtagccactcacccttcacaaactgaagtccatggaccacggaa 
gtcgagaattaatgtacacctgtatcatgtgtaggaaaccagaaatgtgttccttatttc 
ttgttcccaaacaggattaactgtgaagactaatttataaatgtgaacctaagaaaactc 
cacctctgaaggaaatcatttgaattttgtttttgtacgtaaagttaaccttccaattgt 
ctgagctgtcgtcactgacttcatgacagtctggccctccagacaagagcagcgctggca 
tcgggcaggtgattcctgacacctgctgcctgcaggcattcactgaccaggcctttcctg 
gaggaaacacccagggccgggcggctgctgtttccacacgtggactcggatctgctgtga 
caccgtcagcccgacagtctctccatatgcagcctttcctctgtacttttctccatggtt 
gaaataaaacagggtgactgggagttacttagaattcatgaagattttaaaatggctttg 
gagattttgcttttaaaccagtagattcaaaacttaaacagcgtctgcagcacaatttct 
tgaggaaccttgaaaaacacaacttcccaggccccattcagtaatcccaggatttcttta 
agctccccaaataattttgaaactcatcagccgagtttctgccctcatgaggtaattcca 
tcgttctccccagcctgcccctggcagctgtaacacaggagctggcctgagagcagattc 
accctggaatgttctctccacagaacaatcaagtccctgtcgcctgcctagtgcttacca 
ctgaagatttttctgattccagaccaactttttgccaacattctgcttccagctctctga 
gcccctgccgtgtctccccaacactgccagccccagcacgcaatcaacctactttgtgca 
tgccacccgctttccacactgtgagaacaatctgcccaactggaccctctggaggcgcac 
gatctcagccactcaccaggcctgagtgtttgtgaaatgatcatgtcctacttattacaa 
aaccgtaaccccaaaacattccttttatttctgtgaaaccggccaaagtgaggtccaccc 
accttcacacagctctggcggtgcacctgctcaccttctcttggttctcagaactgagct 
gggcttgagaacacagctttggctttgccatttttttcctacttggctgctgaggtggag 
ggtgtgctgcacttatcaccccatttcaaaaccaaaccaaacctgaggccaccccaaaca 
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Figur 1 (Cont'd) 



aattcagccagcaaaaagggtaggtatcgatggtcacctgaagcctcaagggagtccact 
ctgacttctgacagcagacagaacctatctgtcctcaggaggggcgcatggctaacctac 
gctctcctctgagcctaggccagtgctcaaggaggatcagcaccctgaagggccaggcca 
atggccttagcagccccctactgcccccaaaaacacca 

SEQ ID NO:269. >T11445JT14 # TY Transcript § LN 2231 # Source Gene: T11445 

# Encoded protein: T11445_P14 

ggyasgagcaatgctaggcgatggggaatccgaagacgcacaggagtccaagggttggct 
gactgcagaccctcatcataagtctagagccggagaagatggggagacgccggcctgcgc 
cctttcgtgccctcgtgaggctggcatgcaggatggcaggacagcccggccacatgcccc 
atggagggagttccaacaacctctgccacaccctggggcctgtgcatcctcctgacccac 
agaggcatcccaacacgctgtcttttcgctgctcgctggcggacttccagatcgaaaaga 
agataggccgaggacagttcagcgaggtgtacaaggccacctgcctgctggacaggaaga 
cagtggctctgaagaaggtgcagatctttgagatgatggacgccaaggcgaggcaggact 
gtgtcaaggagatcggcctcttgaagcaactgaaccacccaaatatcatcaagtatttgg 
actcgtttatcgaagacaacgagctgaacattgtgctggagttggctgacgcaggggacc 
tctcgcagatgatcaagtactttaagaagcagaagcggctcatcccggagaggacagtat 
ggaagtactttgtgcagctgtgcagcgccgtggagcacatgcattcacgccgggtgatgc 
accgagacatcaagcctgccaacgtgttcatcacagccacgggcgtcgtgaagctcggtg 
accttggtctgggccgcttcttcagctctgagaccaccgcagcccactccctagtgggga 
cgccctactacatgtcaccggagaggatccatgagaacggctacaacttcaagtccgaca 
tctggtccttgggctgtctgctgtacgagatggcagccctccagagccccttctatggag 
ataagatgaatctcttctccctgtgccagaagatcgagcagtgtgactaccccccactcc 
ccggggagcactactccgagaagttacgagaactggtcagcatgtgcatctgccctgacc 
cccaccagagacctgacatcggatacgtgcaccaggtggccaagcagatgcacatctgga 
tgtccagcacctgagcgtggatgcaccgtgccttatcaaagccagcaccactttgcctta 
cttgagtcgtcttctcttcgagtggccacctggtagcctagaacagctaagaccacaggg 
ttcagcaggttccccaaaaggctgcccagccttacagcagatgctgaaggcagagcagct 
gagggaggggcgctggccacatgtcactgatggtcagattccaaagtcctttctttatac 
tgttgtggacaatctcagctgggtcaataagggcaggtggttcagcgagccacggcagcc 
ccctgtatctggattgtaatgtgaatctttagggtaattcctccagtgacctgtcaaggc 
ttatgctaacaggagacttgcaggagaccgtgtgatttgtgtagtgagcctttgaaaatg 
gttagtaccgggttcagtttagttcttagtatcttttcaatcaagctgtgtgcttaattt 
actctgttgtaaagggataaagtggaaatcatttttttccgtggagtggtgattctgcta 
acatttttatctacgttttataacttggtgagtgacgatgagagccctgcacctggccag 
agtgtcacaggcaaaaggcatcgggaagcaggagcatcttcttggcagccaggctgggcc 
atcttctcctggacacctgctgtgtaccaggaacttcgtcacctccttgaatgctggcgg 
ttcatttcatgatcagtgttaagcattttcctccatgggaaggaagcatgggatatagaa 
aagcgaagggctgtcctttacaaattctggttctgcaacttcctagcgtgactttgggct 
tgggcaagtttcttagccgttctgagccttcatttcctcatctgtacaatgagattaata 
gtacctatcatctaccttcaggattgctgacagacagaatttgaaataaaatatgcaagt 
tagctaatacaaaaagtagatgatccaaaaatggtagccactcacccttcacaaactgaa 
gtccatggaccacggaagtcgagaattaatgtacacctgtatcatgtgtaggaaaccaga 
aatgtgttccttatttcttgttcccaaacaggattaactgtgaagactaatttataaatg 
tgaacctaaga 

SEQ ID NO:270. >T11445_T15 # TY Transcript # LN 1773 # Source Gene: T11445 

# Encoded protein: T11445_P13 

gggcgcgcgggcccgcgcaggcggtggcggcggcggcggaaccgagctgacgggcgtgcg 
gccgctgcgccgcaaactcgtgtgggacgcaccgctccagccgcccgcgggccagcgcac 
cggtcccccagcggcagccgagcccgcccgcgcgccgttcgtgccctcgtgaggctggca 
tgcaggatggcaggacagcccggccacatgccccatggagggagttccaacaacctctgc 
cacaccctggggcctgtgcatcctcctgacccacagaggcatcccaacacgctgtctttt 
cgctgctcgctggcggacttccagatcgaaaagaagataggccgaggacagttcagcgag 
gtgtacaaggccacctgcctgctggacaggaagacagtggctctgaagaaggtgcagatc 
tttgagatgatggacgccaaggcgaggcaggactgtgtcaaggagatcggcctcttgaag 
caactgaaccacccaaatatcatcaagtatttggactcgtttatcgaagacaacgagctg 
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Figure 1 (Confd) 



aacattgtgctggagttggctgacgcaggggacctctcgcagatgatcaagtactttaag 
aagcagaagcggctcatcccggagaggacagtatggaagtactttgtgcagctgtgcagc 
gccgtggagcacatgcattcacgccgggtgatgcaccgagacatcaagcctgccaacgtg 
ttcatcacagccacgggcgtcgtgaagctcggtgaccttggtctgggccgcttcttcagc 
tctgagaccaccgcagcccactccctagtggggacgccctactacatgtcaccggagagg 
atccatgagaacggctacaacttcaagtccgacatctggtccttgggctgtctgctgtac 
gagatggcagccctccagagccccttctatggagataagatgaatctcttctccctgtgc 
cagaagatcgagcagtgtgactaccccccactccccggggagcactactccgagaagtta 
cgagaactggtcagcatgtgcatctgccctgacccccaccagagacctgacatcggatac 
gtgcaccaggtggccaagcagatgcacatctggatgtccagcacctgagcgtggatgcac 
cgtgccttatcaaagccagcaccactttgccttacttgagtcgtcttctcttcgagtggc 
cacctggtagcctagaacagctaagaccacagggttcagcaggttccccaaaaggctgcc 
cagccttacagcagatgctgaaggcagagcagctgagggaggggcgctggccacatgtca 
ctgatggtcagattccaaagtcctttctttatactgttgtggacaatctcagctgggtca 
ataagggcaggtggttcagcgagccacggcagccccctgtatctggattgtaatgtgaat 
ctttagggtaattcctccagtgacctgtcaaggcttatgctaacaggagacttgcaggag 
accgtgtgatttgtgtagtgagcctttgaaaatggttagtaccgggttcagtttagttct 
tagtatcttttcaatcaagctgtgtgcttaatttactctgttgtaaagggataaagtgga 
aatcattttaaaaaaaaaaaaaaaaaattcggggcctctttttcgggggccgccacacgg 
ggtttggaagggccaggccctagcggttccggcccactttttggccaaggggatctcgcc 
ggcccctaaagggcgataattcgaagcgtcgaa 

SEQ ID NO:271. >T11445_T16 # TY Transcript # LN 1846 # Source Gene: T11445 
# Encoded protein: T11445_P13 

gggcgcgcgggcccgcgcaggcggtggcggcggcggcggaaccgagctgacgggcgtgcg 
gccgctgcgccgcaaactcgtgtgggacgcaccgctccagccgcccgcgggccagcgcac 
cggtcccccagcggcagccgagcccgcccgcgcgccgttcgtgccctcgtgaggctggca 
tgcaggatggcaggacagcccggccacatgccccatggagggagttccaacaacctctgc 
cacaccctggggcctgtgcatcctcctgacccacagaggcatcccaacacgctgtctttt 
cgctgctcgctggcggacttccagatcgaaaagaagataggccgaggacagttcagcgag 
gtgtacaaggccacctgcctgctggacaggaagacagtggctctgaagaaggtgcagatc 
tttgagatgatggacgccaaggcgaggcaggactgtgtcaaggagatcggcctcttgaag 
caactgaaccacccaaatatcatcaagtatttggactcgtttatcgaagacaacgagctg 
aacattgtgctggagttggctgacgcaggggacctctcgcagatgatcaagtactttaag 
aagcagaagcggctcatcccggagaggacagtatggaagtactttgtgcagctgtgcagc 
gccgtggagcacatgcattcacgccgggtgatgcaccgagacatcaagcctgccaacgtg 
ttcatcacagccacgggcgtcgtgaagctcggtgaccttggtctgggccgcttcttcagc 
tctgagaccaccgcagcccactccctagtggggacgccctactacatgtcaccggagagg 
atccatgagaacggctacaacttcaagtccgacatctggtccttgggctgtctgctgtac 
gagatggcagccctccagagccccttctatggagataagatgaatctcttctccctgtgc 
cagaagatcgagcagtgtgactaccccccactccccggggagcactactccgagaagtta 
cgagaactggtcagcatgtgcatctgccctgacccccaccagagacctgacatcggatac 
gtgcaccaggtggccaagcagatgcacatctggatgtccagcacctgagcgtggatgcac 
cgtgccttatcaaagccagcaccactttgccttacttgagtcgtcttctcttcgagtggc 
cacctggtagcctagaacagctaagaccacagggttcagcaggttccccaaaaggctgcc 
cagccttacagcagatgctgaaggcagagcagctgagggaggggcgctggccacatgtca 
ctgatggtcagattccaaagtcctttctttatactgttgtggacaatctcagctgggtca 
ataagggcaggtggttcagcgagccacggcagccccctgtatctggattgtaatgtgaat 
ctttagggtaattcctccagtgacctgtcaaggcttatgctaacaggagacttgcaggag 
accgtgtgatttgtgtagtgagcctttgaaaatggttagtaccgggttcagtttagttct 
tagtatcttttcaatcaagctgtgtgcttaatttactctgttgtaaagggataaagtgga 
aatcattttacaacgaaaacacaacaacagagactcaaggggcatcaacagcgcggggac 
gccgagacaaccagacgccacgggacgggcaacacggggcaaaacgcggcacccggaccc 
agcattcccgcggcccacaggggaaaaatgcgcaggagccccaaatagggaacccgcaca 
aactcgggaaagcacgggtacggaaggacaagacgcgacgacactn 
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Figur 1 (Cont'd) 



SEQ ID NO:272. >T11445_T17 # TY Transcript 8 LN 1794 # Source Gene: T11445 

# Encoded protein: T11445_P14 

gggcgcgcgggcccgcgcaggcggtggcggcggcggcggaaccgagctgacgggcgtgcg 
gccgctgcgccgcaaactcgtgtgggacgcaccgctccagccgcccgcgggccagcgcac 
cggtcccccagcggcagccgagcccgcccgcgcgccggagaagaagacacatgaattaga 
gacagcacgggggagcaggctgtggagctgggagtgacggggtgagtccaggaaggctgc 
ctggaggagatgcccaggagagaagtttgctgggaggcagctcatttccggcaggaggag 
cagagcctgccaaggcctcgagttcgtgccctcgtgaggctggcatgcaggatggcagga 
cagcccggccacatgccccatggagggagttccaacaacctctgccacaccctggggcct 
gtgcatcctcctgacccacagaggcatcccaacacgctgtcttttcgctgctcgctggcg 
gacttccagatcgaaaagaagataggccgaggacagttcagcgaggtgtacaaggccacc 
tgcctgctggacaggaagacagtggctctgaagaaggtgcagatctttgagatgatggac 
gccaaggcgaggcaggactgtgtcaaggagatcggcctcttgaagcaactgaaccaccca 
aatatcatcaagtatttggactcgtttatcgaagacaacgagctgaacattgtgctggag 
ttggctgacgcaggggacctctcgcagatgatcaagtactttaagaagcagaagcggctc 
atcccggagaggacagtatggaagtactttgtgcagctgtgcagcgccgtggagcacatg 
cattcacgccgggtgatgcaccgagacatcaagcctgccaacgtgttcatcacagccacg 
ggcgtcgtgaagctcggtgaccttggtctgggccgcttcttcagctctgagaccaccgca 
gcccactccctagtggggacgccctactacatgtcaccggagaggatccatgagaacggc 
tacaacttcaagtccgacatctggtccttgggctgtctgctgtacgagatggcagccctc 
cagagccccttctatggagataagatgaatctcttctccctgtgccagaagatcgagcag 
tgtgactaccccccactccccggggagcactactccgagaagttacgagaactggtcagc 
atgtgcatctgccctgacccccaccagagacctgacatcggatacgtgcaccaggtggcc 
aagcagatgcacatctggatgtccagcacctgagcgtggatgcaccgtgccttatcaaag 
ccagcaccactttgccttacttgagtcgtcttctcttcgagtggccacctggtagcctag 
aacagctaagaccacagggttcagcaggttccccaaaaggctgcccagccttacagcaga 
tgctgaaggcagagcagctgagggaggggcgctggccacatgtcactgatggtcagattc 
caaagtcctttctttatactgttgtggacaatctcagctgggtcaataagggcaggtggt 
tcagcgagccacggcagccccctgtatctggattgtaatgtgaatctttagggtaattcc 
tccagtgacctgtcaaggcttatgctaacaggagacttgcaggagaccgtgtgatttgtg 
tagtgagcctttgaaaatggttagtaccgggttcagtttagttcttagtatcttttcaat 
caagctgtgtgcttaatttactctgttgtaaagggataaagtggaaatcatttt 

SEQ ID NO:273. >T11445_T18 # TY Transcript t LN 2313 # Source Gene: T11445 

# Encoded protein: T11445_P7 

gggcgcgcgggcccgcgcaggcggtggcggcggcggcggaaccgagctgacgggcgtgcg 
gccgctgcgccgcaaactcgtgtgggacgcaccgctccagccgcccgcgggccagcgcac 
cggtcccccagcggcagccgagcccgcccgcgcgccgttcgtgccctcgtgaggctggca 
tgcaggatggcaggacagcccggccacatgccccatggagggagttccaacaacctctgc 
cacaccctggggcctgtgcatcctcctgacccacagaggcatcccaacacgctgtctttt 
cgctgctcgctggcggacttccagatcgaaaagaagataggccgaggacagttcagcgag 
gtgtacaaggccacctgcctgctggacaggaagacagtggctctgaagaaggtgcagatc 
tttgagatgatggacgccaaggcgaggcaggactgtgtcaaggagatcggcctcttgaag 
caactgaaccacccaaatatcatcaagtatttggactcgtttatcgaagacaacgagctg 
aacattgtgctggagttggctgacgcaggggacctctcgcagatgatcaagtactttaag 
aagcagaagcggctcatcccggagaggacagtatggaagtactttgtgcagctgtgcagc 
gccgtggagcacatgcattcacgccgggtgatgcaccgagacatcaagcctgccaacgtg 
ttcatcacagccacgggcgtcgtgaagctcggtgaccttggtctgggccgcttcttcagc 
tctgagaccaccgcagcccactccctagtggggacgccctactacatgtcaccggagagg 
atccatgagaacggctacaacttcaagtccgacatctggtccttgggctgtctgctgtac 
gagatggcagccctccagagccccttctatggagataagatgaatctcttctccctgtgc 
cagaagatcgagcagtgtgactaccccccactccccggggagcactactccgagaagggt 
tcaagttgcacaggaaggtgcttcggctaaacacggatagtcttccttgatctccactgt 
ctgctgtgtgggttccgccagcaaacctcccagcgagtggtcaccttcctatggagtcat 
taaacctctgaatgtcagctggtgggattagccagcagcttccacacctgagcccccgtg 
gcagggacagaggcatttggatcaagatcactaatgaatacagaattcatatgccgcttg 
tcagggctcctgagctcagagtcgcacagatgcaacctaaataagacggaggacagaaca 
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Figur 1 (Cont'd) 



atcgttgacaaagattaaaaagttttggcagttgggaaaaaaagagcacctcattattca 
aatatgatagtcttgcttatttttttttatttttgttcaagttattcatgattcacctca 
aatggacaaatgccattcaccaatatttaaagacttcttggtctcattactaaatctgaa 
gagggctgtgctgtctgtacatcagctatatggagaaaacagctctcctgtaaaacagct 
ctcttgtaaataatatttgaaaattttaagggaatataacaatcacaagctatttagggt 
ggagggcagtttaaaaatgaaaaaaaaaatctagaattaaaagaaaaaaatgaccttcca 
cagccaagtgattttattacatttactaacatctctatctctgacttccttaaacttttc 
tgttcctgggttccctccaatcacagagctgccaaagagctttacacaccttgtctgaca 
cttgcttggctttctgccgtcttcacaagacctcatcagagaaaagaaacagcccaagca 
gcccacacctctgctcacctggaataagatgtctttcccagtcttgttaaagaattacag 
gaataagaaagtgttaaagaattaaaggaataagatgtctttccctccctgttgtgaaag 
aatgacaaagaggcagatatagcccagtggctgaacccatcggagggctctaacgtcagg 
ctgaccagggttcagactctgagtcttccactgagtgtgtgacctctggccagttatttg 
acctccctacggcccaattttcttctctacaaaatggagacaacaaagtactgacctcaa 
ggaattgctgaaaagtaaatgaaataatataccactgtctatgaccaccacatatgcctg 
tctccaatattacgctttcagataagaaggcctgtatctatctatatcatgctataaatt 
caccaagcctgatgccagcacaaataatcacat 

SEQ ID NO:274. >T11445_T19 # TY Transcript # LN 1079 # Source Gene: T11445 

# Encoded protein: T11445_P8 

gggcgcgcgggcccgcgcaggcggtggcggcggcggcggaaccgagctgacgggcgtgcg 
gccgctgcgccgcaaactcgtgtgggacgcaccgctccagccgcccgcgggccagcgcac 
cggtcccccagcggcagccgagcccgcccgcgcgccgttcgtgccctcgtgaggctggca 
tgcaggatggcaggacagcccggccacatgccccatggagggagttccaacaacctctgc 
cacaccctggggcctgtgcatcctcctgacccacagaggcatcccaacacgctgtctttt 
cgctgctcgctggcggacttccagatcgaaaagaagataggccgaggacagttcagcgag 
gtgtacaaggccacctgcctgctggacaggaagacagtggctctgaagaaggtgcagatc 
tttgagatgatggacgccaaggcgaggcaggactgtgtcaaggagatcggcctcttgaag 
caactgaaccacccaaatatcatcaagtatttggactcgtttatcgaagacaacgagctg 
aacattgtgctggagttggctgacgcaggggacctctcgcagatgatcaagtactttaag 
aagcagaagcggctcatcccggagaggacagtatggaagtactttgtgcagctgtgcagc 
gccgtggagcacatgcattcacgccgggtgatgcaccgagacatcaagcctgccaacgtg 
ttcatcacagccacgggcgtcgtgaagctcggtgaccttggtctgggccgcttcttcagc 
tctgagaccaccgcagcccactccctagtggggacgccctactacatgtcaccggagagg 
atccatgagaacggctacaacttcaagtccgacatctggtccttgggctgtctgctgtac 
gaggcttggcyggstgrwaccaarratggcaasscccycccaraarcccccatccagatg 
gaaaaaaaagagaaggatccggttttcaccccctgggaccagaaaaaagaacctatagcc 
aatcacgaatttgagtaaggcaaacctataggccagcggggaagaaagatatagattac 

SEQ ID NO:275. >T11445_T20 # TY Transcript # LN 5217 # Source Gene: T11445 

# Encoded protein: T114 45_P9 

ggccgcgcggggctgagcgtgtcggcggccgggcgggcgggggcggcgtgtccgcgtgtc 
ccggggtgtgcgcgtccccggcgggtgtgcgtgtgcaccccggagcgtctggtggcgttg 
ttggggtgagtgcgtgcggggacgcgggcatggctccccgctacccacttcccgagtggg 
gacgacgggcggaggagtgtgccgggcgccgcagggagtacgtgcgggtctcagggagtg 
tgtgtgtgcacgtgtgtgcattccaaagtgatttgtgtgcaagcacatgcgtgagcctag 
aagggcgaaagtrtgcctgtgtgggtgtgagtataacgagtgtgcagggccgtgcgagcg 
tgtt tat tact gaggttgaggggccgccccctttgcgcccaccaggcctttgcacagtga 
tggccccgagccagggagccgtggagagagaagcggcctggcatgggctgggggtcgggg 
ctgggcctgatccggtgaaccaggggccagragggcagactcagagcctggctgggcctg 
acagaggcaaaggtcttagagccaggctgcggcagggccactggcaggaccctctctgca 
tacacaggctgggcgagcggaggctgtggtgggcccctcggccgggcacccgaggacgtg 
tgttttcctaaatatcctccaggctctcctgtctttgttcttgtctctctccagccaggc 
ctggggggctgaccctagacttgcctgtgtatgtgggggcctctctgcctgccctgctaa 
gtagacggatcttttgatttatttatacctctcctaggctccaagtgtgaatcccaggct 
ggagtacagtggcgcaatctcagctycctgcaccctctgcctcctgggttcaagtgagcc 
tcctgcctcagcctcctcctgagtagctgggattataggcaggtgccaccacgcccagat 
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Figure 1 (Cont'd) 



aatttttgtatttttagtagagatggggtttcactgtgttggccagccaggctggtctcg 
aactcctgacctcaggtgatccgcccgccttggcctccaaagtgctggaattacaggcat 
gagccaccatgcccggccatgtccccatttactgaggatatactgtgtgctaggtctggt 
gctgtattttacccagtccactcaccaacactaagatggaagctgtcacgatgcccattt 
acagatgaggaaactggggcacagagagatgagagaatgcgccaaagaccatgtgctaat 
acgcagcagagctggatttgaacccaggcttgtctgagcagrgcccatggtctccccact 
gtactaggaaggtagcaggcactgtgggctggatgcctggcctgtgctgaggttcagtgt 
crttttcctgatcttacagagagggaggttaagtaacttgggcagggtcataaacccaca 
aatggcagagtcgggattcgaacccatgtctctgattccagagttcctcaccttaaacca 
catctggctgcctttgtcgtgctttgctcctgttgcatgctgcaaagccaagtaggttaa 
gacaggctaggttcctgcagggctgggttgcccgagacctgggagaggtcttccagaagg 
cactgctggccttcctcgcccagggccagagctactggagagaaagggctgctgttcaca 
ttttttcttttctttaagtgccccagatacttttcataagggaagtgtcaaaacaattgt 
caaaaggagaagaatatttccttcctcttctcaggctagaccaagcaagggagttggagg 
gtggagagtggagggacctggtgaatcatccaagattaaatacgggcaggcactgccagg 
cttgtgtggagatgaggctgggacactgttcgcaggaagagtccggtgttctggccccct 
gatgtcaccttcacgggcctgactcacagtcctaaatatctgacagcgagatcgcttgta 
ggtaggcttcttgtcacttgtcatttaagtcccctgcctgcctttttttccctgctcatt 
tccatttcgatcacggcttctactacatgtgcttgaaatgaatgttgtgtgtgaggacct 
gtgtggtttgggccatgttgtgcccccctgccagtgtccttcggtgttgaaaggaaaagg 
tatttctctcccttccatgtatgttctgcaaattgacgaacatttgcaaattgctttgag 
atgcgttaatttaaggtgtgaaataatagatcttgtgcgaaggctcagccacatttctgg 
cattgatggatttttctcccgtctggcctgtgatttggggaggtaaaaatagtccttttt 
tccacatttttatctttttttttttttgctgacatcattcttaggaagtggtgtgcacat 
gccgtggaactttacatggaacatggagctctggggagcaggagtgctcttggagtgagg 
ggtggtctgcggtggcccaggcaccccctggctgtccgtgaaatggctttgaaatgagca 
ggcactgttgagggcctcagtggctttttggtatttggctgtcttttcacttatgactct 
tgggtaatggcaagataacttttggggaaaatgttagtttaaaaagaaaaaaaatcaggc 
cagcactttcactcattcctgtctgagctggtcccttcctttggccaaagctgaactaaa 
ccaatggccttctcagactcgacttccagacagaggagccgacattggggcaaatcagtt 
ggtggtgtaatttcctgtgatgttaactttgggaacttgaacttggagtgggagctagca 
ttttgacatgtgtccctctaacaggacttgcttgtgtggaaacagacccccacatacatg 
gctgcgccattgtctccaattctggatttttccacagatgacttctcgggtgatctggtg 
gtaaaggcacaggtcttctagccaggccttcttggcctccagcaagggcaggatcacagc 
accccaccagacctgactggccaccaccgctgtagctgtggctgggcgagtgggggggcc 
atgggaattgaaagatggggtggccgagatctgaaatgcattcagttgcagtggcctgga 
tctgcacggagccgttctttctttctcttagacatctgcrtggtggtgacctcatagaat 
tctaagaaaaaaggggctttatgtaggaattccatagagtgtgaaggaatttggacaggg 
gaagggggagggaagttgccattcagagcctgcagtgcctgcattttccccgaattgttt 
aaccctcatgcttcagaattaggctgaggcttgcggggtgggtcatgttgacctgggtga 
acagagatccctttaagaagaacttctccatgttccagaggcgcgttcttactgcaggtg 
agtggcagtatgggaattagtccacaggccccttctcgaatgcctgccctctcttgttcc 
ttgtcctcaacgtctttgaaacttgggcttgttgggaagacacctgcaaaaggatggatg 
cacatgaccttcagctctaatgaatcaagctgctgatgaggaattcactgggctcccaat 
ccagagagcttcgcacacacccactggggttgagacgagcactgggtttatttattgtgg 
acttttggagtctgaaggactcttgccacccatctgtttcacggaacagaacctgaggct 
cagagggagcaaggcgctcccccacagccgcattgagagcctgagcctgggcacccgatg 
gagtgaatgaggctggagtcccagacctgcctctcatgagcacgtcgcccactgagcctc 
agcgtcatcatctggaaaacagggataatattatgacctcagaagcttcgggggaggaag 
taagtgaartaatgcatttcagatgctcagcgccttttaagtgcttgatgctcattcccc 
aagattacatgagagacatggaaaatctttaatgaccaaggacccacccagggtcactca 
gccgggacccttggtccgtggcccagagtgtctccagtgccccwgcattgaggccctaaa 
caaggccagaagcaggtgccggggacccctctggattccaccagagcaccttcctaggat 
catggctcccaaaacggaagggaaggagacagcgcagtttgcaaagaggcaggatttaag 
caccagggtggccctgtggcgcctcaggaaaatgtttgcctgtcagtatctgctctcgtt 
cccacctgtccccacaaagcgaggccataagtcctcggcgtggcattggagggtctctga 
aggccctgagagctgtgtcagccacggtgtgttatgaagcaaggcagatgttttgttaat 
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Figur 1 (Cont'd) 



watttacacagcgtcggccccgcagaggactgcgctgacaggagcggctgtcacaggcct 
ggccgtggggcagaagtgagcagccgtcttcccctggcagtcctcctgaaaaggtctgca 
tggcaaggcctgagggagtcctgcacattttatgcccccgccccccaaagccatttgggt 
ttcccttaaactggcttgttttcctgagccggtggagagatccttgtcctccggaagtgg 
ctatcgctctggggcggcttctctgccagctcgtcacaccctagacccagctgtagtctg 
tgtggtgggagagggtgtcaccaggctctggaggtccactcctctgtagtcacctcatgc 
aaggagggcttcacaggggcccagcctctactccctcatccggaaaacgggccagtaaca 
ccaggcaccagccccgtgatcctcaggcacccttgggggtgatctgccttagaaattcaa 
ctttaggattagaattctgctaagaggtaccatgtgacaaaaaaggtagtgtaaaaatca 
caaagaccaggacaggctcatgcctataatcccagcactttgagaggccgaggcgggcag 
atcacttgaggtcaggagtttgagaccagcccagccaacatggcaaaaccccatcgctac 
taaaaatacgaaaaattagctgggcgtggtggtggacacctttaatcccagctactcggg 
agcctgaggagcctgaggcgtgagaatcacttgaacctgggagacagaggctgcagtgag 
ctgagattgtgccactgcactccagcctaggcaacagagcgagactctgtatcaaaa 

SEQ ID NO:276. >T11445_T21 # TY Transcript # LN 1697 # Source Gene: T11445 
tf Encoded protein: T11445_P10 

ganggggggtgggggcaagtgttttagccgtttggagcctttatttcctcttttggacat 
gganttttattgaggacctgagttttttaccttcttgttggcggtacatacanattggga 
aagannngtggcaagggagcgaagatnatataaggttgagtagccaaaatgnggatgcta 
ttgcaccttgcattatacggaangncagggacctcngaaggcgagaatnaagtttttttt 
tttttttntggaggatttttganntgtgggccttatgtttttttcccattctgnaggttc 
ggggaagncgatntgttnanggggtactgtagaatattctaccttggnagtttatttttg 
gattttgtgggggggattttaagggatccgtttaaggggctgagcgggctgtatggactt 
catgacagtctggccctccagacaagagcagcgctggcatcgggcaggtgattcctgaca 
cctgctgcctgcaggcattcactgaccaggcctttcctggaggaaacacccagggccggg 
cggctgctgtttccacacgtggactcggatctgctgtgacaccgtcagcccgacagtctc 
tccatatgcagcctttcctctgtacttttctccatggttgaaataaaacagggtgactgg 
gagttacttagaattcatgaagattttaaaatggctttggagattttgcttttaaaccag 
tagattcaaaacttaaacagcgtctgcagcacaatttcttgaggaaccttgaaaaacaca 
acttcccaggccccattcagtaatcccaggatttctttaagctccccaaataattttgaa 
actcatcagccgagtttctgccctcatgaggtaattccatcgttctccccagcctgcccc 
tggcagctgtaacacaggagctggcctgagagcagattcaccctggaatgttctctccac 
agaacaatcaagtccctgtcgcctgcctagtgcttaccactgaagatttttctgattcca 
gaccaactttttgccaacattctgcttccagctctctgagcccctgccgtgtctccccaa 
cactgccagccccagcacgcaatcaacctactttgtgcatgccacccgctttccacactg 
tgagaacaatctgcccaactggaccctctggaggcgcacgatctcagccactcaccaggc 
ctgagtgtttgtgaaatgatcatgtcctacttattacaaaaccgtaaccccaaaacattc 
cttttatttctgtgaaaccggccaaagtgaggtccacccaccttcacacagctctggcgg 
tgcacctgctcaccttctcttggttctcagaactgagctgggcttgagaacacagctttg 
gctttgccatttttttcctacttggctgctgaggtggagggtgtgctgcacttatcaccc 
catttcaaaaccaaaccaaacctgaggccaccccaaacaaattcagccagcaaaaagggt 
aggtatcgatggtcacctgaagcctcaagggagtccactctgacttctgacagcagacag 
aacctatctgtcctcaggaggggcgcatggctaacctacgctctcctctgagcctaggcc 
agtgctcaaggaggatcagcaccctgaagggccaggccaatggccttagcagccccctac 
tgcccccaaaaacacca 

SEQ ID NO: 277. >T11445JT22 # TY Transcript # LN 3647 # Source Gene: T11445 
# Encoded protein: T114 45_P9 

ggccgcgcggggctgagcgtgtcggcggccgggcgggcgggggcggcgtgtccgcgtgtc 
ccggggtgtgcgcgtccccggcgggtgtgcgtgtgcaccccggagcgtctggtggcgttg 
ttggggtgagtgcgtgcggggacgcgggcatggctccccgctacccacttcccgagtggg 
gacgacgggcggaggagtgtgccgggcgccgcagggagtacgtgcgggtctcagggagtg 
tgtgtgtgcacgtgtgtgcattccaaagtgatttgtgtgcaagcacatgcgtgagcctag 
aagggcgaaagtrtgcctgtgtgggtgtgagtataacgagtgtgcagggccgtgcgagcg 
tgtt tat tact gaggttgaggggccgccccctttgcgcccaccaggcctttgcacagtga 
tggccccgagccagggagccgtggagagagaagcggcctggcatgggctgggggtcgggg 
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ctgggcctgatccggtgaaccaggggccagragggcagactcagagcctggctgggcctg 
acagaggcaaaggtcttagagccaggctgcggcagggccactggcaggaccctctctgca 
tacacaggctgggcgagcggaggctgtggtgggcccctcggccgggcacccgaggacgtg 
tgttttcctaaatatcctccaggctctcctgtctttgttcttgtctctctccagccaggc 
ctggggggctgaccctagacttgcctgtgtatgtgggggcctctctgcctgccctgctaa 
gtagacggatcttttgatttatttatacctctcctaggctccaagtgtgaatcccaggct 
ggagtacagtggcgcaatctcagctycctgcaccctctgcctcctgggttcaagtgagcc 
tcctgcctcagcctcctcctgagtagctgggattataggcaggtgccaccacgcccagat 
aatttttgtatttttagtagagatggggtttcactgtgttggccagccaggctggtctcg 
aactcctgacctcaggtgatccgcccgccttggcctccaaagtgctggaattacaggcat 
gagccaccatgcccggccatgtccccatttactgaggatatactgtgtgctaggtctggt 
gctgtattttacccagtccactcaccaacactaagatggaagctgtcacgatgcccattt 
acagatgaggaaactggggcacagagagatgagagaatgcgccaaagaccatgtgctaat 
acgcagcagagctggatttgaacccaggcttgtctgagcagrgcccatggtctccccact 
gtactaggaaggtagcaggcactgtgggctggatgcctggcctgtgctgaggttcagtgt 
crttttcctgatcttacagagagggaggttaagtaacttgggcagggtcataaacccaca 
aatggcagagtcgggattcgaacccatgtctctgattccagagttcctcaccttaaacca 
catctggctgcctttgtcgtgctttgctcctgttgcatgctgcaaagccaagtaggttaa 
gacaggctaggttcctgcagggctgggttgcccgagacctgggagaggtcttccagaagg 
cactgctggccttcctcgcccagggccagagctactggagagaaagggctgctgttcaca 
ttttttcttttctttaagtgccccagatacttttcataagggaagtgtcaaaacaattgt 
caaaaggagaagaatatttccttcctcttctcaggctagaccaagcaagggagttggagg 
gtggagagtggagggacctggtgaatcatccaagattaaatacgggcaggcactgccagg 
cttgtgtggagatgaggctgggacactgttcgcaggaagagtccggtgttctggccccct 
gatgtcaccttcacgggcctgactcacagtcctaaatatctgacagcgagatcgcttgta 
ggtaggcttcttgtcacttgtcatttaagtcccctgcctgcctttttttccctgctcatt 
tccatttcgatcacggcttctactacatgtgcttgaaatgaatgttgtgtgtgaggacct 
gtgtggtttgggccatgttgtgcccccctgccagtgtccttcggtgttgaaaggaaaagg 
tatttctctcccttccatgtatgttctgcaaattgacgaacatttgcaaattgctttgag 
atgcgttaatttaaggtgtgaaataatagatcttgtgcgaaggctcagccacatttctgg 
cattgatggatttttctcccgtctggcctgtgatttggggaggtaaaaatagtccttttt 
tccacatttttatctttttttttttttgctgacatcattcttaggaagtggtgtgcacat 
gccgtggaactttacatggaacatggagctctggggagcaggagtgctcttggagtgagg 
ggtggtctgcggtggcccaggcaccccctggctgtccgtgaaatggctttgaaatgagca 
ggcactgttgagggcctcagtggctttttggtatttggctgtcttttcacttatgactct 
tgggtaatggcaagataacttttggggaaaatgttagtttaaaaagaaaaaaaatcaggc 
cagcactttcactcattcctgtctgagctggtcccttcctttggccaaagctgaactaaa 
ccaatggccttctcagactcgacttccagacagaggagccgacattggggcaaatcagtt 
ggtggtgtaatttcctgtgatgttaactttgggaacttgaacttggagtgggagctagca 
ttttgacatgtgtccctctaacaggacttgcttgtgtggaaacagacccccacatacatg 
gctgcgccattgtctccaattctggatttttccacagatgacttctcgggtgatctggtg 
gtaaaggcacaggtcttctagccaggccttcttggcctccagcaagggcaggatcacagc 
accccaccagacctgactggccaccaccgctgtagctgtggctgggcgagtgggggggcc 
atgggaattgaaagatggggtggccgagatctgaaatgcattcagttgcagtggcctgga 
tctgcacggagccgttctttctttctcttagacatctgcrtggtggtgacctcatagaat 
tctaagaaaaaaggggctttatgtaggaattccatagagtgtgaaggaatttggacaggg 
gaagggggagggaagttgccattcagagcctgcagtgcctgcattttccccgaattgttt 
aaccctcatgcttcagaattaggctgaggcttgcggggtgggtcatgttgacctgggtga 
acagagatccctttaagaagaacttctccatgttccagaggcgcgttcttactgcagatg 
gcgacactgaggcccagagagacatcatgaccagcctcaggtcccacatgcagcctgctg 
ctctgtagcctaaatcagatcactgtcccagctttcaggtcaaggctctggttgtgacac 
ctgggcctggactgcgggtgaaaagttttgatgcaaaaacacagaccgtccattgtccta 
tgggctaagtggactcccttctgctaaccagagatgcttgtgctggc 

SEQ ID NO:278. >T11445_T23 # TY Transcript f LN 1082 # Source Gene: T11445 
# Encoded protein: T11445_P11 

ggccgcgcggggctgagcgtgtcggcggccgggcgggcgggggcggcgtgtccgcgtgtc 



159/365 



Figure 1 (Cont'd) 



ccggggtgtgcgcgtccccggcgggtgtgcgtgtgcaccccggagcgtctggtggcgttg 
ttgggttcgtgccctcgtgaggctggcatgcaggatggcaggacagcccggccacatgcc 
ccatggagggagttccaacaacctctgccacaccctggggcctgtgcatcctcctgaccc 
acagaggcatcccaacacgctgtcttttcgctgctcgctggcggacttccagatcgaaaa 
gaagataggccgaggacagttcagcgaggtgtacaaggccacctgcctgctggacaggaa 
gacagtggctctgaagaaggtgcagatctttgagatgatggacgccaaggcgaggcagga 
ctgtgtcaaggagatcggcctcttgaagcaactgaaccacccaaatatcatcaagtattt 
ggactcgtttatcgaagacaacgagctgaacattgtgctggagttggctgacgcagggga 
cctctcgcagatgatcaagtactttaagaagcagaagcggctcatcccggagaggacagt 
atggaagtactttgtgcagctgtgcagcgccgtggagcacatgcattcacgccgggtgat 
gcaccgagacatcaagcctgccaacgtgttcatcacagccacgggcgtcgtgaagctcgg 
tgaccttggtctgggccgcttcttcagctctgagaccaccgcagcccactccctaggggg 
acgccctactacatgtcaccggaaaggaaccttgggaaacggcttaacacttccaggccg 
aaatctggctccttggggctggcctgttgaacgaaaatgggaaacctcccagaaaaccct 
tcttatggagaaaagaataagatctctttctcccccgcgcgcaaaaagatccaaacagcg 
gggagccacccgccccctccccagaggaagaccctgagtataaaagacttacccagaaac 
tggttaccaacgggaactttctcccctcggcacccccgcaccaggagggagcccttggaa 
ga 

SEQ ID NO:279. >T11445_T24 # TY Transcript # LN 2811 # Source Gene: T11445 
# Encoded protein: T11445_P12 

ggccgcgcggggctgagcgtgtcggcggccgggcgggcgggggcggcgtgtccgcgtgtc 
ccggggtgtgcgcgtccccggcgggtgtgcgtgtgcaccccggagcgtctggtggcgttg 
ttggggtgagtgcgtgcggggacgcgggcatggctccccgctacccacttcccgagtggg 
gacgacgggcggaggagtgtgccgggcgccgcagggagtacgtgcgggtctcagggagtg 
tgtgtgtgcacgtgtgtgcattccaaagtgatttgtgtgcaagcacatgcgtgagcctag 
aagggcgaaagtrtgcctgtgtgggtgtgagtataacgagtgtgcagggccgtgcgagcg 
tgtttattactgaggttgaggggccgccccctttgcgcccaccaggcctttgcacagtga 
tggccccgagccagggagccgtggagagagaagcggcctggcatgggctgggggtcgggg 
ctgggcctgatccggtgaaccaggggccagragggcagactcagagcctggctgggcctg 
acagaggcaaaggtcttagagccaggctgcggcagggccactggcaggaccctctctgca 
tacacaggctgggcgagcggaggctgtggtgggcccctcggccgggcacccgaggacgtg 
tgttttcctaaatatcctccaggctctcctgtctttgttcttgtctctctccagccaggc 
ctggggggctgaccctagacttgcctgtgtatgtgggggcctctctgcctgccctgctaa 
gtagacggatcttttgatttatttatacctctcctaggctccaagtgtgaatcccaggct 
ggagtacagtggcgcaatctcagctycctgcaccctctgcctcctgggttcaagtgagcc 
tcctgcctcagcctcctcctgagtagctgggattataggcaggtgccaccacgcccagat 
aatttttgtatttttagtagagatggggtttcactgtgttggccagccaggctggtctcg 
aactcctgacctcaggtgatccgcccgccttggcctccaaagtgctggaattacaggcat 
gagccaccatgcccggccatgtccccatttactgaggatatactgtgtgctaggtctggt 
gctgtattttacccagtccactcaccaacactaagatggaagctgtcacgatgcccattt 
acagatgaggaaactggggcacagagagatgagagaatgcgccaaagaccatgtgctaat 
acgcagcagagctggatttgaacccaggcttgtctgagcagrgcccatggtctccccact 
gtactaggaaggtagcaggcactgtgggctggatgcctggcctgtgctgaggttcagtgt 
crttttcctgatcttacagagagggaggttaagtaacttgggcagggtcataaacccaca 
aatggcagagtcgggattcgaacccatgtctctgattccagagttcctcaccttaaacca 
catctggctgcctttgtcgtgctttgctcctgttgcatgctgcaaagccaagtaggttaa 
gacaggctaggttcctgcagggctgggttgcccgagacctgggagaggtcttccagaagg 
cactgctggccttcctcgcccagggccagagctactggagagaaagggctgctgttcaca 
ttttttcttttctttaagtgccccagatacttttcataagggaagtgtcaaaacaattgt 
caaaaggagaagaatatttccttcctcttctcaggctagaccaagcaagggagttggagg 
gtggagagtggagggacctggtgaatcatccaagattaaatacgggcaggcactgccagg 
cttgtgtggagatgaggctgggacactgttcgcaggaagagtccggtgttctggccccct 
gatgtcaccttcacgggcctgactcacagtcctaaatatctgacagcgagatcgcttgta 
gttcgtgccctcgtgaggctggcatgcaggatggcaggacagcccggccacatgccccat 
ggagggagttccaacaacctctgccacaccctggggcctgtgcatcctcctgacccacag 
aggcatcccaacacgctgtcttttcgctgctcgctggcggacttccagatcgaaaagaag 
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ataggccgaggacagttcagcgaggtgtacaaggccacctgcctgctggacaggaagaca 
gtggctctgaagaaggtgcagatctttgagatgatggacgccaaggcgaggcaggactgt 
gtcaaggagatcggcctcttgaagcaactgaaccacccaaatatcatcaagtatttggac 
tcgtttatcgaagacaacgagctgaacattgtgctggagttggctgacgcaggggacctc 
tcgcagatgatcaagtactttaagaagcagaagcggctcatcccggagaggacagtatgg 
aagtactttgtgcagctgtgcagcgccgtggagcacatgcattcacgccgggtgatgcac 
cgagacatcaagcctgccaacgtgttcatcacagccacgggcgtcgtgaagctcggtgac 
cttggtctgggccgcttcttcagctctgagaccaccgcagcccactccctagggggacgc 
cctactacatgtcaccggagaggatcccatgaaaacacggctaccaattttccatatccg 
gaccccctgcgctccttcggcctgctcctgccttcatacagaatggcccgcaacctcccc 
ggaccccgccctttcctcccggacgaacccgattgcacaccttcctttccc 

SEQ ID NO:280. >T11445_T25 # TY Transcript # LN 1401 # Source Gene: T11445 

# Encoded protein: 

catcaaaacagataagaaaatcattgtttataatagttatcgaaggtcgaggtttttttg 
gtttaattttgcatttacttaaaactctgtgtttgatgccaacagggttcaagttgcaca 
ggaaggtgcttcggctaaacacggatagtcttccttgatctccactgtctgctgtgtggg 
ttccgccagcaaacctcccagcgagtggtcaccttcctatggagtcattaaacctctgaa 
tgtcagctggtgggattagccagcagcttccacacctgagcccccgtggcagggacagag 
gcatttggatcaagatcactaatgaatacagaattcatatgccgcttgtcagggctcctg 
agctcagagtcgcacagatgcaacctaaataagacggaggacagaacaatcgttgacaaa 
gattaaaaagttttggcagttgggaaaaaaagagcacctcattattcaaatatgatagtc 
ttgcttatttttttttatttttgttcaagttattcatgattcacctcaaatggacaaatg 
ccattcaccaatatttaiaagacttcttggtctcattactaaatctgaagagggctgtgct 
gtctgtacatcagctatatggagaaaacagctctcctgtaaaacagctctcttgtaaata 
atatttgaaaattttaagggaatataacaatcacaagctatttagggtggagggcagttt 
aaaaatgaaaaaaaaaatctagaattaaaagaaaaaaatgaccttccacagccaagtgat 
tttattacatttactaacatctctatctctgacttccttaaacttttctgttcctgggtt 
ccctccaatcacagagctgccaaagagctttacacaccttgtctgacacttgcttggctt 
tctgccgtcttcacaagacctcatcagagaaaagaaacagcccaagcagcccacacctct 
gctcacctggaataagatgtctttcccagtcttgttaaagaattacaggaataagaaagt 
gttaaagaattaaaggaataagatgtctttccctccctgttgtgaaagaatgacaaagag 
gcagatatagcccagtggctgaacccatcggagggctctaacgtcaggctgaccagggtt 
cagactctgagtcttccactgagtgtgtgacctctggccagttatttgacctccctacgg 
cccaattttcttctctacaaaatggagacaacaaagtactgacctcaaggaattgctgaa 
aagtaaatgaaataatataccactgtctatgaccaccacatatgcctgtctccaatatta 
cgctttcagataagaaggcctgtatctatctatatcatgctataaattcaccaagcctga 
tgccagcacaaataatcacat 

SEQ ID NO: 281. >T11445_T26 # TY Transcript # LN 437 # Source Gene: T11445 

# Encoded protein: 

catcaaaacagataagaaaatcattgtttataatagttatcgaaggtcgaggtttttttg 
gtttaattttgcatttacttaaaactctgtgtttgatgccaacagggttcaagttgcaca 
ggaaggtgcttcggctaaacacggatagtcttccttgatctccactgtctgctgtgtggg 
ttccgccagcaaacctcccagcgagtggtcaccttcctatggagtcattaaacctctgaa 
tgtcagctggtgggattagccagcagcttccacacctgagcccccgtggcagggacagag 
gcatttggatcaagatcactaatgaatacagaattcatatgccgcttgtcagggctcctg 
agctcagagtcgcacagatgcaacctccatgtctggcctaaacttatcttcaaatacagc 
cattgctgccaaggagc 

SEQ ID NO: 282. >T11445_T27 # TY Transcript # LN 220 ft Source Gene: T11445 

# Encoded protein: 

gggcgcgcgggcccgcgcaggcggtggcggcggcggcggaaccgagctgacgggcgtgcg 
gccgctgcgccgcaaactcgtgtgggacgcaccgctccagccgcccgcgggccagcgcac 
cggtcccccagcggcagccgagcccgcccgcgcgccggtgagtcgcctggggctggggcc 
gggccggtggggccccgcggcccggagaagggcgggggcc 
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SEQ ID NO:283. >T11445_P13 # TY Protein # CC #LN 338 # Source Gene: 

T11445 # Encoding Transcript: 1 

MLHRSPSGTRARPRAVRALVRLACRMAGQPGHMPHGGSSNNLCHTLGPVHPPDPQRHPNT 
LSFRCSLADFQIEKKIGRGQFSEVYKATCLLDRKTVALKKVQIFEMMDAKARQDCVKEIG 
LLKQLNHPNIIKYLDSFIEDNELNIVLELADAGDLSQMIKYFKKQKRLIPERTVWKYFVQ 
LCSAVEHMHSRRVMHRDIKPANVFITATGWKLGDLGLGRFFSSETTAAHSLVGTPYYMS 
PERIHENGYNFKSDIWSLGCLLYEMAALQSPFYGDKMNLFSLCQKIEQCDYPPLPGEHYS 
EKLRELVSMCICPDPHQRPDIGYVHQVAKQMHIWMSST 

SEQ ID NO: 284. >T11445_P1 # TY Protein # CC #LN 313 # Source Gene: T11445 

# Encoding Transcript: 5 

MAGQPGHMPHGGSSNNLCHTLGPVHPPDPQRHPNTLSFRCSLADFQIEKKIGRGQFSEVY 
KATCLLDRKTVALKKVQIFEMMDAKARQDCVKEIGLLKQLNHPNIIKYLDSFIEDNELNI 
VLELADAGDLSQMIKYFKKQKRLIPERTVWKYFVQLCSAVEHMHSRRVMHRDIKPANVFI 
TATGWKLGDLGLGRFFSSETTAAHSLVGTPYYMS PERIHENGYNFKSDIWSLGCLLYEM 
AALQSPFYGDKMNLFSLCQKIEQCDYPPLPGEHYSEKLRELVSMCICPDPHQRPDIGYVH 
QVAKQMHIWMSST 

SEQ ID NO:285. >T11445_P14 # TY Protein # CC #LN 306 # Source Gene: 

T114 4 5 # Encoding Transcript: 6 

MPHGGSSNNLCHTLGPVHPPDPQRHPNTLSFRCSLADFQIEKKIGRGQFSEVYKATCLLD 
RKTVALKKVQIFEMMDAKARQDCVKEIGLLKQLNHPNIIKYLDSFIEDNELNIVLELADA 
GDLSQMI KYFKKQKRL I PERTVWKYFVQLCSAVEHMHSRRVMHRDI KPANVFI TATGVVK 
LGDLGLGRFFSSETTAAHSLVGTPYYMSPERIHENGYNFKSDIWSLGCLLYEMAALQSPF 
YGDKMNLFSLCQKIEQCDYPPLPGEHYSEKLRELVSMCICPDPHQRPDIGYVHQVAKQMH 
IWMSST 

SEQ ID NO:286. >T11445_P3 # TY Protein # CC #LN 348 # Source Gene: T11445 

# Encoding Transcript: 8 

MVPVRNSADRASAPVPCPSRARPRAVRALVRLACRMAGQPGHMPHGGSSNNLCHTLGPVH 
PPDPQRHPNTLSFRCSLADFQIEKKIGRGQFSEVYKATCLLDRKTVALKKVQIFEMMDAK 
ARQDCVKEIGLLKQLNHPNIIKYLDSFIEDNELNIVLELADAGDLSQMIKYFKKQKRLIP 
ERTVWKYFVQLCSAVEHMHSRRVMHRDIKPANVFITATGVVKLGDLGLGRFFSSETTAAH 
SLVGTPYYMSPERIHENGYNFKSDIWSLGCLLYEMAALQSPFYGDKMNLFSLCQKIEQCD 
YPPLPGEHYSEKLRELVSMCICPDPHQRPDIGYVHQVAKQMHIWMSST 

SEQ ID NO:287. >T11445_P5 # TY Protein # CC #LN 245 # Source Gene: T11445 

# Encoding Transcript: 12 

MAGQPGHMPHGGSSNNLCHTLGPVHPPDPQQLNHPNIIKYLDSFIEDNELNIVLELADAG 
DLSQMI KY FKKQKRLI PERT VWKY FVQLCS AVE HMHS EUIVMH RDI KPANVFI T ATG VVKL 
GDLGLGRFFSSETTAAHSLVGTPYYMSPERIHENGYNFKSDIWSLGCLLYEMAALQSPFY 
GDKMNLFSLCQKIEQCDYPPLPGEHYSEKLRELVSMCICPDPHQRPDIGYVHQVAKQMHI 
WMSST 

SEQ ID NO:288. >T11445_P6 # TY Protein # CC #LN 276 # Source Gene: T11445 

# Encoding Transcript: 13 

MAGQPGHMPHGGSSNNLCHTLGPVHPPDPQRHPNTLSFRCSLADFQIEKKIGRGQFSEVY 
KATCLLDRKTVALKKVQI FEMMDAKARQDCVKEIGLLKY FKKQKRLI PERTVWKYFVQLC 
SAVEHMHSRRVMHRDIKPANVFITATGWKLGDLGLGRFFSSETTAAHSLVGTPYYMSPE 
RIHENGYNFKSDIWSLGCLLYEMAALQSPFYGDKMNLFSLCQKIEQCDYPPLPGEHYSEK 
LRELVSMCICPDPHQRPDIGYVHQVAKQMHIWMSST 

SEQ ID NO:289. >T11445_P7 # TY Protein # CC #LN 287 # Source Gene: T11445 

# Encoding Transcript: 18 

MAGQPGHMPHGGSSNNLCHTLGPVHPPDPQRHPNTLSFRCSLADFQIEKKIGRGQFSEVY 
KATCLLDRKTVALKKVQI FEMMDAKARQDCVKEIGLLKQLNHPN 1 1 KYLDSFIEDNELN I 
VLELADAGDLSQMIKY FKKQKRLI PERTVWKYFVQLCSAVEHMHSRRVMHRDI KPANVFI 
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TATGWKLGDLGLGRFFSSETTAAHSLVGTPYYMSPERIHENGYNFKSDIWSLGCLLYEM 
AALQSPFYGDKMNLFSLCQKIEQCDYPPLPGEHYSEKGSSCTGRCFG 

SEQ ID NO: 290 . >T11445_P8 # TY Protein # CC #LN 277 # Source Gene: T11445 

# Encoding Transcript: 19 

MAGQPGHMPHGGSSNNLCHTLGPVHPPDPQRHPNTLSFRCSLADFQIEKKIGRGQFSEVY 
KATCLLDRKTVALKKVQIFEMMDAKARQDCVKEIGLLKQLNHPNIIKYLDSFIEDNELNI 
VLELADAGDLSQMI KYFKKQKRLI PERTVWKYFVQLCSAVEHMHSRRVMHRDI KPANVFI 
TATGVVKLGDLGLGRFFSSETTAAHSLVGTPYYMSPERIHENGYNFKSDIWSLGCLLYEA 
WSGWYQGWASPSQKPPSRWKKKRRIRFSPPGTRKKNL 

SEQ ID NO: 291. >T11445_P9 # TY Protein # CC #LN 174 # Source Gene: T11445 

# Encoding Transcript: 20 

MAPRYPLPEWGRRAEECAGRRREYVRVSGSVCVHVCAFQSDLCASTCVSLEGRKYACVGV 
SITSVQGRASVFITEVEGPAPFAPTRPLHSDGPRAREPVEREAAWHGLGVGAGPDPVNQG 
PGGQTQSLAGPDRGKGLRARLRQGHWQDPLCIHRLGERRLWWAPRPGTRGRVFS 

SEQ ID NO: 292. >T11445_P10 8 TY Protein # CC #LN 98 # Source Gene: T11445 

# Encoding Transcript: 21 

MLLHLALYGKAGTSEGENEVFFFFFGGFLNCGPYVFFPFCRFGEDDVFMGYCRIFYLGSL 
FLDFVGGILRDPFKGLSGLYGLHDSLALQTRAALASGR 

SEQ ID NO:293. >T11445_P11 # TY Protein # CC #LN 309 # Source Gene: 

T11445 # Encoding Transcript: 23 

MAGQPGHMPHGGSSNNLCHTLGPVHPPDPQRHPNTLSFRCSLADFQIEKKIGRGQFSEVY 
KATCLLDRKTVALKKVQIFEMMDAECARQDCVKEIGLLKQLNHPNIIKYLDSFIEDNELNI 
VLELADAGDLSQMI KYFKKQKRLI PERTVWKYFVQLCSAVEHMHSRRVMHRDI KPANVFI 
TATGWKLGDLGLGRFFSSETTAAHSLGGRPTTCHRKRNLGKRLNTSRPKSGSLGLACGT 
KMGNLPENPSYGEKNKISFSPARKKIQTAGSHPPPPQRKTLSIKDLPRNWLPTGTFSPRH 
PRTRREPLE 

SEQ ID NO:294. >T11445_P12 # TY Protein # CC #LN 267 # Source Gene: 

T11445 # Encoding Transcript: 24 

MAGQPGHMPHGGSSNNLCHTLGPVHPPDPQRHPNTLSFRCSLADFQIEKKIGRGQFSEVY 
KATCLLDRKTVALKKVQIFEMMDAKARQDCVKEIGLLKQLNHPNIIKYLDSFIEDNELNI 
VLELADAGDLSQMI KYFKKQKRLI PERTVWKYFVQLCSAVEHMHSRRVMHRDI KPANVFI 
TATGWKLGDLGLGRFFSSETTAAHSLGGRPTTCHRRGSHENTATNFPYPDPLRSFGLLL 
PSYRMARNLPGPRPFLPDEPDCTPSFP 



SEQ ID NO: 295. >T23935 # TY Consensus # Length 15363 # Number of exons 25 

ctctcccccccaacccccactttcacaacgcggccccccacctttacctccctaagttct 

tcatctctttcgacattttttttgaaggctgatgggggggagggtggaagagaaagagtg 

aagaaaagttgcgagcgtctcctctcttcctaagcctcccgccccaacccacccctcaga 

gaaggagaagataatatactgaaaagaagaggaggaggagagcgacgggacgggacgcga 

gcgggagcgcagccgccctctcggctccgcggcggcgcctcgcaagtccgggaggcgagg 

ggggcccgaggggagacgccgtgacaactttcgtttccctctgagggaattgggaggtcg 

gcggccccaaaaaatgcctctttatgccactcaccttatggggctgatggagagtccacg 

taaaggataaccgtttcctcaaaggatttatgctgtttcacctccatcaagaaaagggct 

ttttaaagttaaggctttcagtccagtgtaaagctgttggagcgcgggagcaaaggtaaa 

gaatgatgtaatgcgctggctgctccaaagcatcttttgttgtggaatggttattccagt 

catctctttatgaatcaaatgtgaggggctgctttgtggacggagtcctttgcaagagca 

catcaacgggaaagagaaagagacattcacttggagggctcttgctgaaaatgggtttaa 
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ctctccttttgccagtcaccaccagcctgacctcatacacttttagtacaatggagtggc 
tgagcctttgagcacaccaccattacatcatcgtggcaaattaaagaaggaggtgggaaa 
agaggacttattgttgtcatggcccatgagatgattggaactcaaattgttactgagagg 
ttggtggctctgctggaaagtggaacggaaaaagtgctgctaattgatagccggccattt 
gtggaatacaatacatcccacattttggaagccattaatatcaactgctccaagcttatg 
aagcgaaggttgcaacaggacaaagtgttaattacagagctcatccagcattcagcgaaa 
cataaggtaaacgctttcctttttgccttttcaaaataaatggacacactgaatgaggaa 
taattttaattgaagcctcctctttaagtagagaaagtaatttgaggaagttttggttgc 
tttttaaattttggtgccttgtggtacctcaaaataatttattgtccatattgccacatt 
taataatatttctaaactatgaaggattaacttgctttagtttggacaagtgctatattt 
tcttgtggcttgctaaaattattttattaaacactattctatattatctgcaaatttact 
gaaaggaaagtcattcttttcaagttctttggaaagaaagtaggaactaattattagggg 
tgctttttatgtgtcataacacatgtggtaacattggtcaagacttaatgatatcttcaa 
gggaagttattcctgggatactaaaatgttgtctgtgtggtgaggaagttggtttttttc 
ttcacaaatacaaatgagctcatgttttaatttaaataaacaactgatttctttgttaag 
gtgtaaaagtcattttaaaaaaagtttgtttaggtccagggttatgtttattttgcatat 
tgtttggtttattatgaactttaccattttatagtataaaggatttaaaaccagggaatc 
gtgagatcacctacacagaaatagcatcattttggtttttaagagttttgtatgtgtctg 
tctcgtaaattgactacatctcatttttattcaaattattaattgaaaacaggaagcaat 
tattcgccccagcactggtaaatgtgagatcttaagcagtagtttgagattacttttctc 
accttcttcctaacaagcctcatgtttgcatttcaggttgacattgattgcagtcagaag 
gttgtagtttacgatcaaagctcccaagatgttgcctctctctcttcagactgttttctc 
actgtacttctgggtaaactggagaagagcttcaactctgttcacctgcttgcaggtaag 
gctcgtgatggaattaaaaggactcagaatcttgctgagctgtatgagtaagcatatttc 
actgtgaactcagtacaatagatcatcccaactttgctgtgtaaaatgatttatttatag 
cttttgattgatgtcaccgttctatttaacttgaatactatatcattcttccacataggt 
cagtgtatttttgacttggctcaagtgcctctttgcttgctattaactggtagtgccaca 
tagattygcacacccatatggatagctcaaagccagctgcttctgtctgcctttttccat 
tcacctgtgatgtaagaacaccaaatgatgaacatctcagctcctgctagttcccacccc 
tgtggcaagaaccagaccaatattacagtgtcaaactcttacttaacatttatagttgga 
gcggactccagagataagccttatggtcacctttctgggcattataaacaatgaaatagg 
gggagaatgtgtgcagaatgtttcttctggtaattaggggaattgttttaaaagttactt 
gtatatttttatttgttttataattaaaaaaaagtctcaacgtagaaatgctaagcaacc 
agatgatcttataaagtgtgtgggaaggaccaaatattctatatctctcctggaggacgg 
cattgtaggagttaatcccatcaaattccaaagaaggctattcatcctttgcttctcgtt 
ttgcaaatgaaaatcctttcccttgccctgtgggctcatagagctgggaatatagtttga 
caacacacagtaaccttaacataaagctgctgtgttgggcggattcagcttcagtgatag 
gcaagagtgcttttaaacatttggtctaagaaaaacaaaacattaagattaattagttgt 
wggtgtgttaataattttaaactctgggcaggtttagataagatttccatgtctccagtt 
ctctctcacaagcttttcctcagttctctaagaacaaggatagttcatctttttcagatt 
gtctatctgtacacctttcatttatgctgtgattccgtctgatctttcaattagtgttct 
taagtgaagccagcttgtcttagaatgtagttttcagaagtctgtaactgagaattataa 
tattagattctgtctctgttgatagacacattcttttactgggagtatgagtgtgttaaa 
tgtaacagctattattagcatttcactaaactaattgggtgagtttaggaactcatttca 
tgtttcaggtggccactgtgtgagggactttaccttggaatcctgttaatgggatcaggg 
tttttcttttctttttttttttttgagacagaattttgccctgtcgcccaggctagagtg 
cggtggtgtgaccttggcttgctgcaacctctgcttcccaggttcaagcgattctcctgc 
ctcagcctcccaagtagctgggattctgagcacctgccaccacgcctggctaatttttgt 
attttagtagagaccgggtttcaccatgttggccagtctggtctcgaactttgacctcag 
gtgatctgcctgcctcggcctcccaaagtgctgggattacaggcatgagccaccacaccc 
agccagagtctttcttaaagaagcaatatgccaagatgtgcttcgtgacagagtggttat 
cccaccatgttcttgtttgagttacttttaccctatggggagaccttactcaacatctcg 
gtcaagtagacagccccccagaggagtgagctcacattgagtttctagcacaggtgcttt 
agct act tgtaccctt tact cctttaccaaagagcattcctttgttacctgggaaggttc 
ttaatctatcctgtgtagccttgggaggggcattcactgggcaataaggaagaaacagga 
ctgcctgattagctatgtcaactctagggacacgtggtgtgacacactgagtcagaacac 
cctgacatccctgtttcaaacaggaagtgtttcttaggcctaacggcaagtccatggagg 
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cagcaccaggagccacattaaatgtattagtagcaccatatcttcatccatacctccaga 
atcctgtgtttctattagcttaggaggtgacttcagagtctgaattgcttgtttgggaat 
ttctgagcagctggatttcttgtgacacctaagagagggttctagatgattagaagcctt 
cccaatcatccaaaagcaattggggattctcttggttagtgaattttttcgtcagggtag 
gtagataattggggtaagtggataacagtattggagaaggtttggccatcaattcttggg 
aaactgttttattgcaaaggtccagctgctcagaggacaagccagggatgtgggacagaa 
ggacatattgacaaaagtttcttaaatattttggttggggagaaataactgaaaagcacc 
ctagtttctcacaggggctactgtaagttacaaggtcactggagacaattttcactacaa 
actgcccctgtgactttcaacagttggagattgttgtcagtctcgtttggacatggtaat 
agaaagacaaagtggtttcttggaataatttttccatcggtttagaactttcagccacct 
tgagaaaagaaagcatcagatatgtgagatggtttttatggtcattagctatggcagtga 
ggaattgtggccccacatctactcacactctcagggctaccttacccacggaagaatgac 
tgtgaataaggagaacaagaggactcccaagaaattgatagcagtatcttaacgagctct 
tttccttcaaacaagaggagactgacacttacctggaggacctacttacatggtcatgaa 
aaatgaaggattactgaaggagaaggaagtgggatttggggaacctctacaggaatgcag 
tgggcttagttttttaatatggaccaggtcttgtttacctttgtgttcccgcaaggccta 
gcccttcttaagttttcagtaaatattttgatattagcttacctgaaggttttatattgt 
ttatatttcctatgatttatcagtctagaatataagcatattaagcagtgatgaagtctg 
aaagtagagaaaacttcagattgtttcaaaataggtgatttggaaggtgtatttattctg 
ataaagcaaatatatagctgcgatgggaaaatatctaatatgaaactttcttcatccata 
taagcagatgtgaagatggtattaaaatctgatgaaaatgctgctgtagtgaaacagtgc 
taattttctgtggtccttaaatactgatatttgataaagagctaacaaccctagaaatct 
agctccactcataggaatacgcaagtccttgaattcatgttaattgagaacagtcagaat 
aattattgtatttttaaaaatgtaaatgtaaattgattgctctagtcattagtaaataac 
ctactaaccaagtcaaactaagccatagcattgtttcactcccagtggattaaagggttt 
aatatcagtagcagctgggtgtggtggtgcacccctctatttccagctactcgggaagct 
gatgggggaggattgcttgagcccaggagttcgaggttacagtgagctgtgattgcaaca 
ctgcactacagtctgggtgacagagagcctgtctctaaaaaaacaaaacacaaaattatc 
agtagcaatctcatcttaaaaaaaaaacaaatttggatgctttagaaagttctccattac 
tgcccggccatggtgacttacacctgtaatcccagaactttgggaggcygaggtgggtgg 
atcacctgaggtcaggagttcgagaccagcctggccaacatggcaaaaccccatctctac 
taaaaatataaaaattagccgggcatggtggcatctgcctgtagtctcagctactcagga 
ggctgaggcaggagaattgcttgaacccgggaggcagaggttgcagtgagccaagatcgc 
atcactgcaccctggtgtaggcaacwgagtgagactctgtctcgtgggtttgctgagttc 
tctcgttgtttccctggcctctgtgaaggaaaatccactctagtccctacctgcatttct 
cagccttgcttacctgttgccaacattgggccaacccgaattcttcccaatctttatctt 
ggctgccagcgagatgtcctcaacaaggagctgatgcagcagaatgggattggttatgtg 
ttaaatgccagcaatacctgtccaaagcctgactttatccccgagtctcatttcctgcgt 
gtgcctgtgaatgacagcttttgtgagaaaattttgccgtggttggacaaatcagtagat 
ttcattgtaattcaaggcaaaatctccaggccactttttgaattcagcccatattcacta 
caattgctaagtcttctcctctgtggggttttttgttgttgttcttgctttttatttact 
caagcagaagaaagacacaagaatgataacccacatacatacagagtagtagaatggcaa 
gagctaattattctcaaacacaaattcttaaagataaatgtaaaatttttttttaatttt 
taaatttttttatttttatttattattatactttaagttttagggatctgtggggtttta 
agccatggttagtgatagcatgtcatttgaagatgtacccccttaatgacttacaggctc 
atagtgattttttcttttttgttgttgttatttagattagatttgggtaagtctccgggt 
tggttggaaggactcaggtttatgtggagttttgtcttttagttaatgtgccagtaacag 
agtttgcccgtttgtgctatgggatactcatgaggtgcccccaaacccagctggctacat 
atacagcctcttgcttcctcttcactgtctcctgggctcttcctccagctctcctcatgc 
caaactcgtgactgcatggacacaacatagggaggtgattaggaccatgtattttggagg 
caaactgtctgtatttgagtcctgcctaccctggttattagtgaagtggtttgagtaagt 
tacttcaaacttctgtgcacttcagtttcttgatctggaaactgaaggtaatatacctac 
cttacagggatattgtgaggattaaaatagataatgtaaaatgaaaaaaaaagaaaaaaa 
gtaaaccaaaaagataaaataaaaataagtaaaatagataatgtccattgaacaccttgt 
aaagtacctgactcataattggctgttaataartattttaaaaattgttatctgaagttc 
ctccttttttctgtcctttgagctggactctgtaacacctcttagtattacttccttctt 
tttagcttttgtaacttctcatcctcactgcatctttgcccttttcatttgcagttctct 
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ctcccctttcttaacctagattttagccttaattgcggtaatggccacctgcttgtttcc 
agttttcatcatcattgccttctcgttctttattcagaaacctgaagaggcttcttagtt 
ttctgcrttaaaaaggtagccttggccaggtgcagcgtgtgtaatcaatcacagcacttt 
gggaagcggaggcgggaggattgcttgaggccaggagtttgacaccagcctagtcaacat 
agcaagatcctgtctctacaaaaaattttcaaaaattagccaggcatggtggtgtgcacc 
tgtagtcctagctgggaggctgaggcaggaggatcgcttgagcctaggagctcacagtta 
cagtgagctatgtttgcactatggcactccagcctggatgaacagagtgagaccctgtct 
gaaaaaacaaatgtagcccttagcctaacattcccaccaccctcaaacacctgcttccca 
cctttgaaaccttctcaaaccttctctctggtttcttctccaaggcaggctgttttcgaa 
tgagcactttgctcagtcccactgcagggctttctcgcacacatttcccttctcctctct 
gccagttgtgtgtgttaaattaattcctttgcttgaagacacagtttagggctcatgtct 
cccaagatgcatgctggtgaacctcaactccagtatagtcacttgtttttttcattttta 
aaagacgtcatcaaaagagttgtggttataaaaacagttgtaagcacattgcagaaaatt 
tgaaaaacagagaaaaggaaacctataacctcaacatcgtaacgcatgttactgtcattg 
ttttatttccttccagccttcttctttttttattaaaatctcttttgaaaatatgtatca 
gccagtatcagctggatgcctaccatgtgcctagcgttaagtcttttgggggattgttag 
aagcaagcaagatccttctaggtccttataatctaggttttgagatagataattattaaa 
attaatgagagagtattgtcaacattaaaaaacaaaaacccagactccttccagtaagga 
tttgaagcagttttcaataataggttagaaccaaaagatacacatattctcacacataca 
cattataatggtataaaggagcaataatcaagtaataggggctaggaaatgtacagtaac 
tgacaaaaagtctgaattgctacttcttattgaaattgagtagaaaccttagctttgagc 
ttcttagcagtagggcagaaagggaaggaaacactgagagtaacgtggggtaaatgatag 
ctgagaaggaaggggatgctggaggcacaggaaggcatgagactgtgtacacgagtttat 
atgttcttacttgtatagcatgcaattctttgtttgagtgttgcttggtattcatttgtt 
agtgttttattgtgtttggtttcctgatctagaagggaagctgcttgctgcttaattatt 
catatgctacccaaaacctgtatgttgctttatatatgatgaataattatgttatatata 
ataatcattacactagaaacttatatttatagcataggcagtgagattgaatagctttgt 
attattcagctgccaaagtaagattctttgcgcatctgttaatcgcaagaattttaacac 
aaggttcagtagacttttggtagtatctaccaagaatgaaaaggggagaattcagcttca 
tgactgagcacacatgtaattatttagaattgaggcagtcatggaactggccagaatgtt 
agtagcttccaggctgtgtggaataacctcgtcccctgccacatgtacatattgttaagc 
ccctcagcagtgagagatggcagcactgcttggatttccttggttctttggttctaacaa 
agtgtccgtgagtccttccatcatgttgtggggaattgagataatgccataagacaataa 
ctcctggtatcaaggaagggataaaagttctcttggatctcattaaagtgacctgctaaa 
cctgattcatagcctatacaaggttttcttctaaaactcagtaaataaagttgagccatt 
ttatgacatacagaaaagttgcttaaaatagggaggccaggggcagtggctcactcctgt 
aatcccagcactttgggaggctgaggtggacggatcacttgaggccaggagttagagacc 
agcctgcccaacatggtgaaaccgcaactccactaaaaatagaaaaattagccaggcgtg 
gtgctgcacacctgtaatcccagctactctggaggctaaggcaggagaatcacttgaatc 
cgggaggcggagattgcaatgagccgacatcgtgcactccagcctgggcgacagagcaag 
actctatctcaaaatagtaataatgattttaaaaaatgtaattatatttcttaggtaatc 
agggaaaaatttgaagtgtcagataacaggaatatattaaatattacccatatcttaggg 
gaaaatactaacataagtcaggaaacctgagttagtagtagcttgcttattaacgaattg 
cttattaacaaacagttttggaccccagttgtcctgcgaggcgatctagaggatcccttg 
gtagtcacctttactaatatatttccattatatatgaaatgtgttcattattagaaactt 
ggaagttacagaaatgtagaagagaaactcacccagtgttttcaccatccaaagagtgtg 
attgtccnagtgatgtgatggttcagtgatgtgatgtgagaaagcaaaagcctccaatgg 
atgtgttctagtgcactgtttagctgggatctcccgctccgccaccatcgctatcgccta 
catcatgaagaggatggacatgtctttagatgaagcttacagatttgtgaaagaaaaaag 
acctactatatctccaaacttcaattttctgggccaactcctggactatgagaagaagat 
taagaaccagactggagcatcagggccaaagagcaaactcaagctgctgcacctggagaa 
gccaaatgaacctgtccctgctgtctcagagggtggacagaaaagcgagacgcccctcag 
tccaccctgtgccgactctgctacctcagaggcagcaggacaaaggcccgtgcatcccgc 
cagcgtgcccagcgtgcccagcgtgcagccgtcgctgttagaggacagcccgctggtaca 
ggcgctcagtgggctgcacctgtccgcagacaggctggaagacagcaataagctcaagcg 
ttccttctctctggatatcaaatcagtttcatattcagccagcatggcagcatccttaca 
tggcttctcctcatcagaagatgctttggaatactacaaaccttccactactctggatgg 
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gaccaacaagctatgccagttctcccctgttcaggaactatcggagcagactcccgaaac 
cagtcctgataaggaggaagccagcatccccaagaagctgcagaccgccaggccttcaga 
cagccagagcaagcgattgcattcggtcagaaccagcagcagtggcaccgcccagaggtc 
ccttttatctccactgcatcgaagtgggagcgtggaggacaattaccacaccagcttcct 
tttcggcctttccaccagccagcagcacctcacgaagtctgctggcctgggccttaaggg 
ctggcactcggatatcttggccccccagacctctaccccttccctgaccagcagctggta 
ttttgccacagagtcctcacacttctactctgcctcagccatctacggaggcagtgccag 
ttactctgcctacagctgcagccagctgcccacttgcggagaccaagtctattctgtgcg 
caggcggcagaagccaagtgacagagctgactcgcggcggagctggcatgaagagagccc 
ctttgaaaagcagtttaaacgcagaagctgccaaatggaatttggagagagcatcatgtc 
agagaacaggtcacgggaagagctggggaaagtgggcagtcagtctagcttttcgggcag 
catggaaatcattgaggtctcctgagaagaaagacacttgtgacttctatagacaatttt 
tttttcttgttcacaaaaaaattccctgtaaatctgaaatatatatatgtacatacatat 
atatttttggaaaatggagctatggtgtaaaagcaacaggtggatcaacccagttgttac 
tctcttaacatctgcatttgagagatcagctaatacttctctcaacaaaaatggaagggc 
agatgctagaatcccccctagacggaggaaaaccattttattcagtgaattacacatcct 
cttgttcttaaaaaagcaagtgtctttggtgttggaggacaaaatcccctaccattttca 
cgttgtgctactaagagatctcaaatattagtctttgtccggacccttccatagtacacc 
ttagcgctgagactgagccagcttgggggtcaggtaggtagaccctgttagggacagagc 
ctagtggtaaatccaagagaaatgatcctatccaaagctgattcacaaacccacgctcac 
ctgacagccgagggacacgagcatcactctgctggacggaccattaggggccttgccaag 
gtctaccttagagcaaacccagtacctcagacaggaaagtcggggctttgaccactacca 
tatctggtagcccattttctaggcattgtgaataggtaggtagctagtcacacttttcag 
accaattcaaactgtctatgcacaaaattcccgtgggcctagatggagataatttttttt 
tcttctcagctttatgaagagaagggaaactgtctaggattcagctgaaccaccaggaac 
ctggcaacatcacgatttaagctaaggttgggaggctaacgagtctacctccctctttgt 
aaatcaaagaattgtttaaaatgggattgtcaatcctttaaataaagatgaacttggttt 
caagccaaatgtgaatttatttgggttggtagcagagcagcagcaccttcaaattctcag 
ccaaagcagatgtttttgccctttctgcttcactgcatggatacagttggtaaaatgtaa 
taatatggcagaattttataggaaacttcctagggaggtaaattatgggaagattaagaa 
aggtacaaattgctgaggagaagcaggaaacctgtttccttagtggcttttatcccctcg 
gcatgcgatggggctgatgtttctataattgcctcagactttcacatttactagtagggc 
tgagagaggctttagtgaggaaggaatattcagaataaaacggttgagaaagctgagaag 
accattgagttttgatcagttgtgaatagagtgcaaagccatggccaagctgtttttgga 
aacgctggccggcgtgtcttcagtggaaaaagcaaatcaaaatggagcgagagcaaaggg 
gcgtcctcagtcctcaacctacaatcactgtatggaatcggtcctggcagctgaacatag 
gaggtcactggaacaagtgatagtgcagattggctttcaaacatcctcctggcttgagtt 
ttatcagctacaatgtgggtcctcttttgaagccttaattcacaacagcagctttttggg 
ggtggggctgggcgggtgttgtcattgttctttcccttcctgtaagtgtcgctagttgct 
gcctcgtatctcaggtttttctctgtttttgagaaatggacagttttttgaccaggatgt 
gacttcatgtttcctatggtgacttctaaaaccagcacagaatgatatgactcaacacag 
accgacttggttatggggatgatgagccgcacagacctcactagttgtgcacaaataatg 
tgctatgatggggtgtaaagtgaaggcagaagagggtcagccgcattgttatgatactgg 
gaaagtgctggtcaacgatttgagttagtttttagatatacattgaaatctttaatcaga 
cattctcaagtttcacacagtagtttttgatgttatgtacacacacaccaaatgtgtaac 
agttcaccacttccagagtgtggtcatgcccaaaacatgtttaagaaaggaaagcagtag 
ctccttgctaacgatgtttcaggaggtttggggcacttggttttaatgagcttctgtcat 
ttagcatctctaatatgacgcccagcacacaagtacagaggctggccaggcgaataaaaa 
ggataagcaaggtaagcggtcacccctggggaacgaggcggccagcgatgggggggatgg 
ggtcataatagcggcttctcttggccatggtccccttccttctggaactgtgatgtagtc 
acatcctacagcctttagtgctggttcactagtgtcagataatcagttcttggaatcgag 
actgccgtggcgaaggggtggcctcggaggcaggctctggagctgcttggatgtctttag 
gtggggtggtggctggctgctctcttcagcatgtaattggggaaaccctcgcgtctacta 
ggggtgatacagatggtgattttaaagagcaaaactagacttctatgtgagaagtctagt 
tttctcttcagcatgtaattggggaaaccctcgcgtctactaggggtgatacagatggtg 
attttaaagagcaaaactagacttctatgtgagaagtgctggaaaatgatttaggacatg 
tgtaaagttagatggaaagactgtaaatgtttaatatgaatatagtgttcttttgaagta 
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aggccagctgttgaacggttaaactgtgcatttctcattttgatgtgtcatgtatgttaa 
tgtatgaaatgattaaataaaatcaaaactggtacctgtttatacataaatacgagaaaa 
gacctatctttgcagccataaactcggtgggaacaccaccactcaagttgccaaaggagg 
cagtggtgaaacctgtcctgttctcacttaaatgaggatttagctcaaaataaagtggtg 
gtgtcatcaggtttattccgtgttctgtcattcacatggaacaccggatgattagctaac 
agtttagtgccagccttcattctttactgtgtacgttaaatgcacactacagtgaaaaag 
cctaagacacttggtaaatattttctagctgactgattccagaacacacaagtacatggt 
aggaaaaaaaaggccagctacacactcatggcccatttcgatataacatgtaaatgatgt 
tacgattcctacaggaaaaacaggaagggcaattgttgaagagctgtcctttgtccttgc 
cacatacatctcaacaacctctctgctcatgacactagatttgatgacttatgagactag 
aaacttagtctcatcagtttcataaaaattaggcattgaaggaataaagcttcagtgcaa 
acagatggaatattgactacagaaagaatcacaggtgaccttgaagttggttgagttgaa 
gttacatttaagaaaatttataagaaagattgggccgggtgagctagactagaagagcta 
tcttggccaggaccctgcagaaaaagaggtggaagtgagatagaggaataaagggaatta 
gaataaatcagcctccagagatagagggtaacaggaggaacagttgtttcagtgagaggc 
ttccaatgagaagatgcagatcttgggagagcctgaagttcaggccagttgctgacagtg 
etc 

SEQ ID NO:296. >T23935_T1 # TY Transcript # LN 5351 # Source Gene: T23935 
# Encoded protein: T23935_P1 

ctctcccccccaacccccactttcacaacgcggccccccacctttacctccctaagttct 
tcatctctttcgacattttttttgaaggctgatgggggggagggtggaagagaaagagtg 
aagaaaagttgcgagcgtctcctctcttcctaagcctcccgccccaacccacccctcaga 
gaaggagaagataatatactgaaaagaagaggaggaggagagcgacgggacgggacgcga 
gcgggagcgcagccgccctctcggctccgcggcggcgcctcgcaagtccgggaggcgagg 
ggggcccgaggggagacgccgtgacaactttcgtttccctctgagggaattgggaggtcg 
gcggccccaaaagctttcagtccagtgtaaagctgttggagcgcgggagcaaaggtaaag 
aatgatgtaatgcgctggctgctccaaagcatcttttgttgtggaatggttattccagtc 
atctctttatgaatcaaatgtgaggggctgctttgtggacggagtcctttgcaagagcac 
atcaacgggaaagagaaagagacattcacttggagggctcttgctgaaaatgggtttaac 
tctccttttgccagtcaccaccagcctgacctcatacacttttagtacaatggagtggct 
gagcctttgagcacaccaccattacatcatcgtggcaaattaaagaaggaggtgggaaaa 
gaggacttattgttgtcatggcccatgagatgattggaactcaaattgttactgagaggt 
tggtggctctgctggaaagtggaacggaaaaagtgctgctaattgatagccggccatttg 
tggaatacaatacatcccacattttggaagccattaatatcaactgctccaagcttatga 
agcgaaggttgcaacaggacaaagtgttaattacagagctcatccagcattcagcgaaac 
ataaggttgacattgattgcagtcagaaggttgtagtttacgatcaaagctcccaagatg 
ttgcctctctctcttcagactgttttctcactgtacttctgggtaaactggagaagagct 
tcaactctgttcacctgcttgcaggtgggtttgctgagttctctcgttgtttccctggcc 
tctgtgaaggaaaatccactctagtccctacctgcatttctcagccttgcttacctgttg 
ccaacattgggccaacccgaattcttcccaatctttatcttggctgccagcgagatgtcc 
tcaacaaggagctgatgcagcagaatgggattggttatgtgttaaatgccagcaatacct 
gtccaaagcctgactttatccccgagtctcatttcctgcgtgtgcctgtgaatgacagct 
tttgtgagaaaattttgccgtggttggacaaatcagtagatttcattgagaaagcaaaag 
cctccaatggatgtgttctagtgcactgtttagctgggatctcccgctccgccaccatcg 
ctatcgcctacatcatgaagaggatggacatgtctttagatgaagcttacagatttgtga 
aagaaaaaagacctactatatctccaaacttcaattttctgggccaactcctggactatg 
agaagaagattaagaaccagactggagcatcagggccaaagagcaaactcaagctgctgc 
acctggagaagccaaatgaacctgtccctgctgtctcagagggtggacagaaaagcgaga 
cgcccctcagtccaccctgtgccgactctgctacctcagaggcagcaggacaaaggcccg 
tgcatcccgccagcgtgcccagcgtgcccagcgtgcagccgtcgctgttagaggacagcc 
cgctggtacaggcgctcagtgggctgcacctgtccgcagacaggctggaagacagcaata 
agctcaagcgttccttctctctggatatcaaatcagtttcatattcagccagcatggcag 
catccttacatggcttctcctcatcagaagatgctttggaatactacaaaccttccacta 
ctctggatgggaccaacaagctatgccagttctcccctgttcaggaactatcggagcaga 
ctcccgaaaccagtcctgataaggaggaagccagcatccccaagaagctgcagaccgcca 
ggccttcagacagccagagcaagcgattgcattcggtcagaaccagcagcagtggcaccg 



168/365 



Figure 1 (Cont'd) 



cccagaggtcccttttatctccactgcatcgaagtgggagcgtggaggacaattaccaca 
ccagcttccttttcggcctttccaccagccagcagcacctcacgaagtctgctggcctgg 
gccttaagggctggcactcggatatcttggccccccagacctctaccccttccctgacca 
gcagctggtattttgccacagagtcctcacacttctactctgcctcagccatctacggag 
gcagtgccagttactctgcctacagctgcagccagctgcccacttgcggagaccaagtct 
attctgtgcgcaggcggcagaagccaagtgacagagctgactcgcggcggagctggcatg 
aagagagcccctttgaaaagcagtttaaacgcagaagctgccaaatggaatttggagaga 
gcatcatgtcagagaacaggtcacgggaagagctggggaaagtgggcagtcagtctagct 
tttcgggcagcatggaaatcattgaggtctcctgagaagaaagacacttgtgacttctat 
agacaatttttttttcttgttcacaaaaaaattccctgtaaatctgaaatatatatatgt 
acatacatatatatttttggaaaatggagctatggtgtaaaagcaacaggtggatcaacc 
cagttgttactctcttaacatctgcatttgagagatcagctaatacttctctcaacaaaa 
atggaagggcagatgctagaatcccccctagacggaggaaaaccattttattcagtgaat 
tacacatcctcttgttcttaaaaaagcaagtgtctttggtgttggaggacaaaatcccct 
accattttcacgttgtgctactaagagatctcaaatattagtctttgtccggacccttcc 
atagtacaccttagcgctgagactgagccagcttgggggtcaggtaggtagaccctgtta 
gggacagagcctagtggtaaatccaagagaaatgatcctatccaaagctgattcacaaac 
ccacgctcacctgacagccgagggacacgagcatcactctgctggacggaccattagggg 
ccttgccaaggtctaccttagagcaaacccagtacctcagacaggaaagtcggggctttg 
accactaccatatctggtagcccattttctaggcattgtgaataggtaggtagctagtca 
cacttttcagaccaattcaaactgtctatgcacaaaattcccgtgggcctagatggagat 
aatttttttttcttctcagctttatgaagagaagggaaactgtctaggattcagctgaac 
caccaggaacctggcaacatcacgatttaagctaaggttgggaggctaacgagtctacct 
ccctctttgtaaatcaaagaattgtttaaaatgggattgtcaatcctttaaataaagatg 
aacttggtttcaagccaaatgtgaatttatttgggttggtagcagagcagcagcaccttc 
aaattctcagccaaagcagatgtttttgccctttctgcttcactgcatggatacagttgg 
taaaatgtaataatatggcagaattttataggaaacttcctagggaggtaaattatggga 
agattaagaaaggtacaaattgctgaggagaagcaggaaacctgtttccttagtggcttt 
tatcccctcggcatgcgatggggctgatgtttctataattgcctcagactttcacattta 
ctagtagggctgagagaggctttagtgaggaaggaatattcagaataaaacggttgagaa 
agctgagaagaccattgagttttgatcagttgtgaatagagtgcaaagccatggccaagc 
tgtttttggaaacgctggccggcgtgtcttcagtggaaaaagcaaatcaaaatggagcga 
gagcaaaggggcgtcctcagtcctcaacctacaatcactgtatggaatcggtcctggcag 
ctgaacataggaggtcactggaacaagtgatagtgcagattggctttcaaacatcctcct 
ggcttgagttttatcagctacaatgtgggtcctcttttgaagccttaattcacaacagca 
gctttttgggggtggggctgggcgggtgttgtcattgttctttcccttcctgtaagtgtc 
gctagttgctgcctcgtatctcaggtttttctctgtttttgagaaatggacagttttttg 
accaggatgtgacttcatgtttcctatggtgacttctaaaaccagcacagaatgatatga 
ctcaacacagaccgacttggttatggggatgatgagccgcacagacctcactagttgtgc 
acaaataatgtgctatgatggggtgtaaagtgaaggcagaagagggtcagccgcattgtt 
atgatactgggaaagtgctggtcaacgatttgagttagtttttagatatacattgaaatc 
tttaatcagacattctcaagtttcacacagtagtttttgatgttatgtacacacacacca 
aatgtgtaacagttcaccacttccagagtgtggtcatgcccaaaacatgtttaagaaagg 
aaagcagtagctccttgctaacgatgtttcaggaggtttggggcacttggttttaatgag 
cttctgtcatttagggcttctcttggccatggtccccttccttctggaactgtgatgtag 
tcacatcctacagcctttagtgctggttcactagtgtcagataatcagttcttggaatcg 
agactgccgtggcgaaggggtggcctcggaggcaggctctggagctgcttggatgtcttt 
aggtggggtggtggctggctctcttcagcatgtaattggggaaaccctcgcgtctactag 
gggtgatacagatggtgattttaaagagcaaaactagacttctatgtgagaagtgctgga 
aaatgatttaggacatgtgtaaagttagatggaaagactgtaaatgtttaatatgaatat 
agtgttcttttgaagtaaggccagctgttgaacggttaaactgtgcatttctcattttga 
tgtgtcatgtatgttaatgtatgaaatgattaaataaaatcaaaactggtacctgtttat 
acataaatacg 

SEQ ID NO: 297. >T23935_T2 # TY Transcript # LN 5100 # Source Gene: T23935 
# Encoded protein: T23935_P1 

aatgcctctttatgccactcaccttatggggctgatggagagtccacgtaaaggataacc 
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gtttcctcaaaggatttatgctgtttcacctccatcaagaaaagggctttttaaagttaa 
ggctttcagtccagtgtaaagctgttggagcgcgggagcaaaggtaaagaatgatgtaat 
gcgctggctgctccaaagcatcttttgttgtggaatggttattccagtcatctctttatg 
aatcaaatgtgaggggctgctttgtggacggagtcctttgcaagagcacatcaacgggaa 
agagaaagagacattcacttggagggctcttgctgaaaatgggtttaactctccttttgc 
cagtcaccaccagcctgacctcatacacttttagtacaatggagtggctgagcctttgag 
cacaccaccattacatcatcgtggcaaattaaagaaggaggtgggaaaagaggacttatt 
gttgtcatggcccatgagatgattggaactcaaattgttactgagaggttggtggctctg 
ctggaaagtggaacggaaaaagtgctgctaattgatagccggccatttgtggaatacaat 
acatcccacattttggaagccattaatatcaactgctccaagcttatgaagcgaaggttg 
caacaggacaaagtgttaattacagagctcatccagcattcagcgaaacataaggttgac 
attgattgcagtcagaaggttgtagtttacgatcaaagctcccaagatgttgcctctctc 
tcttcagactgttttctcactgtacttctgggtaaactggagaagagcttcaactctgtt 
cacctgcttgcaggtgggtttgctgagttctctcgttgtttccctggcctctgtgaagga 
aaatccactctagtccctacctgcatttctcagccttgcttacctgttgccaacattggg 
ccaacccgaattcttcccaatctttatcttggctgccagcgagatgtcctcaacaaggag 
ctgatgcagcagaatgggattggttatgtgttaaatgccagcaatacctgtccaaagcct 
gactttatccccgagtctcatttcctgcgtgtgcctgtgaatgacagcttttgtgagaaa 
attttgccgtggttggacaaatcagtagatttcattgagaaagcaaaagcctccaatgga 
tgtgttctagtgcactgtttagctgggatctcccgctccgccaccatcgctatcgcctac 
atcatgaagaggatggacatgtctttagatgaagcttacagatttgtgaaagaaaaaaga 
cctactatatctccaaacttcaattttctgggccaactcctggactatgagaagaagatt 
aagaaccagactggagcatcagggccaaagagcaaactcaagctgctgcacctggagaag 
ccaaatgaacctgtccctgctgtctcagagggtggacagaaaagcgagacgcccctcagt 
ccaccctgtgccgactctgctacctcagaggcagcaggacaaaggcccgtgcatcccgcc 
agcgtgcccagcgtgcccagcgtgcagccgtcgctgttagaggacagcccgctggtacag 
gcgctcagtgggctgcacctgtccgcagacaggctggaagacagcaataagctcaagcgt 
tccttctctctggatatcaaatcagtttcatattcagccagcatggcagcatccttacat 
ggcttctcctcatcagaagatgctttggaatactacaaaccttccactactctggatggg 
accaacaagctatgccagttctcccctgttcaggaactatcggagcagactcccgaaacc 
agtcctgataaggaggaagccagcatccccaagaagctgcagaccgccaggccttcagac 
agccagagcaagcgattgcattcggtcagaaccagcagcagtggcaccgcccagaggtcc 
cttttatctccactgcatcgaagtgggagcgtggaggacaattaccacaccagcttcctt 
ttcggcctttccaccagccagcagcacctcacgaagtctgctggcctgggccttaagggc 
tggcactcggatatcttggccccccagacctctaccccttccctgaccagcagctggtat 
tttgccacagagtcctcacacttctactctgcctcagccatctacggaggcagtgccagt 
tactctgcctacagctgcagccagctgcccacttgcggagaccaagtctattctgtgcgc 
aggcggcagaagccaagtgacagagctgactcgcggcggagctggcatgaagagagcccc 
tttgaaaagcagtttaaacgcagaagctgccaaatggaatttggagagagcatcatgtca 
gagaacaggtcacgggaagagctggggaaagtgggcagtcagtctagcttttcgggcagc 
atggaaatcattgaggtctcctgagaagaaagacacttgtgacttctatagacaattttt 
ttttcttgttcacaaaaaaattccctgtaaatctgaaatatatatatgtacatacatata 
tatttttggaaaatggagctatggtgtaaaagcaacaggtggatcaacccagttgttact 
ctcttaacatctgcatttgagagatcagctaatacttctctcaacaaaaatggaagggca 
gatgctagaatcccccctagacggaggaaaaccattttattcagtgaattacacatcctc 
ttgttcttaaaaaagcaagtgtctttggtgttggaggacaaaatcccctaccattttcac 
gttgtgctactaagagatctcaaatattagtctttgtccggacccttccatagtacacct 
tagcgctgagactgagccagcttgggggtcaggtaggtagaccctgttagggacagagcc 
tagtggtaaatccaagagaaatgatcctatccaaagctgattcacaaacccacgctcacc 
tgacagccgagggacacgagcatcactctgctggacggaccattaggggccttgccaagg 
tctaccttagagcaaacccagtacctcagacaggaaagtcggggctttgaccactaccat 
atctggtagcccattttctaggcattgtgaataggtaggtagctagtcacacttttcaga 
ccaattcaaactgtctatgcacaaaattcccgtgggcctagatggagataattttttttt 
cttctcagctttatgaagagaagggaaactgtctaggattcagctgaaccaccaggaacc 
tggcaacatcacgatttaagctaaggttgggaggctaacgagtctacctccctctttgta 
aatcaaagaattgtttaaaatgggattgtcaatcctttaaataaagatgaacttggtttc 
aagccaaatgtgaatttatttgggttggtagcagagcagcagcaccttcaaattctcagc 
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caaagcagatgtttttgccctttctgcttcactgcatggatacagttggtaaaatgtaat 
aatatggcagaattttataggaaacttcctagggaggtaaattatgggaagattaagaaa 
ggtacaaattgctgaggagaagcaggaaacctgtttccttagtggcttttatcccctcgg 
catgcgatggggctgatgtttctataattgcctcagactttcacatttactagtagggct 
gagagaggctttagtgaggaaggaatattcagaataaaacggttgagaaagctgagaaga 
ccattgagttttgatcagttgtgaatagagtgcaaagccatggccaagctgtttttggaa 
acgctggccggcgtgtcttcagtggaaaaagcaaatcaaaatggagcgagagcaaagggg 
cgtcctcagtcctcaacctacaatcactgtatggaatcggtcctggcagctgaacatagg 
aggtcactggaacaagtgatagtgcagattggctttcaaacatcctcctggcttgagttt 
tatcagctacaatgtgggtcctcttttgaagccttaattcacaacagcagctttttgggg 
gtggggctgggcgggtgttgtcattgttctttcccttcctgtaagtgtcgctagttgctg 
cctcgtatctcaggtttttctctgtttttgagaaatggacagttttttgaccaggatgtg 
acttcatgtttcctatggtgacttctaaaaccagcacagaatgatatgactcaacacaga 
ccgacttggttatggggatgatgagccgcacagacctcactagttgtgcacaaataatgt 
gctatgatggggtgtaaagtgaaggcagaagagggtcagccgcattgttatgatactggg 
aaagtgctggtcaacgatttgagttagtttttagatatacattgaaatctttaatcagac 
attctcaagtttcacacagtagtttttgatgttatgtacacacacaccaaatgtgtaaca 
gttcaccacttccagagtgtggtcatgcccaaaacatgtttaagaaaggaaagcagtagc 
tccttgctaacgatgtttcaggaggtttggggcacttggttttaatgagcttctgtcatt 
tagggcttctcttggccatggtccccttccttctggaactgtgatgtagtcacatcctac 
agcctttagtgctggttcactagtgtcagataatcagttcttggaatcgagactgccgtg 
gcgaaggggtggcctcggaggcaggctctggagctgcttggatgtctttaggtggggtgg 
tggctggctctcttcagcatgtaattggggaaaccctcgcgtctactaggggtgatacag 
atggtgattttaaagagcaaaactagacttctatgtgagaagtgctggaaaatgatttag 
gacatgtgtaaagttagatggaaagactgtaaatgtttaatatgaatatagtgttctttt 
gaagtaaggccagctgttgaacggttaaactgtgcatttctcattttgatgtgtcatgta 
tgttaatgtatgaaatgattaaataaaatcaaaactggtacctgtttatacataaatacg 

SEQ ID NO: 298. >T23935_T3 # TY Transcript # LN 5187 # Source Gene: T23935 
# Encoded protein: T23935_P6 

ctctcccccccaacccccactttcacaacgcggccccccacctttacctccctaagttct 
tcatctctttcgacattttttttgaaggctgatgggggggagggtggaagagaaagagtg 
aagaaaagttgcgagcgtctcctctcttcctaagcctcccgccccaacccacccctcaga 
gaaggagaagataatatactgaaaagaagaggaggaggagagcgacgggacgggacgcga 
gcgggagcgcagccgccctctcggctccgcggcggcgcctcgcaagtccgggaggcgagg 
ggggcccgaggggagacgccgtgacaactttcgtttccctctgagggaattgggaggtcg 
gcggccccaaaagctttcagtccagtgtaaagctgttggagcgcgggagcaaaggtaaag 
aatgatgtaatgcgctggctgctccaaagcatcttttgttgtggaatggttattccagtc 
atctctttatgaatcaaatgtgaggggctgctttgtggacggagtcctttgcaagagcac 
atcaacgggaaagagaaagagacattcacttggagggctcttgctgaaaatgggtttaac 
tctccttttgccagtcaccaccagcctgacctcatacacttttagtacaatggagtggct 
gagcctttgagcacaccaccattacatcatcgtggcaaattaaagaaggaggtgggaaaa 
gaggacttattgttgtcatggcccatgagatgattggaactcaaattgttactgagaggt 
tggtggctctgctggaaagtggaacggaaaaagtgctgctaattgatagccggccatttg 
tggaatacaatacatcccacattttggaagccattaatatcaactgctccaagcttatga 
agcgaaggttgcaacaggacaaagtgttaattacagagctcatccagcattcagcgaaac 
ataaggttgacattgattgcagtcagaaggttgtagtttacgatcaaagctcccaagatg 
ttgcctctctctcttcagactgttttctcactgtacttctgggtaaactggagaagagct 
tcaactctgttcacctgcttgcaggagctgatgcagcagaatgggattggttatgtgtta 
aatgccagcaatacctgtccaaagcctgactttatccccgagtctcatttcctgcgtgtg 
cctgtgaatgacagcttttgtgagaaaattttgccgtggttggacaaatcagtagatttc 
attgagaaagcaaaagcctccaatggatgtgttctagtgcactgtttagctgggatctcc 
cgctccgccaccatcgctatcgcctacatcatgaagaggatggacatgtctttagatgaa 
gcttacagatttgtgaaagaaaaaagacctactatatctccaaacttcaattttctgggc 
caactcctggactatgagaagaagattaagaaccagactggagcatcagggccaaagagc 
aaactcaagctgctgcacctggagaagccaaatgaacctgtccctgctgtctcagagggt 
ggacagaaaagcgagacgcccctcagtccaccctgtgccgactctgctacctcagaggca 
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Figure 1 (Cont'd) 



gcaggacaaaggcccgtgcatcccgccagcgtgcccagcgtgcccagcgtgcagccgtcg 
ctgttagaggacagcccgctggtacaggcgctcagtgggctgcacctgtccgcagacagg 
ctggaagacagcaataagctcaagcgttccttctctctggatatcaaatcagtttcatat 
tcagccagcatggcagcatccttacatggcttctcctcatcagaagatgctttggaatac 
tacaaaccttccactactctggatgggaccaacaagctatgccagttctcccctgttcag 
gaactatcggagcagactcccgaaaccagtcctgataaggaggaagccagcatccccaag 
aagctgcagaccgccaggccttcagacagccagagcaagcgattgcattcggtcagaacc 
agcagcagtggcaccgcccagaggtcccttttatctccactgcatcgaagtgggagcgtg 
gaggacaattaccacaccagcttccttttcggcctttccaccagccagcagcacctcacg 
aagtctgctggcctgggccttaagggctggcactcggatatcttggccccccagacctct 
accccttccctgaccagcagctggtattttgccacagagtcctcacacttctactctgcc 
tcagccatctacggaggcagtgccagttactctgcctacagctgcagccagctgcccact 
tgcggagaccaagtctattctgtgcgcaggcggcagaagccaagtgacagagctgactcg 
cggcggagctggcatgaagagagcccctttgaaaagcagtttaaacgcagaagctgccaa 
atggaatttggagagagcatcatgtcagagaacaggtcacgggaagagctggggaaagtg 
ggcagtcagtctagcttttcgggcagcatggaaatcattgaggtctcctgagaagaaaga 
cacttgtgacttctatagacaatttttttttcttgttcacaaaaaaattccctgtaaatc 
tgaaatatatatatgtacatacatatatatttttggaaaatggagctatggtgtaaaagc 
aacaggtggatcaacccagttgttactctcttaacatctgcatttgagagatcagctaat 
acttctctcaacaaaaatggaagggcagatgctagaatcccccctagacggaggaaaacc 
attttattcagtgaattacacatcctcttgttcttaaaaaagcaagtgtctttggtgttg 
gaggacaaaatcccctaccattttcacgttgtgctactaagagatctcaaatattagtct 
ttgtccggacccttccatagtacaccttagcgctgagactgagccagcttgggggtcagg 
taggtagaccctgttagggacagagcctagtggtaaatccaagagaaatgatcctatcca 
aagctgattcacaaacccacgctcacctgacagccgagggacacgagcatcactctgctg 
gacggaccattaggggccttgccaaggtctaccttagagcaaacccagtacctcagacag 
gaaagtcggggctttgaccactaccatatctggtagcccattttctaggcattgtgaata 
ggtaggtagctagtcacacttttcagaccaattcaaactgtctatgcacaaaattcccgt 
gggcctagatggagataatttttttttcttctcagctttatgaagagaagggaaactgtc 
taggattcagctgaaccaccaggaacctggcaacatcacgatttaagctaaggttgggag 
gctaacgagtctacctccctctttgtaaatcaaagaattgtttaaaatgggattgtcaat 
cctttaaataaagatgaacttggtttcaagccaaatgtgaatttatttgggttggtagca 
gagcagcagcaccttcaaattctcagccaaagcagatgtttttgccctttctgcttcact 
gcatggatacagttggtaaaatgtaataatatggcagaattttataggaaacttcctagg 
gaggtaaattatgggaagattaagaaaggtacaaattgctgaggagaagcaggaaacctg 
tttccttagtggcttttatcccctcggcatgcgatggggctgatgtttctataattgcct 
cagactttcacatttactagtagggctgagagaggctttagtgaggaaggaatattcaga 
ataaaacggttgagaaagctgagaagaccattgagttttgatcagttgtgaatagagtgc 
aaagccatggccaagctgtttttggaaacgctggccggcgtgtcttcagtggaaaaagca 
aatcaaaatggagcgagagcaaaggggcgtcctcagtcctcaacctacaatcactgtatg 
gaatcggtcctggcagctgaacataggaggtcactggaacaagtgatagtgcagattggc 
tttcaaacatcctcctggcttgagttttatcagctacaatgtgggtcctcttttgaagcc 
ttaattcacaacagcagctttttgggggtggggctgggcgggtgttgtcattgttctttc 
ccttcctgtaagtgtcgctagttgctgcctcgtatctcaggtttttctctgtttttgaga 
aatggacagttttttgaccaggatgtgacttcatgtttcctatggtgacttctaaaacca 
gcacagaatgatatgactcaacacagaccgacttggttatggggatgatgagccgcacag 
acctcactagttgtgcacaaataatgtgctatgatggggtgtaaagtgaaggcagaagag 
ggtcagccgcattgttatgatactgggaaagtgctggtcaacgatttgagttagttttta 
gatatacattgaaatctttaatcagacattctcaagtttcacacagtagtttttgatgtt 
atgtacacacacaccaaatgtgtaacagttcaccacttccagagtgtggtcatgcccaaa 
acatgtttaagaaaggaaagcagtagctccttgctaacgatgtttcaggaggtttggggc 
acttggttttaatgagcttctgtcatttagggcttctcttggccatggtccccttccttc 
tggaactgtgatgtagtcacatcctacagcctttagtgctggttcactagtgtcagataa 
tcagttcttggaatcgagactgccgtggcgaaggggtggcctcggaggcaggctctggag 
ctgcttggatgtctttaggtggggtggtggctggctctcttcagcatgtaattggggaaa 
ccctcgcgtctactaggggtgatacagatggtgattttaaagagcaaaactagacttcta 
tgtgagaagtgctggaaaatgatttaggacatgtgtaaagttagatggaaagactgtaaa 
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Figur 1 (Cont'd) 



tgtttaatatgaatatagtgttcttttgaagtaaggccagctgttgaacggttaaactgt 
gcatttctcattttgatgtgtcatgtatgttaatgtatgaaatgattaaataaaatcaaa 
actggtacctgtttatacataaatacg 

SEQ ID NO: 299. >T23935_T4 # TY Transcript # LN 5970 # Source Gene: T23935 
# Encoded protein: T23935_P3 

aatgcctctttatgccactcaccttatggggctgatggagagtccacgtaaaggataacc 
gtttcctcaaaggatttatgctgtttcacctccatcaagaaaagggctttttaaagttaa 
ggctttcagtccagtgtaaagctgttggagcgcgggagcaaaggtaaagaatgatgtaat 
gcgctggctgctccaaagcatcttttgttgtggaatggttattccagtcatctctttatg 
aatcaaatgtgaggggctgctttgtggacggagtcctttgcaagagcacatcaacgggaa 
agagaaagagacattcacttggagggctcttgctgaaaatgggtttaactctccttttgc 
cagtcaccaccagcctgacctcatacacttttagtacaatggagtggctgagcctttgag 
cacaccaccattacatcatcgtggcaaattaaagaaggaggtgggaaaagaggacttatt 
gttgtcatggcccatgagatgattggaactcaaattgttactgagaggttggtggctctg 
ctggaaagtggaacggaaaaagtgctgctaattgatagccggccatttgtggaatacaat 
acatcccacattttggaagccattaatatcaactgctccaagcttatgaagcgaaggttg 
caacaggacaaagtgttaattacagagctcatccagcattcagcgaaacataaggtaaac 
gctttcctttttgccttttcaaaataaatggacacactgaatgaggaataattttaattg 
aagcctcctctttaagtagagaaagtaatttgaggaagttttggttgctttttaaatttt 
ggtgccttgtggtacctcaaaataatttattgtccatattgccacatttaataatatttc 
taaactatgaaggattaacttgctttagtttggacaagtgctatattttcttgtggcttg 
ctaaaattattttattaaacactattctatattatctgcaaatttactgaaaggaaagtc 
attcttttcaagttctttggaaagaaagtaggaactaattattaggggtgctttttatgt 
gtcataacacatgtggtaacattggtcaagacttaatgatatcttcaagggaagttattc 
ctgggatactaaaatgttgtctgtgtggtgaggaagttggtttttttcttcacaaataca 
aatgagctcatgttttaatttaaataaacaactgatttctttgttaaggtgtaaaagtca 
ttttaaaaaaagtttgtttaggtccagggttatgtttattttgcatattgtttggtttat 
tatgaactttaccattttatagtataaaggatttaaaaccagggaatcgtgagatcacct 
acacagaaatagcatcattttggtttttaagagttttgtatgtgtctgtctcgtaaattg 
actacatctcatttttattcaaattattaattgaaaacaggaagcaattattcgccccag 
cactggtaaatgtgagatcttaagcagtagtttgagattacttttctcaccttcttccta 
acaagcctcatgtttgcatttcaggttgacattgattgcagtcagaaggttgtagtttac 
gatcaaagctcccaagatgttgcctctctctcttcagactgttttctcactgtacttctg 
ggtaaactggagaagagcttcaactctgttcacctgcttgcaggtgggtttgctgagttc 
tctcgttgtttccctggcctctgtgaaggaaaatccactctagtccctacctgcatttct 
cagccttgcttacctgttgccaacattgggccaacccgaattcttcccaatctttatctt 
ggctgccagcgagatgtcctcaacaaggagctgatgcagcagaatgggattggttatgtg 
ttaaatgccagcaatacctgtccaaagcctgactttatccccgagtctcatttcctgcgt 
gtgcctgtgaatgacagcttttgtgagaaaattttgccgtggttggacaaatcagtagat 
ttcattgagaaagcaaaagcctccaatggatgtgttctagtgcactgtttagctgggatc 
tcccgctccgccaccatcgctatcgcctacatcatgaagaggatggacatgtctttagat 
gaagcttacagatttgtgaaagaaaaaagacctactatatctccaaacttcaattttctg 
ggccaactcctggactatgagaagaagattaagaaccagactggagcatcagggccaaag 
agcaaactcaagctgctgcacctggagaagccaaatgaacctgtccctgctgtctcagag 
ggtggacagaaaagcgagacgcccctcagtccaccctgtgccgactctgctacctcagag 
gcagcaggacaaaggcccgtgcatcccgccagcgtgcccagcgtgcccagcgtgcagccg 
tcgctgttagaggacagcccgctggtacaggcgctcagtgggctgcacctgtccgcagac 
aggctggaagacagcaataagctcaagcgttccttctctctggatatcaaatcagtttca 
tattcagccagcatggcagcatccttacatggcttctcctcatcagaagatgctttggaa 
tactacaaaccttccactactctggatgggaccaacaagctatgccagttctcccctgtt 
caggaactatcggagcagactcccgaaaccagtcctgataaggaggaagccagcatcccc 
aagaagctgcagaccgccaggccttcagacagccagagcaagcgattgcattcggtcaga 
accagcagcagtggcaccgcccagaggtcccttttatctccactgcatcgaagtgggagc 
gtggaggacaattaccacaccagcttccttttcggcctttccaccagccagcagcacctc 
acgaagtctgctggcctgggccttaagggctggcactcggatatcttggccccccagacc 
tctaccccttccctgaccagcagctggtattttgccacagagtcctcacacttctactct 
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Figur 1 (Cont'd) 



gcctcagccatctacggaggcagtgccagttactctgcctacagctgcagccagctgccc 
acttgcggagaccaagtctattctgtgcgcaggcggcagaagccaagtgacagagctgac 
tcgcggcggagctggcatgaagagagcccctttgaaaagcagtttaaacgcagaagctgc 
caaatggaatttggagagagcatcatgtcagagaacaggtcacgggaagagctggggaaa 
gtgggcagtcagtctagcttttcgggcagcatggaaatcattgaggtctcctgagaagaa 
agacacttgtgacttctatagacaatttttttttcttgttcacaaaaaaattccctgtaa 
atctgaaatatatatatgtacatacatatatatttttggaaaatggagctatggtgtaaa 
agcaacaggtggatcaacccagttgttactctcttaacatctgcatttgagagatcagct 
aatacttctctcaacaaaaatggaagggcagatgctagaatcccccctagacggaggaaa 
accattttattcagtgaattacacatcctcttgttcttaaaaaagcaagtgtctttggtg 
ttggaggacaaaatcccctaccattttcacgttgtgctactaagagatctcaaatattag 
tctttgtccggacccttccatagtacaccttagcgctgagactgagccagcttgggggtc 
aggtaggtagaccctgttagggacagagcctagtggtaaatccaagagaaatgatcctat 
ccaaagctgattcacaaacccacgctcacctgacagccgagggacacgagcatcactctg 
ctggacggaccattaggggccttgccaaggtctaccttagagcaaacccagtacctcaga 
caggaaagtcggggctttgaccactaccatatctggtagcccattttctaggcattgtga 
ataggtaggtagctagtcacacttttcagaccaattcaaactgtctatgcacaaaattcc 
cgtgggcctagatggagataatttttttttcttctcagctttatgaagagaagggaaact 
gtctaggattcagctgaaccaccaggaacctggcaacatcacgatttaagctaaggttgg 
gaggctaacgagtctacctccctctttgtaaatcaaagaattgtttaaaatgggattgtc 
aatcctttaaataaagatgaacttggtttcaagccaaatgtgaatttatttgggttggta 
gcagagcagcagcaccttcaaattctcagccaaagcagatgtttttgccctttctgcttc 
actgcatggatacagttggtaaaatgtaataatatggcagaattttataggaaacttcct 
agggaggtaaattatgggaagattaagaaaggtacaaattgctgaggagaagcaggaaac 
ctgtttccttagtggcttttatcccctcggcatgcgatggggctgatgtttctataattg 
cctcagactttcacatttactagtagggctgagagaggctttagtgaggaaggaatattc 
agaataaaacggttgagaaagctgagaagaccattgagttttgatcagttgtgaatagag 
tgcaaagccatggccaagctgtttttggaaacgctggccggcgtgtcttcagtggaaaaa 
gcaaatcaaaatggagcgagagcaaaggggcgtcctcagtcctcaacctacaatcactgt 
atggaatcggtcctggcagctgaacataggaggtcactggaacaagtgatagtgcagatt 
ggctttcaaacatcctcctggcttgagttttatcagctacaatgtgggtcctcttttgaa 
gccttaattcacaacagcagctttttgggggtggggctgggcgggtgttgtcattgttct 
ttcccttcctgtaagtgtcgctagttgctgcctcgtatctcaggtttttctctgtttttg 
agaaatggacagttttttgaccaggatgtgacttcatgtttcctatggtgacttctaaaa 
ccagcacagaatgatatgactcaacacagaccgacttggttatggggatgatgagccgca 
cagacctcactagttgtgcacaaataatgtgctatgatggggtgtaaagtgaaggcagaa 
gagggtcagccgcattgttatgatactgggaaagtgctggtcaacgatttgagttagttt 
ttagatatacattgaaatctttaatcagacattctcaagtttcacacagtagtttttgat 
gttatgtacacacacaccaaatgtgtaacagttcaccacttccagagtgtggtcatgccc 
aaaacatgtttaagaaaggaaagcagtagctccttgctaacgatgtttcaggaggtttgg 
ggcacttggttttaatgagcttctgtcatttagggcttctcttggccatggtccccttcc 
ttctggaactgtgatgtagtcacatcctacagcctttagtgctggttcactagtgtcaga 
taatcagttcttggaatcgagactgccgtggcgaaggggtggcctcggaggcaggctctg 
gagctgcttggatgtctttaggtggggtggtggctggctctcttcagcatgtaattgggg 
aaaccctcgcgtctactaggggtgatacagatggtgattttaaagagcaaaactagactt 
ctatgtgagaagtgctggaaaatgatttaggacatgtgtaaagttagatggaaagactgt 
aaatgtttaatatgaatatagtgttcttttgaagtaaggccagctgttgaacggttaaac 
tgtgcatttctcattttgatgtgtcatgtatgttaatgtatgaaatgattaaataaaatc 
aaaactggtacctgtttatacataaatacg 

SEQ ID NO: 300. >T23935_T5 # TY Transcript # LN 3672 # Source Gene: T23935 
# Encoded protein: T23935_P1 

ctctcccccccaacccccactttcacaacgcggccccccacctttacctccctaagttct 
tcatctctttcgacattttttttgaaggctgatgggggggagggtggaagagaaagagtg 
aagaaaagttgcgagcgtctcctctcttcctaagcctcccgccccaacccacccctcaga 
gaaggagaagataatatactgaaaagaagaggaggaggagagcgacgggacgggacgcga 
gcgggagcgcagccgccctctcggctccgcggcggcgcctcgcaagtccgggaggcgagg 
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Figur 1 (Cont'd) 



ggggcccgaggggagacgccgtgacaactttcgtttccctctgagggaattgggaggtcg 
gcggccccaaaagctttcagtccagtgtaaagctgttggagcgcgggagcaaaggtaaag 
aatgatgtaatgcgctggctgctccaaagcatcttttgttgtggaatggttattccagtc 
atctctttatgaatcaaatgtgaggggctgctttgtggacggagtcctttgcaagagcac 
atcaacgggaaagagaaagagacattcacttggagggctcttgctgaaaatgggtttaac 
tctccttttgccagtcaccaccagcctgacctcatacacttttagtacaatggagtggct 
gagcctttgagcacaccaccattacatcatcgtggcaaattaaagaaggaggtgggaaaa 
gaggacttattgttgtcatggcccatgagatgattggaactcaaattgttactgagaggt 
tggtggctctgctggaaagtggaacggaaaaagtgctgctaattgatagccggccatttg 
tggaatacaatacatcccacattttggaagccattaatatcaactgctccaagcttatga 
agcgaaggttgcaacaggacaaagtgttaattacagagctcatccagcattcagcgaaac 
ataaggttgacattgattgcagtcagaaggttgtagtttacgatcaaagctcccaagatg 
ttgcctctctctcttcagactgttttctcactgtacttctgggtaaactggagaagagct 
tcaactctgttcacctgcttgcaggtgggtttgctgagttctctcgttgtttccctggcc 
tctgtgaaggaaaatccactctagtccctacctgcatttctcagccttgcttacctgttg 
ccaacattgggccaacccgaattcttcccaatctttatcttggctgccagcgagatgtcc 
tcaacaaggagctgatgcagcagaatgggattggttatgtgttaaatgccagcaatacct 
gtccaaagcctgactttatccccgagtctcatttcctgcgtgtgcctgtgaatgacagct 
tttgtgagaaaattttgccgtggttggacaaatcagtagatttcattgagaaagcaaaag 
cctccaatggatgtgttctagtgcactgtttagctgggatctcccgctccgccaccatcg 
ctatcgcctacatcatgaagaggatggacatgtctttagatgaagcttacagatttgtga 
aagaaaaaagacctactatatctccaaacttcaattttctgggccaactcctggactatg 
agaagaagattaagaaccagactggagcatcagggccaaagagcaaactcaagctgctgc 
acctggagaagccaaatgaacctgtccctgctgtctcagagggtggacagaaaagcgaga 
cgcccctcagtccaccctgtgccgactctgctacctcagaggcagcaggacaaaggcccg 
tgcatcccgccagcgtgcccagcgtgcccagcgtgcagccgtcgctgttagaggacagcc 
cgctggtacaggcgctcagtgggctgcacctgtccgcagacaggctggaagacagcaata 
agctcaagcgttccttctctctggatatcaaatcagtttcatattcagccagcatggcag 
catccttacatggcttctcctcatcagaagatgctttggaatactacaaaccttccacta 
ctctggatgggaccaacaagctatgccagttctcccctgttcaggaactatcggagcaga 
ctcccgaaaccagtcctgataaggaggaagccagcatccccaagaagctgcagaccgcca 
ggccttcagacagccagagcaagcgattgcattcggtcagaaccagcagcagtggcaccg 
cccagaggtcccttttatctccactgcatcgaagtgggagcgtggaggacaattaccaca 
ccagcttccttttcggcctttccaccagccagcagcacctcacgaagtctgctggcctgg 
gccttaagggctggcactcggatatcttggccccccagacctctaccccttccctgacca 
gcagctggtattttgccacagagtcctcacacttctactctgcctcagccatctacggag 
gcagtgccagttactctgcctacagctgcagccagctgcccacttgcggagaccaagtct 
attctgtgcgcaggcggcagaagccaagtgacagagctgactcgcggcggagctggcatg 
aagagagcccctttgaaaagcagtttaaacgcagaagctgccaaatggaatttggagaga 
gcatcatgtcagagaacaggtcacgggaagagctggggaaagtgggcagtcagtctagct 
tttcgggcagcatggaaatcattgaggtctcctgagaagaaagacacttgtgacttctat 
agacaatttttttttcttgttcacaaaaaaattccctgtaaatctgaaatatatatatgt 
acatacatatatatttttggaaaatggagctatggtgtaaaagcaacaggtggatcaacc 
cagttgt tact ctcttaacatctgcatttgagagatcagctaat act tctctcaacaaaa 
atggaagggcagatgctagaatcccccctagacggaggaaaaccattttattcagtgaat 
tacacatcctcttgttcttaaaaaagcaagtgtctttggtgttggaggacaaaatcccct 
accattttcacgttgtgctactaagagatctcaaatattagtctttgtccggacccttcc 
atagtacaccttagcgctgagactgagccagcttgggggtcaggtaggtagaccctgtta 
gggacagagcctagtggtaaatccaagagaaatgatcctatccaaagctgattcacaaac 
ccacgctcacctgacagccgagggacacgagcatcactctgctggacggaccattagggg 
ccttgccaaggtctaccttagagcaaacccagtacctcagacaggaaagtcggggctttg 
accactaccatatctggtagcccattttctaggcattgtgaataggtaggtagctagtca 
cacttttcagaccaattcaaactgtctatgcacaaaattcccgtgggcctagatggagat 
aatttttttttcttctcagctttatgaagagaagggaaactgtctaggattcagctgaac 
caccaggaacctggcaacatcacgatttaagctaaggttgggaggctaacgagtctacct 
ccctctttgtaaatcaaagaattgtttaaaatgggattgtcaatcctttaaataaagatg 
aacttggtttca 
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Figur 1 (Cont'd) 



SEQ ID NO: 301. >T23935_T6 # TY Transcript # LN 6200 # Source Gene: T23935 
# Encoded protein: T23935_P1 

ctctcccccccaacccccactttcacaacgcggccccccacctttacctccctaagttct 
tcatctctttcgacattttttttgaaggctgatgggggggagggtggaagagaaagagtg 
aagaaaagttgcgagcgtctcctctcttcctaagcctcccgccccaacccacccctcaga 
gaaggagaagataatatactgaaaagaagaggaggaggagagcgacgggacgggacgcga 
gcgggagcgcagccgccctctcggctccgcggcggcgcctcgcaagtccgggaggcgagg 
ggggcccgaggggagacgccgtgacaactttcgtttccctctgagggaattgggaggtcg 
gcggccccaaaagctttcagtccagtgtaaagctgttggagcgcgggagcaaaggtaaag 
aatgatgtaatgcgctggctgctccaaagcatcttttgttgtggaatggttattccagtc 
atctctttatgaatcaaatgtgaggggctgctttgtggacggagtcctttgcaagagcac 
atcaacgggaaagagaaagagacattcacttggagggctcttgctgaaaatgggtttaac 
tctccttttgccagtcaccaccagcctgacctcatacacttttagtacaatggagtggct 
gagcctttgagcacaccaccattacatcatcgtggcaaattaaagaaggaggtgggaaaa 
gaggacttattgttgtcatggcccatgagatgattggaactcaaattgttactgagaggt 
tggtggctctgctggaaagtggaacggaaaaagtgctgctaattgatagccggccatttg 
tggaatacaatacatcccacattttggaagccattaatatcaactgctccaagcttatga 
agcgaaggttgcaacaggacaaagtgttaattacagagctcatccagcattcagcgaaac 
ataaggttgacattgattgcagtcagaaggttgtagtttacgatcaaagctcccaagatg 
ttgcctctctctcttcagactgttttctcactgtacttctgggtaaactggagaagagct 
tcaactctgttcacctgcttgcaggtgggtttgctgagttctctcgttgtttccctggcc 
tctgtgaaggaaaatccactctagtccctacctgcatttctcagccttgcttacctgttg 
ccaacattgggccaacccgaattcttcccaatctttatcttggctgccagcgagatgtcc 
tcaacaaggagctgatgcagcagaatgggattggttatgtgttaaatgccagcaatacct 
gtccaaagcctgactttatccccgagtctcatttcctgcgtgtgcctgtgaatgacagct 
tttgtgagaaaattttgccgtggttggacaaatcagtagatttcattgagaaagcaaaag 
cctccaatggatgtgttctagtgcactgtttagctgggatctcccgctccgccaccatcg 
ctatcgcctacatcatgaagaggatggacatgtctttagatgaagcttacagatttgtga 
aagaaaaaagacctactatatctccaaacttcaattttctgggccaactcctggactatg 
agaagaagattaagaaccagactggagcatcagggccaaagagcaaactcaagctgctgc 
acctggagaagccaaatgaacctgtccctgctgtctcagagggtggacagaaaagcgaga 
cgcccctcagtccaccctgtgccgactctgctacctcagaggcagcaggacaaaggcccg 
tgcatcccgccagcgtgcccagcgtgcccagcgtgcagccgtcgctgttagaggacagcc 
cgctggtacaggcgctcagtgggctgcacctgtccgcagacaggctggaagacagcaata 
agctcaagcgttccttctctctggatatcaaatcagtttcatattcagccagcatggcag 
catccttacatggcttctcctcatcagaagatgctttggaatactacaaaccttccacta 
ctctggatgggaccaacaagctatgccagttctcccctgttcaggaactatcggagcaga 
ctcccgaaaccagtcctgataaggaggaagccagcatccccaagaagctgcagaccgcca 
ggccttcagacagccagagcaagcgattgcattcggtcagaaccagcagcagtggcaccg 
cccagaggtcccttttatctccactgcatcgaagtgggagcgtggaggacaattaccaca 
ccagcttccttttcggcctttccaccagccagcagcacctcacgaagtctgctggcctgg 
gccttaagggctggcactcggatatcttggccccccagacctctaccccttccctgacca 
gcagctggtattttgccacagagtcctcacacttctactctgcctcagccatctacggag 
gcagtgccagttactctgcctacagctgcagccagctgcccacttgcggagaccaagtct 
attctgtgcgcaggcggcagaagccaagtgacagagctgactcgcggcggagctggcatg 
aagagagcccctttgaaaagcagtttaaacgcagaagctgccaaatggaatttggagaga 
gcatcatgtcagagaacaggtcacgggaagagctggggaaagtgggcagtcagtctagct 
tttcgggcagcatggaaatcattgaggtctcctgagaagaaagacacttgtgacttctat 
agacaatttttttttcttgttcacaaaaaaattccctgtaaatctgaaatatatatatgt 
acatacatatatatttttggaaaatggagctatggtgtaaaagcaacaggtggatcaacc 
cagttgttactctcttaacatctgcatttgagagatcagctaatacttctctcaacaaaa 
atggaagggcagatgctagaatcccccctagacggaggaaaaccattttattcagtgaat 
tacacatcctcttgttcttaaaaaagcaagtgtctttggtgttggaggacaaaatcccct 
accattttcacgttgtgctactaagagatctcaaatattagtctttgtccggacccttcc 
atagtacaccttagcgctgagactgagccagcttgggggtcaggtaggtagaccctgtta 
gggacagagcctagtggtaaatccaagagaaatgatcctatccaaagctgattcacaaac 
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Figur 1 (Cont'd) 



ccacgctcacctgacagccgagggacacgagcatcactctgctggacggaccattagggg 
ccttgccaaggtctaccttagagcaaacccagtacctcagacaggaaagtcggggctttg 
accactaccatatctggtagcccattttctaggcattgtgaataggtaggtagctagtca 
cacttttcagaccaattcaaactgtctatgcacaaaattcccgtgggcctagatggagat 
aatttttttttcttctcagctttatgaagagaagggaaactgtctaggattcagctgaac 
caccaggaacctggcaacatcacgatttaagctaaggttgggaggctaacgagtctacct 
ccctctttgtaaatcaaagaattgtttaaaatgggattgtcaatcctttaaataaagatg 
aacttggtttcaagccaaatgtgaatttatttgggttggtagcagagcagcagcaccttc 
aaattctcagccaaagcagatgtttttgccctttctgcttcactgcatggatacagttgg 
taaaatgtaataatatggcagaattttataggaaacttcctagggaggtaaattatggga 
agattaagaaaggtacaaattgctgaggagaagcaggaaacctgtttccttagtggcttt 
tatcccctcggcatgcgatggggctgatgtttctataattgcctcagactttcacattta 
ctagtagggctgagagaggctttagtgaggaaggaatattcagaataaaacggttgagaa 
agctgagaagaccattgagttttgatcagttgtgaatagagtgcaaagccatggccaagc 
tgtttttggaaacgctggccggcgtgtcttcagtggaaaaagcaaatcaaaatggagcga 
gagcaaaggggcgtcctcagtcctcaacctacaatcactgtatggaatcggtcctggcag 
ctgaacataggaggtcactggaacaagtgatagtgcagattggctttcaaacatcctcct 
ggcttgagttttatcagctacaatgtgggtcctcttttgaagccttaattcacaacagca 
gctttttgggggtggggctgggcgggtgttgtcattgttctttcccttcctgtaagtgtc 
gctagttgctgcctcgtatctcaggtttttctctgtttttgagaaatggacagttttttg 
accaggatgtgacttcatgtttcctatggtgacttctaaaaccagcacagaatgatatga 
ctcaacacagaccgacttggttatggggatgatgagccgcacagacctcactagttgtgc 
acaaataatgtgctatgatggggtgtaaagtgaaggcagaagagggtcagccgcattgtt 
atgatactgggaaagtgctggtcaacgatttgagttagtttttagatatacattgaaatc 
tttaatcagacattctcaagtttcacacagtagtttttgatgttatgtacacacacacca 
aatgtgtaacagttcaccacttccagagtgtggtcatgcccaaaacatgtttaagaaagg 
aaagcagtagctccttgctaacgatgtttcaggaggtttggggcacttggttttaatgag 
cttctgtcatttagggcttctcttggccatggtccccttccttctggaactgtgatgtag 
tcacatcctacagcctttagtgctggttcactagtgtcagataatcagttcttggaatcg 
agactgccgtggcgaaggggtggcctcggaggcaggctctggagctgcttggatgtcttt 
aggtggggtggtggctggctctcttcagcatgtaattggggaaaccctcgcgtctactag 
gggtgatacagatggtgattttaaagagcaaaactagacttctatgtgagaagtgctgga 
aaatgatttaggacatgtgtaaagttagatggaaagactgtaaatgtttaatatgaatat 
agtgttcttttgaagtaaggccagctgttgaacggttaaactgtgcatttctcattttga 
tgtgtcatgtatgttaatgtatgaaatgattaaataaaatcaaaactggtacctgtttat 
acataaatacgagaaaagacctatctttgcagccataaactcggtgggaacaccaccact 
caagttgccaaaggaggcagtggtgaaacctgtcctgttctcacttaaatgaggatttag 
ctcaaaataaagtggtggtgtcatcaggtttattccgtgttctgtcattcacatggaaca 
ccggatgattagctaacagtttagtgccagccttcattctttactgtgtacgttaaatgc 
acactacagtgaaaaagcctaagacacttggtaaatattttctagctgactgattccaga 
acacacaagtacatggtaggaaaaaaaaggccagctacacactcatggcccatttcgata 
taacatgtaaatgatgttacgattcctacaggaaaaacaggaagggcaattgttgaagag 
ctgtcctttgtccttgccacatacatctcaacaacctctctgctcatgacactagatttg 
atgacttatgagactagaaacttagtctcatcagtttcataaaaattaggcattgaagga 
ataaagcttcagtgcaaacagatggaatattgactacagaaagaatcacaggtgaccttg 
aagttggttgagttgaagttacatttaagaaaatttataagaaagattgggccgggtgag 
ctagactagaagagctatcttggccaggaccctgcagaaaaagaggtggaagtgagatag 
aggaataaagggaattagaataaatcagcctccagagatagagggtaacaggaggaacag 
ttgtttcagtgagaggcttccaatgagaagatgcagatcttgggagagcctgaagttcag 
gccagttgctgacagtgctc 

SEQ ID NO: 302. >T23935_T7 # TY Transcript # LN 2754 # Source Gene: T23935 
# Encoded protein: 

cgaggcgatctagaggatcccttggtagtcacctttactaatatatttccattatatatg 
aaatgtgttcattattagaaacttggaagttacagaaatgtagaagagaaactcacccag 
tgttttcaccatccaaagagtgtgattgtccnagtgatgtgatggttcagtgatgtgatg 
tgttcacaaaaaaattccctgtaaatctgaaatatatatatgtacatacatatatatttt 
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Figur 1 (Cont'd) 



tggaaaatggagctatggtgtaaaagcaacaggtggatcaacccagttgttactctctta 
acatctgcatttgagagatcagctaatacttctctcaacaaaaatggaagggcagatgct 
agaatcccccctagacggaggaaaaccattttattcagtgaattacacatcctcttgttc 
ttaaaaaagcaagtgtctttggtgttggaggacaaaatcccctaccattttcacgttgtg 
ctactaagagatctcaaatattagtctttgtccggacccttccatagtacaccttagcgc 
tgagactgagccagcttgggggtcaggtaggtagaccctgttagggacagagcctagtgg 
taaatccaagagaaatgatcctatccaaagctgattcacaaacccacgctcacctgacag 
ccgagggacacgagcatcactctgctggacggaccattaggggccttgccaaggtctacc 
ttagagcaaacccagtacctcagacaggaaagtcggggctttgaccactaccatatctgg 
tagcccattttctaggcattgtgaataggtaggtagctagtcacacttttcagaccaatt 
caaactgtctatgcacaaaattcccgtgggcctagatggagataatttttttttcttctc 
agctttatgaagagaagggaaactgtctaggattcagctgaaccaccaggaacctggcaa 
catcacgatttaagctaaggttgggaggctaacgagtctacctccctctttgtaaatcaa 
agaattgtttaaaatgggattgtcaatcctttaaataaagatgaacttggtttcaagcca 
aatgtgaatttatttgggttggtagcagagcagcagcaccttcaaattctcagccaaagc 
agatgtttttgccctttctgcttcactgcatggatacagttggtaaaatgtaataatatg 
gcagaattttataggaaacttcctagggaggtaaattatgggaagattaagaaaggtaca 
aattgctgaggagaagcaggaaacctgtttccttagtggcttttatcccctcggcatgcg 
atggggctgatgtttctataattgcctcagactttcacatttactagtagggctgagaga 
ggctttagtgaggaaggaatattcagaataaaacggttgagaaagctgagaagaccattg 
agttttgatcagttgtgaatagagtgcaaagccatggccaagctgtttttggaaacgctg 
gccggcgtgtcttcagtggaaaaagcaaatcaaaatggagcgagagcaaaggggcgtcct 
cagtcctcaacctacaatcactgtatggaatcggtcctggcagctgaacataggaggtca 
ctggaacaagtgatagtgcagattggctttcaaacatcctcctggcttgagttttatcag 
ctacaatgtgggtcctcttttgaagccttaattcacaacagcagctttttgggggtgggg 
ctgggcgggtgttgtcattgttctttcccttcctgtaagtgtcgctagttgctgcctcgt 
atctcaggtttttctctgtttttgagaaatggacagttttttgaccaggatgtgacttca 
tgtttcctatggtgacttctaaaaccagcacagaatgatatgactcaacacagaccgact 
tggttatggggatgatgagccgcacagacctcactagttgtgcacaaataatgtgctatg 
atggggtgtaaagtgaaggcagaagagggtcagccgcattgttatgatactgggaaagtg 
ctggtcaacgatttgagttagtttttagatatacattgaaatctttaatcagacattctc 
aagtttcacacagtagtttttgatgttatgtacacacacaccaaatgtgtaacagttcac 
cacttccagagtgtggtcatgcccaaaacatgtttaagaaaggaaagcagtagctccttg 
ctaacgatgtttcaggaggtttggggcacttggttttaatgagcttctgtcatttagggc 
ttctcttggccatggtccccttccttctggaactgtgatgtagtcacatcctacagcctt 
tagtgctggttcactagtgtcagataatcagttcttggaatcgagactgccgtggcgaag 
gggtggcctcggaggcaggctctggagctgcttggatgtctttaggtggggtggtggctg 
gctctcttcagcatgtaattggggaaaccctcgcgtctactaggggtgatacagatggtg 
attttaaagagcaaaactagacttctatgtgagaagtgctggaaaatgatttaggacatg 
tgtaaagttagatggaaagactgtaaatgtttaatatgaatatagtgttcttttgaagta 
aggccagctgttgaacggttaaactgtgcatttctcattttgatgtgtcatgtatgttaa 
tgtatgaaatgattaaataaaatcaaaactggtacctgtttatacataaatacg 

SEQ ID NO: 303. >T23935_T8 4 TY Transcript # LN 5165 # Source Gene: T23935 
# Encoded protein: T23935_P1 

ctctcccccccaacccccactttcacaacgcggccccccacctttacctccctaagttct 
tcatctctttcgacattttttttgaaggctgatgggggggagggtggaagagaaagagtg 
aagaaaagttgcgagcgtctcctctcttcctaagcctcccgccccaacccacccctcaga 
gaaggagaagataatatactgaaaagaagaggaggaggagagcgacgggacgggacgcga 
gcgggagcgcagccgccctctcggctccgcggcggcgcctcgcaagtccgggaggcgagg 
ggggcccgaggggagacgccgtgacaactttcgtttccctctgagggaattgggaggtcg 
gcggccccaaaagctttcagtccagtgtaaagctgttggagcgcgggagcaaaggtaaag 
aatgatgtaatgcgctggctgctccaaagcatcttttgttgtggaatggttattccagtc 
atctctttatgaatcaaatgtgaggggctgctttgtggacggagtcctttgcaagagcac 
atcaacgggaaagagaaagagacattcacttggagggctcttgctgaaaatgggtttaac 
tctccttttgccagtcaccaccagcctgacctcatacacttttagtacaatggagtggct 
gagcctttgagcacaccaccattacatcatcgtggcaaattaaagaaggaggtgggaaaa 
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Figur 1 (Confd) 



gaggacttattgttgtcatggcccatgagatgattggaactcaaattgttactgagaggt 
tggtggctctgctggaaagtggaacggaaaaagtgctgctaattgatagccggccatttg 
tggaatacaatacatcccacattttggaagccattaatatcaactgctccaagcttatga 
agcgaaggttgcaacaggacaaagtgttaattacagagctcatccagcattcagcgaaac 
ataaggttgacattgattgcagtcagaaggttgtagtttacgatcaaagctcccaagatg 
ttgcctctctctcttcagactgttttctcactgtacttctgggtaaactggagaagagct 
tcaactctgttcacctgcttgcaggtgggtttgctgagttctctcgttgtttccctggcc 
tctgtgaaggaaaatccactctagtccctacctgcatttctcagccttgcttacctgttg 
ccaacattgggccaacccgaattcttcccaatctttatcttggctgccagcgagatgtcc 
tcaacaaggagctgatgcagcagaatgggattggttatgtgttaaatgccagcaatacct 
gtccaaagcctgactttatccccgagtctcatttcctgcgtgtgcctgtgaatgacagct 
tttgtgagaaaattttgccgtggttggacaaatcagtagatttcattgagaaagcaaaag 
cctccaatggatgtgttctagtgcactgtttagctgggatctcccgctccgccaccatcg 
ctatcgcctacatcatgaagaggatggacatgtctttagatgaagcttacagatttgtga 
aagaaaaaagacctactatatctccaaacttcaattttctgggccaactcctggactatg 
agaagaagattaagaaccagactggagcatcagggccaaagagcaaactcaagctgctgc 
acctggagaagccaaatgaacctgtccctgctgtctcagagggtggacagaaaagcgaga 
cgcccctcagtccaccctgtgccgactctgctacctcagaggcagcaggacaaaggcccg 
tgcatcccgccagcgtgcccagcgtgcccagcgtgcagccgtcgctgttagaggacagcc 
cgctggtacaggcgctcagtgggctgcacctgtccgcagacaggctggaagacagcaata 
agctcaagcgttccttctctctggatatcaaatcagtttcatattcagccagcatggcag 
catccttacatggcttctcctcatcagaagatgctttggaatactacaaaccttccacta 
ctctggatgggaccaacaagctatgccagttctcccctgttcaggaactatcggagcaga 
ctcccgaaaccagtcctgataaggaggaagccagcatccccaagaagctgcagaccgcca 
ggccttcagacagccagagcaagcgattgcattcggtcagaaccagcagcagtggcaccg 
cccagaggtcccttttatctccactgcatcgaagtgggagcgtggaggacaattaccaca 
ccagcttccttttcggcctttccaccagccagcagcacctcacgaagtctgctggcctgg 
gccttaagggctggcactcggatatcttggccccccagacctctaccccttccctgacca 
gcagctggtattttgccacagagtcctcacacttctactctgcctcagccatctacggag 
gcagtgccagttactctgcctacagctgcagccagctgcccacttgcggagaccaagtct 
attctgtgcgcaggcggcagaagccaagtgacagagctgactcgcggcggagctggcatg 
aagagagcccctttgaaaagcagtttaaacgcagaagctgccaaatggaatttggagaga 
gcatcatgtcagagaacaggtcacgggaagagctggggaaagtgggcagtcagtctagct 
tttcgggcagcatggaaatcattgaggtctcctgagaagaaagacacttgtgacttctat 
agacaatttttttttcttgttcacaaaaaaattccctgtaaatctgaaatatatatatgt 
acatacatatatatttttggaaaatggagctatggtgtaaaagcaacaggtggatcaacc 
cagttgt tact ctcttaacatctgcattt gaga gatcagctaat act tctctcaacaaaa 
atggaagggcagatgctagaatcccccctagacggaggaaaaccattttattcagtgaat 
tacacatcctcttgttcttaaaaaagcaagtgtctttggtgttggaggacaaaatcccct 
accattttcacgttgtgctactaagagatctcaaatattagtctttgtccggacccttcc 
atagtacaccttagcgctgagactgagccagcttgggggtcaggtaggtagaccctgtta 
gggacagagcctagtggtaaatccaagagaaatgatcctatccaaagctgattcacaaac 
ccacgctcacctgacagccgagggacacgagcatcactctgctggacggaccattagggg 
ccttgccaaggtctaccttagagcaaacccagtacctcagacaggaaagtcggggctttg 
accactaccatatctggtagcccattttctaggcattgtgaataggtaggtagctagtca 
cacttttcagaccaattcaaactgtctatgcacaaaattcccgtgggcctagatggagat 
aatttttttttcttctcagctttatgaagagaagggaaactgtctaggattcagctgaac 
caccaggaacctggcaacatcacgatttaagctaaggttgggaggctaacgagtctacct 
ccctctttgtaaatcaaagaattgtttaaaatgggattgtcaatcctttaaataaagatg 
aacttggtttcaagccaaatgtgaatttatttgggttggtagcagagcagcagcaccttc 
aaattctcagccaaagcagatgtttttgccctttctgcttcactgcatggatacagttgg 
taaaatgtaataatatggcagaattttataggaaacttcctagggaggtaaattatggga 
agattaagaaaggtacaaattgctgaggagaagcaggaaacctgtttccttagtggcttt 
tatcccctcggcatgcgatggggctgatgtttctataattgcctcagactttcacattta 
ctagtagggctgagagaggctttagtgaggaaggaatattcagaataaaacggttgagaa 
agctgagaagaccattgagttttgatcagttgtgaatagagtgcaaagccatggccaagc 
tgtttttggaaacgctggccggcgtgtcttcagtggaaaaagcaaatcaaaatggagcga 
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Figure 1 (Cont'd) 



gagcaaaggggcgtcctcagtcctcaacctacaatcactgtatggaatcggtcctggcag 
ctgaacataggaggtcactggaacaagtgatagtgcagattggctttcaaacatcctcct 
ggcttgagttttatcagctacaatgtgggtcctcttttgaagccttaattcacaacagca 
gctttttgggggtggggctgggcgggtgttgtcattgttctttcccttcctgtaagtgtc 
gctagttgctgcctcgtatctcaggtttttctctgtttttgagaaatggacagttttttg 
accaggatgtgacttcatgtttcctatggtgacttctaaaaccagcacagaatgatatga 
ctcaacacagaccgacttggttatggggatgatgagccgcacagacctcactagttgtgc 
acaaataatgtgctatgatggggtgtaaagtgaaggcagaagagggtcagccgcattgtt 
atgatactgggaaagtgctggtcaacgatttgagttagtttttagatatacattgaaatc 
tttaatcaggcttctcttggccatggtccccttccttctggaactgtgatgtagtcacat 
cctacagcctttagtgctggttcactagtgtcagataatcagttcttggaatcgagactg 
ccgtggcgaaggggtggcctcggaggcaggctctggagctgcttggatgtctttaggtgg 
ggtggtggctggctctcttcagcatgtaattggggaaaccctcgcgtctactaggggtga 
tacagatggtgattttaaagagcaaaactagacttctatgtgagaagtgctggaaaatga 
tttaggacatgtgtaaagttagatggaaagactgtaaatgtttaatatgaatatagtgtt 
cttttgaagtaaggccagctgttgaacggttaaactgtgcatttctcattttgatgtgtc 
atgtatgttaatgtatgaaatgattaaataaaatcaaaactggtacctgtttatacataa 
atacg 

SEQ ID NO: 304 . >T23935_T9 * TY Transcript # LN 6305 # Source Gene: T23935 
# Encoded protein: T23935_P1 

ctctcccccccaacccccactttcacaacgcggccccccacctttacctccctaagttct 
tcatctctttcgacattttttttgaaggctgatgggggggagggtggaagagaaagagtg 
aagaaaagttgcgagcgtctcctctcttcctaagcctcccgccccaacccacccctcaga 
gaaggagaagataatatactgaaaagaagaggaggaggagagcgacgggacgggacgcga 
gcgggagcgcagccgccctctcggctccgcggcggcgcctcgcaagtccgggaggcgagg 
ggggcccgaggggagacgccgtgacaactttcgtttccctctgagggaattgggaggtcg 
gcggccccaaaagctttcagtccagtgtaaagctgttggagcgcgggagcaaaggtaaag 
aatgatgtaatgcgctggctgctccaaagcatcttttgttgtggaatggttattccagtc 
atctctttatgaatcaaatgtgaggggctgctttgtggacggagtcctttgcaagagcac 
atcaacgggaaagagaaagagacattcacttggagggctcttgctgaaaatgggtttaac 
tctccttttgccagtcaccaccagcctgacctcatacacttttagtacaatggagtggct 
gagcctttgagcacaccaccattacatcatcgtggcaaattaaagaaggaggtgggaaaa 
gaggacttattgttgtcatggcccatgagatgattggaactcaaattgttactgagaggt 
tggtggctctgctggaaagtggaacggaaaaagtgctgctaattgatagccggccatttg 
tggaatacaatacatcccacattttggaagccattaatatcaactgctccaagcttatga 
agcgaaggttgcaacaggacaaagtgttaattacagagctcatccagcattcagcgaaac 
ataaggttgacattgattgcagtcagaaggttgtagtttacgatcaaagctcccaagatg 
ttgcctctctctcttcagactgttttctcactgtacttctgggtaaactggagaagagct 
tcaactctgttcacctgcttgcaggtgggtttgctgagttctctcgttgtttccctggcc 
tctgtgaaggaaaatccactctagtccctacctgcatttctcagccttgcttacctgttg 
ccaacattgggccaacccgaattcttcccaatctttatcttggctgccagcgagatgtcc 
tcaacaaggagctgatgcagcagaatgggattggttatgtgttaaatgccagcaatacct 
gtccaaagcctgactttatccccgagtctcatttcctgcgtgtgcctgtgaatgacagct 
tttgtgagaaaattttgccgtggttggacaaatcagtagatttcattgagaaagcaaaag 
cctccaatggatgtgttctagtgcactgtttagctgggatctcccgctccgccaccatcg 
ctatcgcctacatcatgaagaggatggacatgtctttagatgaagcttacagatttgtga 
aagaaaaaagacctactatatctccaaacttcaattttctgggccaactcctggactatg 
agaagaagattaagaaccagactggagcatcagggccaaagagcaaactcaagctgctgc 
acctggagaagccaaatgaacctgtccctgctgtctcagagggtggacagaaaagcgaga 
cgcccctcagtccaccctgtgccgactctgctacctcagaggcagcaggacaaaggcccg 
tgcatcccgccagcgtgcccagcgtgcccagcgtgcagccgtcgctgttagaggacagcc 
cgctggtacaggcgctcagtgggctgcacctgtccgcagacaggctggaagacagcaata 
agctcaagcgttccttctctctggatatcaaatcagtttcatattcagccagcatggcag 
catccttacatggcttctcctcatcagaagatgctttggaatactacaaaccttccacta 
ctctggatgggaccaacaagctatgccagttctcccctgttcaggaactatcggagcaga 
ctcccgaaaccagtcctgataaggaggaagccagcatccccaagaagctgcagaccgcca 
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Figure 1 (Cont'd) 



ggccttcagacagccagagcaagcgattgcattcggtcagaaccagcagcagtggcaccg 
cccagaggtcccttttatctccactgcatcgaagtgggagcgtggaggacaattaccaca 
ccagcttccttttcggcctttccaccagccagcagcacctcacgaagtctgctggcctgg 
gccttaagggctggcactcggatatcttggccccccagacctctaccccttccctgacca 
gcagctggtattttgccacagagtcctcacacttctactctgcctcagccatctacggag 
gcagtgccagttactctgcctacagctgcagccagctgcccacttgcggagaccaagtct 
attctgtgcgcaggcggcagaagccaagtgacagagctgactcgcggcggagctggcatg 
aagagagcccctttgaaaagcagtttaaacgcagaagctgccaaatggaatttggagaga 
gcatcatgtcagagaacaggtcacgggaagagctggggaaagtgggcagtcagtctagct 
tttcgggcagcatggaaatcattgaggtctcctgagaagaaagacacttgtgacttctat 
agacaatttttttttcttgttcacaaaaaaattccctgtaaatctgaaatatatatatgt 
acatacatatatatttttggaaaatggagctatggtgtaaaagcaacaggtggatcaacc 
cagttgttactctcttaacatctgcatttgagagatcagctaatacttctctcaacaaaa 
atggaagggcagatgctagaatcccccctagacggaggaaaaccattttattcagtgaat 
tacacatcctcttgttcttaaaaaagcaagtgtctttggtgttggaggacaaaatcccct 
accattttcacgttgtgctactaagagatctcaaatattagtctttgtccggacccttcc 
atagtacaccttagcgctgagactgagccagcttgggggtcaggtaggtagaccctgtta 
gggacagagcctagtggtaaatccaagagaaatgatcctatccaaagctgattcacaaac 
ccacgctcacctgacagccgagggacacgagcatcactctgctggacggaccattagggg 
ccttgccaaggtctaccttagagcaaacccagtacctcagacaggaaagtcggggctttg 
accactaccatatctggtagcccattttctaggcattgtgaataggtaggtagctagtca 
cacttttcagaccaattcaaactgtctatgcacaaaattcccgtgggcctagatggagat 
aatttttttttcttctcagctttatgaagagaagggaaactgtctaggattcagctgaac 
caccaggaacctggcaacatcacgatttaagctaaggttgggaggctaacgagtctacct 
ccctctttgtaaatcaaagaattgtttaaaatgggattgtcaatcctttaaataaagatg 
aacttggtttcaagccaaatgtgaatttatttgggttggtagcagagcagcagcaccttc 
aaattctcagccaaagcagatgtttttgccctttctgcttcactgcatggatacagttgg 
taaaatgtaataatatggcagaattttataggaaacttcctagggaggtaaattatggga 
agattaagaaaggtacaaattgctgaggagaagcaggaaacctgtttccttagtggcttt 
tatcccctcggcatgcgatggggctgatgtttctataattgcctcagactttcacattta 
ctagtagggctgagagaggctttagtgaggaaggaatattcagaataaaacggttgagaa 
agctgagaagaccattgagttttgatcagttgtgaatagagtgcaaagccatggccaagc 
tgtttttggaaacgctggccggcgtgtcttcagtggaaaaagcaaatcaaaatggagcga 
gagcaaaggggcgtcctcagtcctcaacctacaatcactgtatggaatcggtcctggcag 
ctgaacataggaggtcactggaacaagtgatagtgcagattggctttcaaacatcctcct 
ggcttgagttttatcagctacaatgtgggtcctcttttgaagccttaattcacaacagca 
gctttttgggggtggggctgggcgggtgttgtcattgttctttcccttcctgtaagtgtc 
gctagttgctgcctcgtatctcaggtttttctctgtttttgagaaatggacagttttttg 
accaggatgtgacttcatgtttcctatggtgacttctaaaaccagcacagaatgatatga 
ctcaacacagaccgacttggttatggggatgatgagccgcacagacctcactagttgtgc 
acaaataatgtgctatgatggggtgtaaagtgaaggcagaagagggtcagccgcattgtt 
atgatactgggaaagtgctggtcaacgatttgagttagtttttagatatacattgaaatc 
tttaatcagacattctcaagtttcacacagtagtttttgatgttatgtacacacacacca 
aatgtgtaacagttcaccacttccagagtgtggtcatgcccaaaacatgtttaagaaagg 
aaagcagtagctccttgctaacgatgtttcaggaggtttggggcacttggttttaatgag 
cttctgtcatttagggcttctcttggccatggtccccttccttctggaactgtgatgtag 
tcacatcctacagcctttagtgctggttcactagtgtcagataatcagttcttggaatcg 
agactgccgtggcgaaggggtggcctcggaggcaggctctggagctgcttggatgtcttt 
aggtggggtggtggctggctgctctcttcagcatgtaattggggaaaccctcgcgtctac 
taggggtgatacagatggtgattttaaagagcaaaactagacttctatgtgagaagtcta 
gttttctcttcagcatgtaattggggaaaccctcgcgtctactaggggtgatacagatgg 
tgattttaaagagcaaaactagacttctatgtgagaagtgctggaaaatgatttaggaca 
tgtgtaaagttagatggaaagactgtaaatgtttaatatgaatatagtgttcttttgaag 
taaggccagctgttgaacggttaaactgtgcatttctcattttgatgtgtcatgtatgtt 
aatgtatgaaatgattaaataaaatcaaaactggtacctgtttatacataaatacgagaa 
aagacctatctttgcagccataaactcggtgggaacaccaccactcaagttgccaaagga 
ggcagtggtgaaacctgtcctgttctcacttaaatgaggatttagctcaaaataaagtgg 
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tggtgtcatcaggtttattccgtgttctgtcattcacatggaacaccggatgattagcta 
acagtttagtgccagccttcattctttactgtgtacgttaaatgcacactacagtgaaaa 
agcctaagacacttggtaaatattttctagctgactgattccagaacacacaagtacatg 
gtaggaaaaaaaaggccagctacacactcatggcccatttcgatataacatgtaaatgat 
gttacgattcctacaggaaaaacaggaagggcaattgttgaagagctgtcctttgtcctt 
gccacatacatctcaacaacctctctgctcatgacactagatttgatgacttatgagact 
agaaacttagtctcatcagtttcataaaaattaggcattgaaggaataaagcttcagtgc 
aaacagatggaatattgactacagaaagaatcacaggtgaccttgaagttggttgagttg 
aagttacatttaagaaaatttataagaaagattgggccgggtgagctagactagaagagc 
tatcttggccaggaccctgcagaaaaagaggtggaagtgagatagaggaataaagggaat 
tagaataaatcagcctccagagatagagggtaacaggaggaacagttgtttcagtgagag 
gcttccaatgagaagatgcagatcttgggagagcctgaagttcaggccagttgctgacag 
tgctc 

SEQ ID NO:305. >T23935_T10 # TY Transcript # LN 4864 # Source Gene: T23935 
# Encoded protein: T23935_P1 

ctctcccccccaacccccactttcacaacgcggccccccacctttacctccctaagttct 
tcatctctttcgacattttttttgaaggctgatgggggggagggtggaagagaaagagtg 
aagaaaagttgcgagcgtctcctctcttcctaagcctcccgccccaacccacccctcaga 
gaaggagaagataatatactgaaaagaagaggaggaggagagcgacgggacgggacgcga 
gcgggagcgcagccgccctctcggctccgcggcggcgcctcgcaagtccgggaggcgagg 
ggggcccgaggggagacgccgtgacaactttcgtttccctctgagggaattgggaggtcg 
gcggccccaaaagctttcagtccagtgtaaagctgttggagcgcgggagcaaaggtaaag 
aatgatgtaatgcgctggctgctccaaagcatcttttgttgtggaatggttattccagtc 
atctctttatgaatcaaatgtgaggggctgctttgtggacggagtcctttgcaagagcac 
atcaacgggaaagagaaagagacattcacttggagggctcttgctgaaaatgggtttaac 
tctccttttgccagtcaccaccagcctgacctcatacacttttagtacaatggagtggct 
gagcctttgagcacaccaccattacatcatcgtggcaaattaaagaaggaggtgggaaaa 
gaggacttattgttgtcatggcccatgagatgattggaactcaaattgttactgagaggt 
tggtggctctgctggaaagtggaacggaaaaagtgctgctaattgatagccggccatttg 
tggaatacaatacatcccacattttggaagccattaatatcaactgctccaagcttatga 
agcgaaggttgcaacaggacaaagtgttaattacagagctcatccagcattcagcgaaac 
ataaggttgacattgattgcagtcagaaggttgtagtttacgatcaaagctcccaagatg 
ttgcctctctctcttcagactgttttctcactgtacttctgggtaaactggagaagagct 
tcaactctgttcacctgcttgcaggtgggtttgctgagttctctcgttgtttccctggcc 
tctgtgaaggaaaatccactctagtccctacctgcatttctcagccttgcttacctgttg 
ccaacattgggccaacccgaattcttcccaatctttatcttggctgccagcgagatgtcc 
tcaacaaggagctgatgcagcagaatgggattggttatgtgttaaatgccagcaatacct 
gtccaaagcctgactttatccccgagtctcatttcctgcgtgtgcctgtgaatgacagct 
tttgtgagaaaattttgccgtggttggacaaatcagtagatttcattgagaaagcaaaag 
cctccaatggatgtgttctagtgcactgtttagctgggatctcccgctccgccaccatcg 
ctatcgcctacatcatgaagaggatggacatgtctttagatgaagcttacagatttgtga 
aagaaaaaagacctactatatctccaaacttcaattttctgggccaactcctggactatg 
agaagaagattaagaaccagactggagcatcagggccaaagagcaaactcaagctgctgc 
acctggagaagccaaatgaacctgtccctgctgtctcagagggtggacagaaaagcgaga 
cgcccctcagtccaccctgtgccgactctgctacctcagaggcagcaggacaaaggcccg 
tgcatcccgccagcgtgcccagcgtgcccagcgtgcagccgtcgctgttagaggacagcc 
cgctggtacaggcgctcagtgggctgcacctgtccgcagacaggctggaagacagcaata 
agctcaagcgttccttctctctggatatcaaatcagtttcatattcagccagcatggcag 
catccttacatggcttctcctcatcagaagatgctttggaatactacaaaccttccacta 
ctctggatgggaccaacaagctatgccagttctcccctgttcaggaactatcggagcaga 
ctcccgaaaccagtcctgataaggaggaagccagcatccccaagaagctgcagaccgcca 
ggccttcagacagccagagcaagcgattgcattcggtcagaaccagcagcagtggcaccg 
cccagaggtcccttttatctccactgcatcgaagtgggagcgtggaggacaattaccaca 
ccagcttccttttcggcctttccaccagccagcagcacctcacgaagtctgctggcctgg 
gccttaagggctggcactcggatatcttggccccccagacctctaccccttccctgacca 
gcagctggtattttgccacagagtcctcacacttctactctgcctcagccatctacggag 
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gcagtgccagttactctgcctacagctgcagccagctgcccacttgcggagaccaagtct 
attctgtgcgcaggcggcagaagccaagtgacagagctgactcgcggcggagctggcatg 
aagagagcccctttgaaaagcagtttaaacgcagaagctgccaaatggaatttggagaga 
gcatcatgtcagagaacaggtcacgggaagagctggggaaagtgggcagtcagtctagct 
tttcgggcagcatggaaatcattgaggtctcctgagaagaaagacacttgtgacttctat 
agacaatttttttttcttgttcacaaaaaaattccctgtaaatctgaaatatatatatgt 
acatacatatatatttttggaaaatggagctatggtgtaaaagcaacaggtggatcaacc 
cagttgttactctcttaacatctgcatttgagagatcagctaatacttctctcaacaaaa 
atggaagggcagatgctagaatcccccctagacggaggaaaaccattttattcagtgaat 
tacacatcctcttgttcttaaaaaagcaagtgtctttggtgttggaggacaaaatcccct 
accattttcacgttgtgctactaagagatctcaaatattagtctttgtccggacccttcc 
atagtacaccttagcgctgagactgagccagcttgggggtcaggtaggtagaccctgtta 
gggacagagcctagtggtaaatccaagagaaatgatcctatccaaagctgattcacaaac 
ccacgctcacctgacagccgagggacacgagcatcactctgctggacggaccattagggg 
ccttgccaaggtctaccttagagcaaacccagtacctcagacaggaaagtcggggctttg 
accactaccatatctggtagcccattttctaggcattgtgaataggtaggtagctagtca 
cacttttcagaccaattcaaactgtctatgcacaaaattcccgtgggcctagatggagat 
aatttttttttcttctcagctttatgaagagaagggaaactgtctaggattcagctgaac 
caccaggaacctggcaacatcacgatttaagctaaggttgggaggctaacgagtctacct 
ccctctttgtaaatcaaagaattgtttaaaatgggattgtcaatcctttaaataaagatg 
aacttggtttcaagccaaatgtgaatttatttgggttggtagcagagcagcagcaccttc 
aaattctcagccaaagcagatgtttttgccctttctgcttcactgcatggatacagttgg 
taaaatgtaataatatggcagaattttataggaaacttcctagggaggtaaattatggga 
agattaagaaaggtacaaattgctgaggagaagcaggaaacctgtttccttagtggcttt 
tatcccctcggcatgcgatggggctgatgtttctataattgcctcagactttcacattta 
ctagtagggctgagagaggctttagtgaggaaggaatattcagaataaaacggttgagaa 
agctgagaagaccattgagttttgatcagttgtgaatagagtgcaaagccatggccaagc 
tgtttttggaaacgctggccggcgtgtcttcagtggaaaaagcaaatcaaaatggagcga 
gagcaaaggggcgtcctcagtcctcaacctacaatcactgtatggaatcggtcctggcag 
ctgaacataggaggtcactggaacaagtgatagtgcagattggctttcaaacatcctcct 
ggcttgagttttatcagctacaatgtgggtcctcttttgaagccttaattcacaacagca 
gctttttgggggtggggctgggcgggtgttgtcattgttctttcccttcctgtaagtgtc 
gctagttgctgcctcgtatctcaggtttttctctgtttttgagaaatggacagttttttg 
accaggatgtgacttcatgtttcctatggtgacttctaaaaccagcacagaatgatatga 
ctcaacacagaccgacttggttatggggatgatgagccgcacagacctcactagttgtgc 
acaaataatgtgctatgatggggtgtaaagtgaaggcagaagagggtcagccgcattgtt 
atgatactgggaaagtgctggtcaacgatttgagttagtttttagatatacattgaaatc 
tttaatcagacattctcaagtttcacacagtagtttttgatgttatgtacacacaccatc 
tctaatatgacgcccagcacacaagtacagaggctggccaggcgaataaaaaggataagc 
aaggtaagcggtcacccctggggaacgaggcggccagcgatgggggggatggggtcataa 
tagc 

SEQ ID NO:306. >T23935_T11 # TY Transcript # LN 4493 # Source Gene: T23935 
# Encoded protein: T23935_P1 

ctctcccccccaacccccactttcacaacgcggccccccacctttacctccctaagttct 
tcatctctttcgacattttttttgaaggctgatgggggggagggtggaagagaaagagtg 
aagaaaagttgcgagcgtctcctctcttcctaagcctcccgccccaacccacccctcaga 
gaaggagaagataatatactgaaaagaagaggaggaggagagcgacgggacgggacgcga 
gcgggagcgcagccgccctctcggctccgcggcggcgcctcgcaagtccgggaggcgagg 
ggggcccgaggggagacgccgtgacaactttcgtttccctctgagggaattgggaggtcg 
gcggccccaaaagctttcagtccagtgtaaagctgttggagcgcgggagcaaaggtaaag 
aatgatgtaatgcgctggctgctccaaagcatcttttgttgtggaatggttattccagtc 
atctctttatgaatcaaatgtgaggggctgctttgtggacggagtcctttgcaagagcac 
atcaacgggaaagagaaagagacattcacttggagggctcttgctgaaaatgggtttaac 
tctccttttgccagtcaccaccagcctgacctcatacacttttagtacaatggagtggct 
gagcctttgagcacaccaccattacatcatcgtggcaaattaaagaaggaggtgggaaaa 
gaggacttattgttgtcatggcccatgagatgattggaactcaaattgttactgagaggt 
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tggtggctctgctggaaagtggaacggaaaaagtgctgctaattgatagccggccatttg 
tggaatacaatacatcccacattttggaagccattaatatcaactgctccaagcttatga 
agcgaaggttgcaacaggacaaagtgttaattacagagctcatccagcattcagcgaaac 
ataaggttgacattgattgcagtcagaaggttgtagtttacgatcaaagctcccaagatg 
ttgcctctctctcttcagactgttttctcactgtacttctgggtaaactggagaagagct 
tcaactctgttcacctgcttgcaggtgggtttgctgagttctctcgttgtttccctggcc 
tctgtgaaggaaaatccactctagtccctacctgcatttctcagccttgcttacctgttg 
ccaacattgggccaacccgaattcttcccaatctttatcttggctgccagcgagatgtcc 
tcaacaaggagctgatgcagcagaatgggattggttatgtgttaaatgccagcaatacct 
gtccaaagcctgactttatccccgagtctcatttcctgcgtgtgcctgtgaatgacagct 
tttgtgagaaaattttgccgtggttggacaaatcagtagatttcattgagaaagcaaaag 
cctccaatggatgtgttctagtgcactgtttagctgggatctcccgctccgccaccatcg 
ctatcgcctacatcatgaagaggatggacatgtctttagatgaagcttacagatttgtga 
aagaaaaaagacctactatatctccaaacttcaattttctgggccaactcctggactatg 
agaagaagattaagaaccagactggagcatcagggccaaagagcaaactcaagctgctgc 
acctggagaagccaaatgaacctgtccctgctgtctcagagggtggacagaaaagcgaga 
cgcccctcagtccaccctgtgccgactctgctacctcagaggcagcaggacaaaggcccg 
tgcatcccgccagcgtgcccagcgtgcccagcgtgcagccgtcgctgttagaggacagcc 
cgctggtacaggcgctcagtgggctgcacctgtccgcagacaggctggaagacagcaata 
agctcaagcgttccttctctctggatatcaaatcagtttcatattcagccagcatggcag 
catccttacatggcttctcctcatcagaagatgctttggaatactacaaaccttccacta 
ctctggatgggaccaacaagctatgccagttctcccctgttcaggaactatcggagcaga 
ctcccgaaaccagtcctgataaggaggaagccagcatccccaagaagctgcagaccgcca 
ggccttcagacagccagagcaagcgattgcattcggtcagaaccagcagcagtggcaccg 
cccagaggtcccttttatctccactgcatcgaagtgggagcgtggaggacaattaccaca 
ccagcttccttttcggcctttccaccagccagcagcacctcacgaagtctgctggcctgg 
gccttaagggctggcactcggatatcttggccccccagacctctaccccttccctgacca 
gcagctggtattttgccacagagtcctcacacttctactctgcctcagccatctacggag 
gcagtgccagttactctgcctacagctgcagccagctgcccacttgcggagaccaagtct 
attctgtgcgcaggcggcagaagccaagtgacagagctgactcgcggcggagctggcatg 
aagagagcccctttgaaaagcagtttaaacgcagaagctgccaaatggaatttggagaga 
gcatcatgtcagagaacaggtcacgggaagagctggggaaagtgggcagtcagtctagct 
tttcgggcagcatggaaatcattgaggtctcctgagaagaaagacacttgtgacttctat 
agacaatttttttttcttgttcacaaaaaaattccctgtaaatctgaaatatatatatgt 
acatacatatatatttttggaaaatggagctatggtgtaaaagcaacaggtggatcaacc 
cagttgttactctcttaacatctgcatttgagagatcagctaatacttctctcaacaaaa 
atggaagggcagatgctagaatcccccctagacggaggaaaaccattttattcagtgaat 
tacacatcctcttgttcttaaaaaagcaagtgtctttggtgttggaggacaaaatcccct 
accattttcacgttgtgctactaagagatctcaaatattagtctttgtccggacccttcc 
atagtacaccttagcgctgagactgagccagcttgggggtcaggtaggtagaccctgtta 
gggacagagcctagtggtaaatccaagagaaatgatcctatccaaagctgattcacaaac 
ccacgctcacctgacagccgagggacacgagcatcactctgctggacggaccattagggg 
ccttgccaaggtctaccttagagcaaacccagtacctcagacaggaaagtcggggctttg 
accactaccatatctggtagcccattttctaggcattgtgaataggtaggtagctagtca 
cacttttcagaccaattcaaactgtctatgcacaaaattcccgtgggcctagatggagat 
aatttttttttcttctcagctttatgaagagaagggaaactgtctaggattcagctgaac 
caccaggaacctggcaacatcacgatttaagctaaggttgggaggctaacgagtctacct 
ccctctttgtaaatcaaagaattgtttaaaatgggattgtcaatcctttaaataaagatg 
aacttggtttcaagccaaatgtgaatttatttgggttggtagcagagcagcagcaccttc 
aaattctcagccaaagcagatgtttttgccctttctgcttcactgcatggatacagttgg 
taaaatgtaataatatggcagaattttataggaaacttcctagggaggtaaattatggga 
agattaagaaaggtacaaattgctgaggagaagcaggaaacctgtttccttagtggcttt 
tatcccctcggcatgcgatggggctgatgtttctataattgcctcagactttcacattta 
ctagtagggctgagagaggctttagtgaggaaggaatattcagaataaaacggttgagaa 
agctgagaagaccattgagttttgatcagttgtgaatagagtgcaaagccatggccaagc 
tgtttttggaaacgctggccggcgtgtcttcagtggaaaaagcaaatcaaaatggagcga 
gagcaaaggggcgtcctcagtcctcaacctacaatcactgtatggaatcggtcctggcag 
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ctgaacataggaggtcactggaacaagtgatagtgcagattggctttcaaacatcctcct 
ggcttgagttttatcagctacaatgtgggtcctcttttgaagccttaattcacaacagca 
gctttttgggggtggggctgggcgggtgttgtcattgttctttcccttcctgtaagtgtc 
gctagttgctgcctcgtatctcaggtttttctctcattttgatgtgtcatgtatgttaat 
gtatgaaatgattaaataaaatcaaaactggtacctgtttatacataaatacg 

SEQ ID NO: 307. >T23935_T12 # TY Transcript ft LN 3587 # Source Gene: T23935 
# Encoded protein: T23935_P4 

ctctcccccccaacccccactttcacaacgcggccccccacctttacctccctaagttct 
tcatctctttcgacattttttttgaaggctgatgggggggagggtggaagagaaagagtg 
aagaaaagttgcgagcgtctcctctcttcctaagcctcccgccccaacccacccctcaga 
gaaggagaagataatatactgaaaagaagaggaggaggagagcgacgggacgggacgcga 
gcgggagcgcagccgccctctcggctccgcggcggcgcctcgcaagtccgggaggcgagg 
ggggcccgaggggagacgccgtgacaactttcgtttccctctgagggaattgggaggtcg 
gcggccccaaaagctttcagtccagtgtaaagctgttggagcgcgggagcaaaggtaaag 
aatgatgtaatgcgctggctgctccaaagcatcttttgttgtggaatggttattccagtc 
atctctttatgaatcaaatgtgaggggctgctttgtggacggagtcctttgcaagagcac 
atcaacgggaaagagaaagagacattcacttggagggctcttgctgaaaatgggtttaac 
tctccttttgccagtcaccaccagcctgacctcatacacttttagtacaatggagtggct 
gagcctttgagcacaccaccattacatcatcgtggcaaattaaagaaggaggtgggaaaa 
gaggacttattgttgtcatggcccatgagatgattggaactcaaattgttactgagaggt 
tggtggctctgctggaaagtggaacggaaaaagtgctgctaattgatagccggccatttg 
tggaatacaatacatcccacattttggaagccattaatatcaactgctccaagcttatga 
agcgaaggttgcaacaggacaaagtgttaattacagagctcatccagcattcagcgaaac 
ataaggttgacattgattgcagtcagaaggttgtagtttacgatcaaagctcccaagatg 
ttgcctctctctcttcagactgttttctcactgtacttctgggtaaactggagaagagct 
tcaactctgttcacctgcttgcaggtgggtttgctgagttctctcgttgtttccctggcc 
tctgtgaaggaaaatccactctagtccctacctgcatttctcagccttgcttacctgttg 
ccaacattgggccaacccgaattcttcccaatctttatcttggctgccagcgagatgtcc 
tcaacaaggagctgatgcagcagaatgggattggttatgtgttaaatgccagcaatacct 
gtccaaagcctgactttatccccgagtctcatttcctgcgtgtgcctgtgaatgacagct 
tttgtgagaaaattttgccgtggttggacaaatcagtagatttcattgagaaagcaaaag 
cctccaatggatgtgttctagtgcactgtttagctgggatctcccgctccgccaccatcg 
ctatcgcctacatcatgaagaggatggacatgtctttagatgaagcttacagatttgtga 
aagaaaaaagacctactatatctccaaacttcaattttctgggccaactcctggactatg 
agaagaagattaagaaccagactggagcatcagggccaaagagcaaactcaagctgctgc 
acctggagaagccaaatgaacctgtccctgctgtctcagagggtggacagaaaagcgaga 
cgcccctcagtccaccctgtgccgactctgctacctcagaggcagcaggacaaaggcccg 
tgcatcccgccagcacaggctggaagacagcaataagctcaagcgttccttctctctgga 
tatcaaatcagtttcatattcagccagcatggcagcatccttacatggcttctcctcatc 
agaagatgctttggaatactacaaaccttccactactctggatgggaccaacaagctatg 
ccagttctcccctgttcaggaactatcggagcagactcccgaaaccagtcctgataagga 
ggaagccagcatccccaagaagctgcagaccgccaggccttcagacagccagagcaagcg 
attgcattcggtcagaaccagcagcagtggcaccgcccagaggtcccttttatctccact 
gcatcgaagtgggagcgtggaggacaattaccacaccagcttccttttcggcctttccac 
cagccagcagcacctcacgaagtctgctggcctgggccttaagggctggcactcggatat 
cttggccccccagacctctaccccttccctgaccagcagctggtattttgccacagagtc 
ctcacacttctactctgcctcagccatctacggaggcagtgccagttactctgcctacag 
ctgcagccagctgcccacttgcggagaccaagtctattctgtgcgcaggcggcagaagcc 
aagtgacagagctgactcgcggcggagctggcatgaagagagcccctttgaaaagcagtt 
taaacgcagaagctgccaaatggaatttggagagagcatcatgtcagagaacaggtcacg 
ggaagagctggggaaagtgggcagtcagtctagcttttcgggcagcatggaaatcattga 
ggtctcctgagaagaaagacacttgtgacttctatagacaatttttttttcttgttcaca 
aaaaaattccctgtaaatctgaaatatatatatgtacatacatatatatttttggaaaat 
ggagctatggtgtaaaagcaacaggtggatcaacccagttgttactctcttaacatctgc 
atttgagagatcagctaatacttctctcaacaaaaatggaagggcagatgctagaatccc 
ccctagacggaggaaaaccattttattcagtgaattacacatcctcttgttcttaaaaaa 
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gcaagtgtctttggtgttggaggacaaaatcccctaccattttcacgttgtgctactaag 
agatctcaaatattagtctttgtccggacccttccatagtacaccttagcgctgagactg 
agccagcttgggggtcaggtaggtagaccctgttagggacagagcctagtggtaaatcca 
agagaaatgatcctatccaaagctgattcacaaacccacgctcacctgacagccgaggga 
cacgagcatcactctgctggacggaccattaggggccttgccaaggtctaccttagagca 
aacccagtacctcagacaggaaagtcggggctttgaccactaccatatctggtagcccat 
tttctaggcattgtgaataggtaggtagctagtcacacttttcagaccaattcaaactgt 
ctatgcacaaaattcccgtgggcctagatggagataatttttttttcttctcagctttat 
gaagagaagggaaactgtctaggattcagctgaaccaccaggaacctggcaacatcacga 
tttaagctaaggttgggaggctaacgagtctacctccctctttgtaaatcaaagaattgt 
ttaaaatgggattgtcaatcctttaaataaagatgaacttggtttca 

SEQ ID NO: 308. >T23935_T13 # TY Transcript # LN 5791 # Source Gene: T23935 
# Encoded protein: 

aatgcctctttatgccactcaccttatggggctgatggagagtccacgtaaaggataacc 
gtttcctcaaaggatttatgctgtttcacctccatcaagaaaagggctttttaaagttaa 
ggctttcagtccagtgtaaagctgttggagcgcgggagcaaaggtaaagaatgatgtaat 
gcgctggctgctccaaagcatcttttgttgtggaatggttattccagtcatctctttatg 
aatcaaatgtgaggggctgctttgtggacggagtcctttgcaagagcacatcaacgggaa 
agagaaagagacattcacttggagggctcttgctgaaaatgggtttaactctccttttgc 
cagtcaccaccagcctgacctcatacacttttagtacaatggagtggctgagcctttgag 
cacaccaccattacatcatcgtggcaaattaaagaaggaggtgggaaaagaggacttatt 
gttgtcatggcccatgagatgattggaactcaaattgttactgagaggttggtggctctg 
ctggaaagtggaacggaaaaagtgctgctaattgatagccggccatttgtggaatacaat 
acatcccacattttggaagccattaatatcaactgctccaagcttatgaagcgaaggttg 
caacaggacaaagtgttaattacagagctcatccagcattcagcgaaacataaggtaaac 
gctttcctttttgccttttcaaaataaatggacacactgaatgaggaataattttaattg 
aagcctcctctttaagtagagaaagtaatttgaggaagttttggttgctttttaaatttt 
ggtgccttgtggtacctcaaaataatttattgtccatattgccacatttaataatatttc 
taaactatgaaggattaacttgctttagtttggacaagtgctatattttcttgtggcttg 
ctaaaattattttattaaacactattctatattatctgcaaatttactgaaaggaaagtc 
attcttttcaagttctttggaaagaaagtaggaactaattattaggggtgctttttatgt 
gtcataacacatgtggtaacattggtcaagacttaatgatatcttcaagggaagttattc 
ctgggatactaaaatgttgtctgtgtggtgaggaagttggtttttttcttcacaaataca 
aatgagctcatgttttaatttaaataaacaactgatttctttgttaaggtgtaaaagtca 
ttttaaaaaaagtttgtttaggtccagggttatgtttattttgcatattgtttggtttat 
tatgaactttaccattttatagtataaaggatttaaaaccagggaatcgtgagatcacct 
acacagaaatagcatcattttggtttttaagagttttgtatgtgtctgtctcgtaaattg 
actacatctcatttttattcaaattattaattgaaaacaggaagcaattattcgccccag 
cactggtaaatgtgagatcttaagcagtagtttgagattacttttctcaccttcttccta 
acaagcctcatgtttgcatttcaggttgacattgattgcagtcagaaggttgtagtttac 
gatcaaagctcccaagatgttgcctctctctcttcagactgttttctcactgtacttctg 
ggtaaactggagaagagcttcaactctgttcacctgcttgcaggtaaggctcgtgatgga 
attaaaaggactcagaatcttgctgagctgtatgagtaagcatatttcactgtgaactca 
gtacaatagatcatcccaactttgctgtgtaaaatgatttatttatagcttttgattgat 
gtcaccgttctatttaacttgaatactatatcattcttccacataggtcagtgtattttt 
gacttggctcaagtgcctctttgcttgctattaactggtagtgccacatagattygcaca 
cccatatggatagctcaaagccagctgcttctgtctgcctttttccattcacctgtgatg 
taagaacaccaaatgatgaacatctcagctcctgctagttcccacccctgtggcaagaac 
cagaccaatattacagtgtcaaactcttacttaacatttatagttggagcggactccaga 
gataagccttatggtcacctttctgggcattataaacaatgaaatagggggagaatgtgt 
gcagaatgtttcttctggtaattaggggaattgttttaaaagttacttgtatatttttat 
ttgttttataattaaaaaaaagtctcaacgtagaaatgctaagcaaccagatgatcttat 
aaagtgtgtgggaaggaccaaatattctatatctctcctggaggacggcattgtaggagt 
taatcccatcaaattccaaagaaggctattcatcctttgcttctcgttttgcaaatgaaa 
atcctttcccttgccctgtgggctcatagagctgggaatatagtttgacaacacacagta 
accttaacataaagctgctgtgttgggcggattcagcttcagtgataggcaagagtgctt 
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ttaaacatttggtctaagaaaaacaaaacattaagattaattagttgtwggtgtgttaat 
aattttaaactctgggcaggtttagataagatttccatgtctccagttctctctcacaag 
cttttcctcagttctctaagaacaaggatagttcatctttttcagattgtctatctgtac 
acctttcatttatgctgtgattccgtctgatctttcaattagtgttcttaagtgaagcca 
gcttgtcttagaatgtagttttcagaagtctgtaactgagaattataatattagattctg 
tctctgttgatagacacattcttttactgggagtatgagtgtgttaaatgtaacagctat 
tattagcatttcactaaactaattgggtgagtttaggaactcatttcatgtttcaggtgg 
ccactgtgtgagggactttaccttggaatcctgttaatgggatcagggtttttcttttct 
ttttttttttttgagacagaattttgccctgtcgcccaggctagagtgcggtggtgtgac 
cttggcttgctgcaacctctgcttcccaggttcaagcgattctcctgcctcagcctccca 
agtagctgggattctgagcacctgccaccacgcctggctaatttttgtattttagtagag 
accgggtttcaccatgttggccagtctggtctcgaactttgacctcaggtgatctgcctg 
cctcggcctcccaaagtgctgggattacaggcatgagccaccacacccagccagagtctt 
tcttaaagaagcaatatgccaagatgtgcttcgtgacagagtggttatcccaccatgttc 
ttgtttgagttacttttaccctatggggagaccttactcaacatctcggtcaagtagaca 
gccccccagaggagtgagctcacattgagtttctagcacaggtgctttagctacttgtac 
cctttactcctttaccaaagagcattcctttgttacctgggaaggttcttaatctatcct 
gtgtagccttgggaggggcattcactgggcaataaggaagaaacaggactgcctgattag 
ctatgtcaactctagggacacgtggtgtgacacactgagtcagaacaccctgacatccct 
gtttcaaacaggaagtgtttcttaggcctaacggcaagtccatggaggcagcaccaggag 
ccacattaaatgtattagtagcaccatatcttcatccatacctccagaatcctgtgtttc 
tattagcttaggaggtgacttcagagtctgaattgcttgtttgggaatttctgagcagct 
ggatttcttgtgacacctaagagagggttctagatgattagaagccttcccaatcatcca 
aaagcaattggggattctcttggttagtgaattttttcgtcagggtaggtagataattgg 
ggtaagtggataacagtattggagaaggtttggccatcaattcttgggaaactgttttat 
tgcaaaggtccagctgctcagaggacaagccagggatgtgggacagaaggacatattgac 
aaaagtttcttaaatattttggttggggagaaataactgaaaagcaccctagtttctcac 
aggggctactgtaagttacaaggtcactggagacaattttcactacaaactgcccctgtg 
actttcaacagttggagattgttgtcagtctcgtttggacatggtaatagaaagacaaag 
tggtttcttggaataatttttccatcggtttagaactttcagccaccttgagaaaagaaa 
gcatcagatatgtgagatggtttttatggtcattagctatggcagtgaggaattgtggcc 
ccacatctactcacactctcagggctaccttacccacggaagaatgactgtgaataagga 
gaacaagaggactcccaagaaattgatagcagtatcttaacgagctcttttccttcaaac 
aagaggagactgacacttacctggaggacctacttacatggtcatgaaaaatgaaggatt 
actgaaggagaaggaagtgggatttggggaacctctacaggaatgcagtgggcttagttt 
tttaatatggaccaggtcttgtttacctttgtgttcccgcaaggcctagcccttcttaag 
ttttcagtaaatattttgatattagcttacctgaaggttttatattgtttatatttccta 
tgatttatcagtctagaatataagcatattaagcagtgatgaagtctgaaagtagagaaa 
acttcagattgtttcaaaataggtgatttggaaggtgtatttattctgataaagcaaata 
tatagctgcgatgggaaaatatctaatatgaaactttcttcatccatataagcagatgtg 
aagatggtattaaaatctgatgaaaatgctgctgtagtgaaacagtgctaattttctgtg 
gtccttaaatactgatatttgataaagagctaacaaccctagaaatctagctccactcat 
aggaatacgcaagtccttgaattcatgttaattgagaacagtcagaataattattgtatt 
tttaaaaatgtaaatgtaaattgattgctctagtcattagtaaataacctactaaccaag 
tcaaactaagccatagcattgtttcactcccagtggattaaagggtttaatatcagtagc 
agctgggtgtggtggtgcacccctctatttccagctactcgggaagctgatgggggagga 
ttgcttgagcccaggagttcgaggttacagtgagctgtgattgcaacactgcactacagt 
ctgggtgacagagagcctgtctctaaaaaaacaaaacacaaaattatcagtagcaatctc 
atcttaaaaaaaaaacaaatttggatgctttagaaagttctccattactgcccggccatg 
gtgacttacacctgtaatcccagaactttgggaggcygaggtgggtggatcacctgaggt 
caggagttcgagaccagcctggccaacatggcaaaaccccatctctactaaaaatataaa 
aattagccgggcatggtggcatctgcctgtagtctcagctactcaggaggctgaggcagg 
agaattgcttgaacccgggaggcagaggttgcagtgagccaagatcgcatcactgcaccc 
tggtgtaggcaacwgagtgagactctgtctc 

SEQ ID NO: 309. >T23935_T14 # TY Transcript # LN 3689 # Source Gene: T23935 
# Encoded protein: 
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taattcaaggcaaaatctccaggccactttttgaattcagcccatattcactacaattgc 
taagtcttctcctctgtggggttttttgttgttgttcttgctttttatttactcaagcag 
aagaaagacacaagaatgataacccacatacatacagagtagtagaatggcaagagctaa 
ttattctcaaacacaaattcttaaagataaatgtaaaatttttttttaatttttaaattt 
ttttatttttatttattattatactttaagttttagggatctgtggggttttaagccatg 
gttagtgatagcatgtcatttgaagatgtacccccttaatgacttacaggctcatagtga 
ttttttcttttttgttgttgttatttagattagatttgggtaagtctccgggttggttgg 
aaggactcaggtttatgtggagttttgtcttttagttaatgtgccagtaacagagtttgc 
ccgtttgtgctatgggatactcatgaggtgcccccaaacccagctggctacatatacagc 
ctcttgcttcctcttcactgtctcctgggctcttcctccagctctcctcatgccaaactc 
gtgactgcatggacacaacatagggaggtgattaggaccatgtattttggaggcaaactg 
tctgtatttgagtcctgcctaccctggttattagtgaagtggtttgagtaagttacttca 
aacttctgtgcacttcagtttcttgatctggaaactgaaggtaatatacctaccttacag 
ggatattgtgaggattaaaatagataatgtaaaatgaaaaaaaaagaaaaaaagtaaacc 
aaaaagataaaataaaaataagtaaaatagataatgtccattgaacaccttgtaaagtac 
ctgactcataattggctgttaataartattttaaaaattgttatctgaagttcctccttt 
tttctgtcctttgagctggactctgtaacacctcttagtattacttccttctttttagct 
tttgtaacttctcatcctcactgcatctttgcccttttcatttgcagttctctctcccct 
ttcttaacctagattttagccttaattgcggtaatggccacctgcttgtttccagttttc 
atcatcattgccttctcgttctttattcagaaacctgaagaggcttcttagttttctgcr 
ttaaaaaggtagccttggccaggtgcagcgtgtgtaatcaatcacagcactttgggaagc 
ggaggcgggaggattgcttgaggccaggagtttgacaccagcctagtcaacatagcaaga 
tcctgtctctacaaaaaattttcaaaaattagccaggcatggtggtgtgcacctgtagtc 
ctagctgggaggctgaggcaggaggatcgcttgagcctaggagctcacagttacagtgag 
ctatgtttgcactatggcactccagcctggatgaacagagtgagaccctgtctgaaaaaa 
caaatgtagcccttagcctaacattcccaccaccctcaaacacctgcttcccacctttga 
aaccttctcaaaccttctctctggtttcttctccaaggcaggctgttttcgaatgagcac 
tttgctcagtcccactgcagggctttctcgcacacatttcccttctcctctctgccagtt 
gtgtgtgttaaattaattcctttgcttgaagacacagtttagggctcatgtctcccaaga 
tgcatgctggtgaacctcaactccagtatagtcacttgtttttttcatttttaaaagacg 
tcatcaaaagagttgtggttataaaaacagttgtaagcacattgcagaaaatttgaaaaa 
cagagaaaaggaaacctataacctcaacatcgtaacgcatgttactgtcattgttttatt 
tccttccagccttcttctttttttattaaaatctcttttgaaaatatgtatcagccagta 
tcagctggatgcctaccatgtgcctagcgttaagtcttttgggggattgttagaagcaag 
caagatccttctaggtccttataatctaggttttgagatagataattattaaaattaatg 
agagagtattgtcaacattaaaaaacaaaaacccagactccttccagtaaggatttgaag 
cagttttcaataataggttagaaccaaaagatacacatattctcacacatacacattata 
atggtataaaggagcaataatcaagtaataggggctaggaaatgtacagtaactgacaaa 
aagtctgaattgctacttcttattgaaattgagtagaaaccttagctttgagcttcttag 
cagtagggcagaaagggaaggaaacactgagagtaacgtggggtaaatgatagctgagaa 
ggaaggggatgctggaggcacaggaaggcatgagactgtgtacacgagtttatatgttct 
tacttgtatagcatgcaattctttgtttgagtgttgcttggtattcatttgttagtgttt 
tattgtgtttggtttcctgatctagaagggaagctgcttgctgcttaattattcatatgc 
tacccaaaacctgtatgttgctttatatatgatgaataattatgttatatataataatca 
ttacactagaaacttatatttatagcataggcagtgagattgaatagctttgtattattc 
agctgccaaagtaagattctttgcgcatctgttaatcgcaagaattttaacacaaggttc 
agtagacttttggtagtatctaccaagaatgaaaaggggagaattcagcttcatgactga 
gcacacatgtaattatttagaattgaggcagtcatggaactggccagaatgttagtagct 
tccaggctgtgtggaataacctcgtcccctgccacatgtacatattgttaagcccctcag 
cagtgagagatggcagcactgcttggatttccttggttctttggttctaacaaagtgtcc 
gtgagtccttccatcatgttgtggggaattgagataatgccataagacaataactcctgg 
tatcaaggaagggataaaagttctcttggatctcattaaagtgacctgctaaacctgatt 
catagcctatacaaggttttcttctaaaactcagtaaataaagttgagccattttatgac 
atacagaaaagttgcttaaaatagggaggccaggggcagtggctcactcctgtaatccca 
gcactttgggaggctgaggtggacggatcacttgaggccaggagttagagaccagcctgc 
ccaacatggtgaaaccgcaactccactaaaaatagaaaaattagccaggcgtggtgctgc 
acacctgtaatcccagctactctggaggctaaggcaggagaatcacttgaatccgggagg 
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cggagattgcaatgagccgacatcgtgcactccagcctgggcgacagagcaagactctat 
ctcaaaatagtaataatgattttaaaaaatgtaattatatttcttaggtaatcagggaaa 
aatttgaagtgtcagataacaggaatatattaaatattacccatatcttaggggaaaata 
ctaacataagtcaggaaacctgagttagtagtagcttgcttattaacgaattgcttatta 
acaaacagttttggaccccagttgtcctg 

SEQ ID NO: 310 . >T23935_T15 # TY Transcript # LN 2179 # Source Gene: T23935 

# Encoded protein: T23935_P5 

ctctcccccccaacccccactttcacaacgcggccccccacctttacctccctaagttct 
tcatctctttcgacattttttttgaaggctgatgggggggagggtggaagagaaagagtg 
aagaaaagttgcgagcgtctcctctcttcctaagcctcccgccccaacccacccctcaga 
gaaggagaagataatatactgaaaagaagaggaggaggagagcgacgggacgggacgcga 
gcgggagcgcagccgccctctcggctccgcggcggcgcctcgcaagtccgggaggcgagg 
ggggcccgaggggagacgccgtgacaactttcgtttccctctgagggaattgggaggtcg 
gcggccccaaaagctttcagtccagtgtaaagctgttggagcgcgggagcaaaggtaaag 
aatgatgtaatgcgctggctgctccaaagcatcttttgttgtggaatggttattccagtc 
atctctttatgaatcaaatgtgaggggctgctttgtggacggagtcctttgcaagagcac 
atcaacgggaaagagaaagagacattcacttggagggctcttgctgaaaatgggtttaac 
tctccttttgccagtcaccaccagcctgacctcatacacttttagtacaatggagtggct 
gagcctttgagcacaccaccattacatcatcgtggcaaattaaagaaggaggtgggaaaa 
gaggacttattgttgtcatggcccatgagatgattggaactcaaattgttactgagaggt 
tggtggctctgctggaaagtggaacggaaaaagtgctgctaattgatagccggccatttg 
tggaatacaatacatcccacattttggaagccattaatatcaactgctccaagcttatga 
agcgaaggttgcaacaggacaaagtgttaattacagagctcatccagcattcagcgaaac 
ataaggttgacattgattgcagtcagaaggttgtagtttacgatcaaagctcccaagatg 
ttgcctctctctcttcagactcttcagcatgtaattggggaaaccctcgcgtctactagg 
ggtgatacagatggtgattttaaagagcaaaactagacttctatgtgagaagtgctggaa 
aatgatttaggacatgtgtaaagttagatggaaagactgtaaatgtttaatatgaatata 
gtgttcttttgaagtaaggccagctgttgaacggttaaactgtgcatttctcattttgat 
gtgtcatgtatgttaatgtatgaaatgattaaataaaatcaaaactggtacctgtttata 
cataaatacgagaaaagacctatctttgcagccataaactcggtgggaacaccaccactc 
aagttgccaaaggaggcagtggtgaaacctgtcctgttctcacttaaatgaggatttagc 
tcaaaataaagtggtggtgtcatcaggtttattccgtgttctgtcattcacatggaacac 
cggatgattagctaacagtttagtgccagccttcattctttactgtgtacgttaaatgca 
cactacagtgaaaaagcctaagacacttggtaaatattttctagctgactgattccagaa 
cacacaagtacatggtaggaaaaaaaaggccagctacacactcatggcccatttcgatat 
aacatgtaaatgatgttacgattcctacaggaaaaacaggaagggcaattgttgaagagc 
tgtcctttgtccttgccacatacatctcaacaacctctctgctcatgacactagatttga 
tgacttatgagactagaaacttagtctcatcagtttcataaaaattaggcattgaaggaa 
taaagcttcagtgcaaacagatggaatattgactacagaaagaatcacaggtgaccttga 
agttggttgagttgaagttacatttaagaaaatttataagaaagattgggccgggtgagc 
tagactagaagagctatcttggccaggaccctgcagaaaaagaggtggaagtgagataga 
ggaataaagggaattagaataaatcagcctccagagatagagggtaacaggaggaacagt 
tgtttcagtgagaggcttccaatgagaagatgcagatcttgggagagcctgaagttcagg 
ccagttgctgacagtgctc 

SEQ ID NO: 311. >T23935_P1 # TY Protein # CC #LN 665 # Source Gene: T23935 

# Encoding Transcript: 1 

MAHEMI GTQI VTERLVALLESGTEKVLLI DSRPFVEYNTS HI LEAININCS KLMKRRLQQ 

DKVLITELIQHSAKHKVDIDCSQKVWYDQSSQDVASLSSDCFLTVLLGKLEKSFNSVHL 

LAGGFAEFSRCFPGLCEGKSTLVPTCI SQPCLPVANIGPTRI LPNLYLGCQRDVLNKELM 

QQNGIGYVLNASNTCPKPDFIPESHFLRVPVNDSFCEKILPWLDKSVDFIEKAECASNGCV 

LVHCLAGISRSATIAIAYIMKRMDMSLDEAYRFVKEKRPTISPNFNFLGQLLDYEKKIKN 

QTGASGPKSKLKLLHLEKPNEPVPAVSEGGQKSETPLSPPCADSATSEAAGQRPVHPASV 

PSVPSVQPSLLEDS PLVQALSGLHLSADRLEDSNKLKRSFSLDIKSVSYSASMAASLHGF ' 

SSSEDALEYYKPSTTLDGTNKLCQFSPVQEL 

SEQTPETSPDKEEASIPKKLQTARPSDSQ 
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SKRLHSVRTSSSGTAQRSLLSPLHRSGSVEDNYHTSFLFGLSTSQQHLTKSAGLGLKGWH 
SDILAPQTSTPSLTSSWYFATESSHFYSASAIYGGSASYSAYSCSQLPTCGDQVYSVRRR 
QKPSDRADSRRSWHEESPFEKQFKRRSCQMEFGESIMSENRSREELGKVGSQSSFSGSME 
IIEVS 

SEQ ID NO: 312. >T23935_P6 # TY Protein # CC #LN 143 # Source Gene: T23935 

# Encoding Transcript: 3 

MAHEMIGTQIVTERLVALLESGTEKVLLIDSRPFVEYNTSHILEAININCSKLMKRRLQQ 
DKVLITELIQHSAKHKVDIDCSQKWVYDQSSQDVASLSSDCFLTVLLGKLEKSFNSVHL 
LAGADAAEWDWLCVKCQQYLSKA 

SEQ ID NO: 313. >T23935_P3 # TY Protein # CC #LN 594 # Source Gene: T23935 

# Encoding Transcript: 4 

MFAFQVDIDCSQKVWYDQSSQDVASLSSDCFLTVLLGKLEKSFNSVHLLAGGFAEFSRC 
FPGLCEGKSTLVPTCISQPCLPVANIGPTRILPNLYLGCQRDVLNKELMQQNGIGYVLNA 
SNTCPKPDFIPESHFLRVPVNDSFCEKILPWLDKSVDFIEKAKASNGCVLVHCLAGISRS 
ATIAIAYIMKRMDMSLDEAYRFVKEKRPTISPNFNFLGQLLDYEKKIKNQTGASGPKSKL 
KLLHLEKPNEPVPAVSEGGQKSETPLSPPCADSATSEAAGQRPVHPASVPSVPSVQPSLL 
EDS PLVQALSGLHLSADRLEDSNKLKRS FS LDI KS VS YSASMAASLHG FS S S E DALE Y YK 
PSTTLDGTN KLCQ FS P VQEL 

SEQTPETSPDKEEASIPKKLQTARPSDSQSKRLHSVRTSS 

SGTAQRSLLSPLHRSGSVEDNYHTSFLFGLSTSQQHLTKSAGLGLKGWHSDILAPQTSTP 
SLTSSWYFATESSHFYSASAIYGGSASYSAYSCSQLPTCGDQVYSVRRRQKPSDRADSRR 
SWHEESPFEKQFKRRSCQMEFGESIMSENRSREELGKVGSQSSFSGSMEIIEVS 

SEQ ID NO: 314 . >T23935_P4 # TY Protein # CC #LN 524 # Source Gene: T23935 

# Encoding Transcript: 12 

MAHEMIGTQIVTERLVALLESGTEKVLLIDSRPFVEYNTSHILEAININCSKLMKRRLQQ 
DKVLITELIQHSAKHKVDIDCSQKVVVYDQSSQDVASLSSDCFLTVLLGKLEKSFNSVHL 
LAGGFAEFSRCFPGLCEGKSTLVPTCISQPCLPVANIGPTRILPNLYLGCQRDVLNKELM 
QQNGIGYVLNASNTCPKPDFIPESHFLRVPVNDSFCEKILPWLDKSVDFIEKAKASNGCV 
LVHCLAGISRSATIAIAYIMKRMDMSLDEAYRFVKEKRPTISPNFNFLGQLLDYEKKIKN 
QTGASGPKSKLKLLHLEKPNEPVPAVSEGGQKSETPLSPPCADSATSEAAGQRPVHPAST 
GWKTAISSSVPSLWISNQFHIQPAWQHPYMASPHQKMLWNTTNLPLLWMGPTSYASSPLF 
RNYRSRLPKPVLIRRKPASPRSCRPPGLQTARASDCIRSEPAAVAPPRGPFYLHCIEVGA 
WRTITTPAS FSAFPPASSTSRSLLAWALRAGTRISWPPRPLPLP 

SEQ ID NO: 315. >T23935_P5 # TY Protein # CC #LN 113 # Source Gene: T23935 

# Encoding Transcript: 15 

MAHEMIGTQIVTERLVALLESGTEKVLLIDSRPFVEYNTSHILEAININCSKLMKRRLQQ 
DKVLITELIQHSAKHKVDIDCSQKVWYDQSSQDVASLSSDSSACNWGNPRVY 



SEQ ID NO: 316. >T60764 # TY Consensus # Length 11053 # Number of exons 33 

cccggcgcccgggcctgaggactggctcggcggagggagaagaggaaacagacttgagca 

gctccccgttgtctcgcaactccactgccgaggaactctcatttcttccctcgctccttc 

accccccacctcatgtagaagggtgctgaggcgtcgggagggaggaggagcctgggctac 

cgtccctgccctccccacccccttcccggggcgctttggtgggcgtggagttggggttgg 

gggggtgggtgggggttgctttttggagtgctggggaacttttttcccttcttcaggtca 

ggggaaagggaatgcccaattcagagagacatgggggcaagaaggacgggagtggaggag 

cttctggaactttgcagccgtcatcgggaggcggcagctctaacagcagagagcgtcacc 

gcttggtatcgaagcacaagcggcataagtccaaacactccaaagacatggggttggtga 

cccccgaagcagcatccctgggcacagttatcaaacctttggtggagtatgatgatatca 
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gctctgattccgacaccttctccgatgacatggccttcaaactagaccgaagggagaacg 
acgaacgtcgtggatcagatcggagcgaccgcctgcacaaacatcgtcaccaccagcaca 
ggcgttcccgggacttactaaaagctaaacagaccgaaaaagaaaaaagccaagaagtct 
ccagcaagtcgggatcgatgaaggaccggatatcgggaagttcaaagcgttcgaatgagg 
agactgatgactatgggaaggcgcaggtagccaaaagcagcagcaaggaatccaggtcat 
ccaagctccacaaggagaagaccaggaaagaacgggagctgaagtctgggcacaaagacc 
ggagtaaaagtcatcgaaaaagggaaacacccaaaagttacaaaacagtggacagcccaa 
aacggagatccaggagcccccacaggaagtggtctgacagctccaaacaagatgatagcc 
cctcgggagcttcttatggccaagattatgaccttagtccctcacgatctcatacctcga 
gcaattatgactcctacaagaaaagtcctggaagtacctcgagaaggcagtcggtcagtc 
ccccttacaaggagccttcggcctaccagtccagcacccggtcaccgagcccctacagta 
ggcgacagagatctgtcagtccctatagcaggagacggtcgtccagctacgaaagaagtg 
gctcttacagcgggcgatcgcccagtccctatggtcgaaggcggtccagcagccctttcc 
tgagcaagcggtctctgagtcggagtccactccccagtaggaaatccatgaagtccagaa 
gtagaagtcctgcatattcaagacattcatcttctcatagtaaaaagaagagatccagtt 
cacgcagtcgtcattccagtatctcacctgtcaggcttccacttaattccagtctgggag 
ctgaactcagtaggaaaaagaaggaaagagcagctgctgctgctgcagcaaagatggatg 
gaaaggagtccaagggttcacctgtatttttgcctagaaaagagaacagttcagtagagg 
ctaaggattcaggtttggagtctaaaaagttacccagaagtgtaaaattggaaaaatctg 
ccccagatactgaactggtgaatgtaacacatctaaacacagaggtaaaaaattcttcag 
atacagggaaagtaaagttggatgagaactccgagaagcatcttgttaaagatttgaaag 
cacagggaacaagagactctaaacccatagcactgaaagaggagattgttactccaaagg 
agacagaaacatcagaaaaggagacccctccacctcttcccacaattgcttctcccccac 
cccctctaccaactactacccctccacctcagacaccccctttgccacctttgcctccaa 
taccagctcttccacagcaaccacctctgcctccttctcagccagcatttagtcaggttc 
ctgcttccagtacttcaactttgcccccttctactcactcaaagacatctgctgtgtcct 
ctcaggcaaattctcagccccctgtacaggtttctgtgaagactcaagtatctgtaacag 
ctgctattccacacctgaaaacttcaacgttgcctcctttgcccctcccacccttattac 
ctggagatgatgacatggatagtccaaaagaaactcttccttcaaaacctgtgaagaaag 
agaaggaacagaggacacgtcacttactcacagaccttcctctccctccagagctccctg 
gtggagatctgtctcccccagactctccagaaccaaaggcaatcacaccacctcagcaac 
catataaaaagagaccaaaaatttgttgtcctcgttatggagaaagaagacaaacagaaa 
gcgactgggggaaacgctgtgtggacaagtttgacattattgggattattggagaaggaa 
cctatggccaagtatataaagccaaggacaaagacacaggagaactagtggctctgaaga 
aggtgagactagacaatgagaaagagggcttcccaatcacagccattcgtgaaatcaaaa 
tccttcgtcagttaatccaccgaagtgttgttaacatgaaggaaattgtcacagataaac 
aagatgcactggatttcaagaaggacaaaggtttattattaggaaggactaattattcaa 
ctactcttcgtttcagtttcttcatctgtaaaattcaagggttttttctagagaacctgt 
ggtcaactccaaaattatattaggacttgaggcattgttatttcagcattcttttttttt 
tcttgcttttaattttttttcgtctttatgtaggtgccttttaccttgtatttgagtata 
tggaccatgacttaatgggactgctagaatctggtttggtgcacttttctgaggaccata 
tcaagtcgttcatgaaacagctaatggaaggattggaatactgtcacaaaaagaatttcc 
tgcatcgggatattaagtgttctaacattttgctgaataacaggtaacatagtaaccaaa 
taagattaagcactttcctcttctcctctgacctttttagtttcaaatggttaattggta 
ttataattagacctaatagtgcagtattcacacactatttagtgatatttgggcaaatca 
aactagcagattttggacttgctcggctctataactctgaagagagtcgcccttacacaa 
acaaagtcattactttgtggtaccgacctccagaactactgctaggagaggaacgttaca 
caccagccatagatgtttggagctgtggatgtattcttggggaactattcacaaagaagc 
ctatttttcaagccaatctggaactggctcagctagaactgatcagccgactttgtggta 
gcccttgtccagctgtgtggcctgatgttatcaaactgccctacttcaacaccatgaaac 
cgaagaagcaatatcgaaggcgtctacgagaagaattctctttcattccttctgcagcac 
ttgatttattggaccacatgctgacactagatcctagtaagcggtgcacagctgaacaga 
ccctacagagcgacttccttaaagatgtcgaactcagcaaaatggctcctccagacctcc 
cccactggcaggattgccatgagttgtggagtaagaaacggcgacgtcagcgacaaagtg 
gtgttgtagtcgaagagccacctccatccaaaacttctcgaaaagaaactacctcaggga 
caagtactgagcctgtgaagaacagcagcccagcaccacctcagcctgctcctggcaagg 
tggagtctggggctggggatgcaataggccttgctgacatcacacaacagctgaatcaaa 
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gtgaattggcagtgttattaaacctgctgcagagccaaaccgacctgagcatccctcaaa 
tggcacagctgcttaacatccactccaacccagagatgcagcagcagctggaagccctga 
accaatccatcagtgccctgacggaagctacttcccagcagcaggactcagagaccatgg 
ccccagaggagtctttgaaggaagcaccctctgccccagtgatcctgccttcagcagaac 
agacgacccttgaagcttcaagcacaccagctgacatgcagaatatattggcagttctct 
tgagtcagctgatgaaaacccaagagccagcaggcagtctggaggaaaacaacagtgaca 
agaacagtgggccacaggggccccgaagaactcccacaatgccacaggaggaggcagcag 
catgtcctcctcacattcttccaccagagaagaggccccctgagccccccggacctccac 
cgccgccacctccaccccctctggttgaaggcgatctttccagcgccccccaggagttga 
acccagccgtgacagccgccttgctgcaacttttatcccagcctgaagcagagcctcctg 
gccacctgccacatgagcaccaggccttgagaccaatggagtactccacccgaccccgtc 
caaacaggacttatggaaacactgatgggcctgaaacagggttcagtgccattgacactg 
atgaacgaaactctggtccagccttgacagaatccttggtccagaccctggtgaagaaca 
ggaccttctcaggctctctgagccaccttggggagtccagcagttaccagggcacagggt 
cagtgcagtttccaggggaccaggacctccgttttgccagggtccccttagcgttacacc 
cggtggtcgggcaaccattcctgaaggctgagggaagcagcaattctgtggtacatgcag 
agaccaaattgcaaaactatggggagctggggccaggaaccactggggccagcagctcag 
gagcaggccttcactgggggggcccaactcagtcttctgcttatggaaaactctatcggg 
ggcctacaagagtcccaccaagagggggaagagggagaggagttccttactaacccagag 
acttcagtgtcctgaaagattcctttcctatccatccttccatccagttctctgaatctt 
taatgaaatcatttgccagagcgaggtaatcatctgcatttggctactgcaaagctgtcc 
gttgtattccttgctcacttgctactagcaggcgacttacgaaataatgatgttggcacc 
agttccccctggatgggctatagccagaacatttacttcaactctaccttagtagataca 
agtagagaatatggagaggatcattacattgaaaagtaaatgttttattagttcattgcc 
tgcacttactgatcggaagagagaaagaacagtttcagtattgagatggctcaggagagg 
ctctttgatttttaaagttttggggtggggggttgtgtgtggtttctttcttttgaattt 
taatttaggtgttttgggtttttttcctttaaagagaatagtgttcacaaaatttgagct 
gctctttggcttttgctataagggaaacagagtggcctggctgatttgaataaatgtttc 
tttcctctccaccatctcacattttgcttttaagtgaacactttttccccattgagcatc 
ttgaacatactttttttccaaataaattactcatccttaaagtttactccactttgacaa 
aagatacgcccttctccctgcacataaagcaggttgtagaacgtggcattcttgggcaag 
taggtagactttacccagtctctttccttttttgctgatgtgtgctctctctctctcttt 
ctctctctctctctctctctctctctctctgtctgtctcgcttgctcgctctcgctgttt 
ctctctctttgaggcatttgtttggaaaaaatcgttgagatgcccaagaacctgggataa 
ttctttactttttttgaaataaaggaaaggaaattcagactcttacattgttctctgtaa 
ctcttcaattctaaaatgttttgttttttaaaccatgttctgatggggaagttgatttgt 
aagtgtggacagcttggacattgctgctgagctgtggttagagatgatgcctccattcct 
agagggctaataacagcatttagcatattgtttacacatatatttttatgtcaaaaaaaa 
aacaaaaacctttcaaacagagcattgtgatattgtcaaagagaaaaacaaatcctgaag 
atacatggaaatgtaacctagtttagggtgggtatttttctgaagatacatcaatacctg 
accttttttaaaaaaataattttaaaacagcatactgtgaggaagaacagtattgacata 
cccacatcccagcatgtgtaccctgccagttcttttagggatttttcctccaaagagatt 
tggatttggttttggtaaaaggggttaaattgtgcttccaggcaagaactttgccttatc 
ataaacaggaaatgaaaaagggaagggctgtcaggatgggataatttgggaggcttctca 
ttctggcttctatttctatgtgagtaccagcatatagagtgttttaaaaacagatacatg 
tcatataatttatctgcacagacttagaccttcaggaaacataggttaagccccctttta 
caaagaaaaagtaaacatacttcagcatcttggagggtagttttcaaaactcaagtttca 
tgtttcaatgccaagttcttattttaaaaaataaaatctacttataagagaaaggtgcat 
tacttaaaaaaaaaaaaactttaaagaaatgaaagaagaaccctcttcagatacttactt 
gaagactgttttcccctgttaatgagatatagctagatatcggtgtgtgtatttctttat 
tattctctggtttttgatctggccttgcctccagggccaaacactgatttagaaagagag 
ccttctagctattttggcattgatggctttttataccagtgtgtccagttagatttacta 
ggcttactgacatgctattggtaaatcgcattaaagttcatctgaaccttctgtctgttg 
acttcttagtcctcagacatgggcctttgtgttttagaatatttgaatttgagttattgg 
gccccactccctgttttttattaaagaacgtgagcctgggatactttcagaagtatctgt 
tcaatgaaaaaaagttggtttcccatcaaatatgaataaaattctctatatatttcattg 
tattttggttatcagcagtcatcaataatgtttttccctcccctctcccacctcttattt 



192/365 



Figure 1 (Cont'd) 



ttaattatgccaaatatcctaaataatatacttaagcctccattccctcatccctactag 
ggaagggggtgagtgtatgtgtgagtgtatgtgtatgtatgatcccatctcacccccacc 
cccattttgggagtcttttaaaatgaaaacaaagtttggtagttttgactatttctaaaa 
gcagaggagaaaaaaaaacttatttaaatatcctggaatctgtatggaggaagaaaaggt 
atttgttaatttttcagttacgttatctataaacatgatggaagtaaaggtttggcagaa 
tttcaccttgactatttgaaaattacagacccaattaattccattcaaaagtggttttcg 
ttttgttttaattattgtacaatgagagatattgtctattaaatacattattttgaacag 
atgagaaatctgattctgttcatgagtgggaggcaaaactggtttgaccgtgatcatttt 
tgtggttttgaaaacaaatatacttgacccagtttccttagttttttcttcaactgtcca 
taggaacgataagtatttgaaagcaacatcaaatctatacgtttaaagcagggcagttag 
cacaaatttgcaagtagaacttctattagcttatgccatagaaaaaaaaaaaaaaaaaaa 
aaaaaaaaaattttgttggcgcgccgcgcctcggagcccccgagaaagaagctcttttaa 
acacccattctggttgtgggcgcgccgcgcggcgccccagtgtaaggttactgtgtgaac 
acagtgggttccatcagccgtgtgttcgccttcaaggagagaactcaggctgggtcctga 
gaacatgaggtgcttgtgggattttttcccaccacatattaaaaaacaccgcccgcgggg 
gaacaacgagcgaaggtgttttcgtgaaacaaatcttcctcagaaggagagttcccggaa 
gggcttaacatcacggggatgtccttttaaaaaggagggtttccacgacagagctcaaat 
ttttctatttacccacatcacccaaccacttgtatgtgtgtgtgtatatataatatgcat 
atatagttaccgtgctaaaatggttaccagcaggttttgagagagaatgctgcatcagaa 
aagtgtcagttgccacctcattctccctgatttaggttcctgacactgattcctttctct 
ctcgtttttgacccccattgggtgtatcttgtctatgtacagatattttgtaatatatta 
aatttttttctttcagtttataaaaatggaaagtggagattggaaaattaaatatttcct 
gttactataccacttttgctccattgcatttacttcttaatctgtaccccctgagcatat 
ctaatcatgtataaaggacgtttttcctccactttatcttaggggttctctgtctcagaa 
tcattatagactcattaactccccctcccagcaaaaggttatcaggatttgaagaggtgc 
ttgaaaacgctagacctaggaactagagaataaatgagttgggaaaaaccatgaaatgtg 
atttttttaaagtagaaaagttatacaaataatggtaccaaaccatcaaaagagttgagc 
ttcatgtaccctgactcctcctgacaggagaggtaagtgggtttgagctcaactgtcatc 
aagggaagttggtaagaggctgtttagacccaaaggatagtcttaaaccagacttcacca 
cccaccctacctcagttcccatgttattacatgcagagtcagcatggggattagtgtacc 
tacctttgctgagatttcccgatgcgttgccaatccagaaagtgaatcaaaaagttgttt 
aaaagttaaaatctctattgtttccaaaatctttcccatctccacctgaagacagaattg 
cttccccttctcgctgctgatgaagaaatagatcccagtacatatcctttacgttttgtt 
catcaactttgatagacagcatagaggaaatgtggaactggtctccatgtaaatttgtct 
ctagggaatgtatgatagacctacttgtccagattacattaagtttgctgtcctgcaaac 
ctcctttctcacagtcaggatccttttttgccttagagtccttcctgaaggagctgttat 
ggatgagagagccacaagtccaacctcatgatctgatctaaaactgggtggcctcctccc 
agtcaccttcttttattagccttatattaaccattaaccttcattgatctttatgattct 
gggtttcacagcttctgtttctgttgaattccctaactttaagaaccatgtttttgcttt 
tatcagcctgggagctaggtccctggagacaagaattttagaacagttatgactctctct 
gggagtgtactaaggttagtctgaatccttctggggaattcaggtttgaaggagttagaa 
gagtgctaacatcaccgtcactgttacctaccaatggatgcttcctgggtgcaagaccaa 
agatgttctttggtgtctagtcccttgtctcatgtctaatgtaacctgccattttccttt 
acaggtaggagcaacggtgggaatgccctctgaggaatgcagtgatgtccatcagtcacc 
tctcacccagagcctggcaaattcctctgtctgtctcgcgcgcgctctctcttgttctct 
ctctggagggtgtgtgtggaaagatctcggagagatgcgcgaagagcgaggagatattta 
ctatagttagtgagaatataagaggaaaagatataccagactcttagtattgttctgctg 
tatctcgtaaaactactagagtggttgtgttttagaacagctatgttgtgatggggatgt 
ttgttgtgtgatgtgtggagcgctgtggatattgatgccagactgaggatagagaagatg 
ctatcatatagagagggggaacgaacgattgcagatagtggacacatatatttttgttgt 
gtaaaaaaaaaaaaggttttaaaaacaaggcgagttgggattgttgcaagagaaaaacca 
tcgtcgaatcttcgggaatgtaccatcggcgggggtattggaaagaacaaactgcgggtt 
gtaaaacttttacacggggtggggagacaatttttgttcctctgagttgccggagcttga 
gggttcccaaaatgggttgtgaagggatatgtcacacacctcttttaaaaaagggggggg 
gtatgggctcttctttgaaatgagaattcacgaaaaaagggcatctggctgcagtgaggc 
aggcataggtgcaccctctgcagagaagctctcagcttcaatcctcttatcctcaatgta 
ggccacggccccctcagaagcccatgggaatctctgcaaggtggctttgtctcatctgcg 
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gaggtggctgatgcgatcggatcatgctggggccttggaggcctctggagttgggcaggc 
ctctggagttgggtactcatccttaaagtttactccacttggacaaagataggcccttct 
ccctggacataaagcaggttgacaccgtggctttcttggggccagtagtacgatttaacc 
aagttcttacttatggcggagggggcacgccaccacattgcgaaactaacactcaaaaca 
ggagagagatgcc 

SEQ ID NO: 317. >T60764_T1 # TY Transcript « LN 7980 # Source Gene: T60764 
# Encoded protein: T60764_P1 

cccggcgcccgggcctgaggactggctcggcggagggagaagaggaaacagacttgagca 

gctccccgttgtctcgcaactccactgccgaggaactctcatttcttccctcgctccttc 

accccccacctcatgtagaagggtgctgaggcgtcgggagggaggaggagcctgggctac 

cgtccctgccctccccacccccttcccggggcgctttggtgggcgtggagttggggttgg 

gggggtgggtgggggttgctttttggagtgctggggaacttttttcccttcttcaggtca 

ggggaaagggaatgcccaattcagagagacatgggggcaagaaggacgggagtggaggag 

cttctggaactttgcagccgtcatcgggaggcggcagctctaacagcagagagcgtcacc 

gcttggtatcgaagcacaagcggcataagtccaaacactccaaagacatggggttggtga ' 

cccccgaagcagcatccctgggcacagttatcaaacctttggtggagtatgatgatatca 

gctctgattccgacaccttctccgatgacatggccttcaaactagaccgaagggagaacg 

acgaacgtcgtggatcagatcggagcgaccgcctgcacaaacatcgtcaccaccagcaca 

ggcgttcccgggacttactaaaagctaaacagaccgaaaaagaaaaaagccaagaagtct 

ccagcaagtcgggatcgatgaaggaccggatatcgggaagttcaaagcgttcgaatgagg 

agactgatgactatgggaaggcgcaggtagccaaaagcagcagcaaggaatccaggtcat 

ccaagctccacaaggagaagaccaggaaagaacgggagctgaagtctgggcacaaagacc 

ggagtaaaagtcatcgaaaaagggaaacacccaaaagttacaaaacagtggacagcccaa 

aacggagatccaggagcccccacaggaagtggtctgacagctccaaacaagatgatagcc 

cctcgggagcttcttatggccaagattatgaccttagtccctcacgatctcatacctcga 

gcaattatgactcctacaagaaaagtcctggaagtacctcgagaaggcagtcggtcagtc 

ccccttacaaggagccttcggcctaccagtccagcacccggtcaccgagcccctacagta 

ggcgacagagatctgtcagtccctatagcaggagacggtcgtccagctacgaaagaagtg 

gctcttacagcgggcgatcgcccagtccctatggtcgaaggcggtccagcagccctttcc 

tgagcaagcggtctctgagtcggagtccactccccagtaggaaatccatgaagtccagaa 

gtagaagtcctgcatattcaagacattcatcttctcatagtaaaaagaagagatccagtt 

cacgcagtcgtcattccagtatctcacctgtcaggcttccacttaattccagtctgggag 

ctgaactcagtaggaaaaagaaggaaagagcagctgctgctgctgcagcaaagatggatg 

gaaaggagtccaagggttcacctgtatttttgcctagaaaagagaacagttcagtagagg 

ctaaggattcaggtttggagtctaaaaagttacccagaagtgtaaaattggaaaaatctg 

ccccagatactgaactggtgaatgtaacacatctaaacacagaggtaaaaaattcttcag 

atacagggaaagtaaagttggatgagaactccgagaagcatcttgttaaagatttgaaag 

cacagggaacaagagactctaaacccatagcactgaaagaggagattgttactccaaagg 

agacagaaacatcagaaaaggagacccctccacctcttcccacaattgcttctcccccac 

cccctctaccaactactacccctccacctcagacaccccctttgccacctttgcctccaa 

taccagctcttccacagcaaccacctctgcctccttctcagccagcatttagtcaggttc 

ctgcttccagtacttcaactttgcccccttctactcactcaaagacatctgctgtgtcct 

ctcaggcaaattctcagccccctgtacaggtttctgtgaagactcaagtatctgtaacag 

ctgctattccacacctgaaaacttcaacgttgcctcctttgcccctcccacccttattac 

ctggagatgatgacatggatagtccaaaagaaactcttccttcaaaacctgtgaagaaag 

agaaggaacagaggacacgtcacttactcacagaccttcctctccctccagagctccctg 

gtggagatctgtctcccccagactctccagaaccaaaggcaatcacaccacctcagcaac 

catataaaaagagaccaaaaatttgttgtcctcgttatggagaaagaagacaaacagaaa 

gcgactgggggaaacgctgtgtggacaagtttgacattattgggattattggagaaggaa 

cctatggccaagtatataaagccaaggacaaagacacaggagaactagtggctctgaaga 

aggtgagactagacaatgagaaagagggcttcccaatcacagccattcgtgaaatcaaaa 

tccttcgtcagttaatccaccgaagtgttgttaacatgaaggaaattgtcacagataaac 

aagatgcactggatttcaagaaggacaaaggtgccttttaccttgtatttgagtatatgg 

accatgacttaatgggactgctagaatctggtttggtgcacttttctgaggaccatatca 

agtcgttcatgaaacagctaatggaaggattggaatactgtcacaaaaagaatttcctgc 

atcgggatattaagtgttctaacattttgctgaataacagtgggcaaatcaaactagcag 
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attttggacttgctcggctctataactctgaagagagtcgcccttacacaaacaaagtca 
ttactttgtggtaccgacctccagaactactgctaggagaggaacgttacacaccagcca 
tagatgtttggagc'tgtggatgtattcttggggaactattcacaaagaagcctatttttc 
aagccaatctggaactggctcagctagaactgatcagccgactttgtggtagcccttgtc 
cagctgtgtggcctgatgttatcaaactgccctacttcaacaccatgaaaccgaagaagc 
aatatcgaaggcgtctacgagaagaattctctttcattccttctgcagcacttgatttat 
tggaccacatgctgacactagatcctagtaagcggtgcacagctgaacagaccctacaga 
gcgacttccttaaagatgtcgaactcagcaaaatggctcctccagacctcccccactggc 
aggattgccatgagttgtggagtaagaaacggcgacgtcagcgacaaagtggtgttgtag 
tcgaagagccacctccatccaaaacttctcgaaaagaaactacctcagggacaagtactg 
agcctgtgaagaacagcagcccagcaccacctcagcctgctcctggcaaggtggagtctg 
gggctggggatgcaataggccttgctgacatcacacaacagctgaatcaaagtgaattgg 
cagtgttattaaacctgctgcagagccaaaccgacctgagcatccctcaaatggcacagc 
tgcttaacatccactccaacccagagatgcagcagcagctggaagccctgaaccaatcca 
tcagtgccctgacggaagctacttcccagcagcaggactcagagaccatggccccagagg 
agtctttgaaggaagcaccctctgccccagtgatcctgccttcagcagaacagacgaccc 
ttgaagcttcaagcacaccagctgacatgcagaatatattggcagttctcttgagtcagc 
tgatgaaaacccaagagccagcaggcagtctggaggaaaacaacagtgacaagaacagtg 
ggccacaggggccccgaagaactcccacaatgccacaggaggaggcagcagagaagaggc 
cccctgagccccccggacctccaccgccgccacctccaccccctctggttgaaggcgatc 
tttccagcgccccccaggagttgaacccagccgtgacagccgccttgctgcaacttttat 
cccagcctgaagcagagcctcctggccacctgccacatgagcaccaggccttgagaccaa 
tggagtactccacccgaccccgtccaaacaggacttatggaaacactgatgggcctgaaa 
cagggttcagtgccattgacactgatgaacgaaactctggtccagccttgacagaatcct 
tggtccagaccctggtgaagaacaggaccttctcaggctctctgagccaccttggggagt 
ccagcagttaccagggcacagggtcagtgca^gtttccaggggaccaggacctccgttttg 
ccagggtccccttagcgttacacccggtggtcgggcaaccattcctgaaggctgagggaa 
gcagcaattctgtggtacatgcagagaccaaattgcaaaactatggggagctggggccag 
gaaccactggggccagcagctcaggagcaggccttcactgggggggcccaactcagtctt 
ctgcttatggaaaactctatcgggggcctacaagagtcccaccaagagggggaagaggga 
gaggagttccttactaacccagagacttcagtgtcctgaaagattcctttcctatccatc 
cttccatccagttctctgaatctttaatgaaatcatttgccagagcgaggtaatcatctg 
catttggctactgcaaagctgtccgttgtattccttgctcacttgctactagcaggcgac 
ttacgaaataatgatgttggcaccagttccccctggatgggctatagccagaacatttac 
ttcaactctaccttagtagatacaagtagagaatatggagaggatcattacattgaaaag 
taaatgttttattagttcattgcctgcacttactgatcggaagagagaaagaacagtttc 
agtattgagatggctcaggagaggctctttgatttttaaagttttggggtggggggttgt 
gtgtggtttctttcttttgaattttaatttaggtgttttgggtttttttcctttaaagag 
aatagtgttcacaaaatttgagctgctctttggcttttgctataagggaaacagagtggc 
ctggctgatttgaataaatgtttctttcctctccaccatctcacattttgcttttaagtg 
aacactttttccccattgagcatcttgaacatactttttttccaaataaattactcatcc 
ttaaagtttactccactttgacaaaagatacgcccttctccctgcacataaagcaggttg 
tagaacgtggcattcttgggcaagtaggtagactttacccagtctctttccttttttgct 
gatgtgtgctctctctctctctttctctctctctctctctctctctctctctctgtctgt 
ctcgcttgctcgctctcgctgtttctctctctttgaggcatttgtttggaaaaaatcgtt 
gagatgcccaagaacctgggataattctttactttttttgaaataaaggaaaggaaattc 
agactcttacattgttctctgtaactcttcaattctaaaatgttttgttttttaaaccat 
gttctgatggggaagttgatttgtaagtgtggacagcttggacattgctgctgagctgtg 
gttagagatgatgcctccattcctagagggctaataacagcatttagcatattgtttaca 
catatatttttatgtcaaaaaaaaaacaaaaacctttcaaacagagcattgtgatattgt 
caaagagaaaaacaaatcctgaagatacatggaaatgtaacctagtttagggtgggtatt 
tttctgaagatacatcaatacctgaccttttttaaaaaaataattttaaaacagcatact 
gtgaggaagaacagtattgacatacccacatcccagcatgtgtaccctgccagttctttt 
agggatttttcctccaaagagatttggatttggttttggtaaaaggggttaaattgtgct 
tccaggcaagaactttgccttatcataaacaggaaatgaaaaagggaagggctgtcagga 
tgggataatttgggaggcttctcattctggcttctatttctatgtgagtaccagcatata 
gagtgttttaaaaacagatacatgtcatataatttatctgcacagacttagaccttcagg 
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aaacataggttaagcccccttttacaaagaaaaagtaaacatacttcagcatcttggagg 
gtagttttcaaaactcaagtttcatgtttcaatgccaagttcttattttaaaaaataaaa 
tctacttataagagaaaggtgcattacttaaaaaaaaaaaaactttaaagaaatgaaaga 
agaaccctcttcagatacttacttgaagactgttttcccctgttaatgagatatagctag 
atatcggtgtgtgtatttctttattattctctggtttttgatctggccttgcctccaggg 
ccaaacactgatttagaaagagagccttctagctattttggcattgatggctttttatac 
cagtgtgtccagttagatttactaggcttactgacatgctattggtaaatcgcattaaag 
ttcatctgaaccttctgtctgttgacttcttagtcctcagacatgggcctttgtgtttta 
gaatatttgaatttgagttattgggccccactccctgttttttattaaagaacgtgagcc 
tgggatactttcagaagtatctgttcaatgaaaaaaagttggtttcccatcaaatatgaa 
taaaattctctatatatttcattgtattttggttatcagcagtcatcaataatgtttttc 
cctcccctctcccacctcttatttttaattatgccaaatatcctaaataatatacttaag 
cctccattccctcatccctactagggaagggggtgagtgtatgtgtgagtgtatgtgtat 
gtatgatcccatctcacccccacccccattttgggagtcttttaaaatgaaaacaaagtt 
tggtagttttgactatttctaaaagcagaggagaaaaaaaaacttatttaaatatcctgg 
aatctgtatggaggaagaaaaggtatttgttaatttttcagttacgttatctataaacat 
gatggaagtaaaggtttggcagaatttcaccttgactatttgaaaattacagacccaatt 
aattccattcaaaagtggttttcgttttgttttaattattgtacaatgagagatattgtc 
tattaaatacattattttgaacagatgagaaatctgattctgttcatgagtgggaggcaa 
aactggtttgaccgtgatcatttttgtggttttgaaaacaaatatacttgacccagtttc 
cttagttttttcttcaactgtccataggaacgataagtatttgaaagcaacatcaaatct 
atacgtttaaagcagggcagttagcacaaatttgcaagtagaacttctattagcttatgc 
catagacatcacccaaccacttgtatgtgtgtgtgtatatataatatgcatatatagtta 
ccgtgctaaaatggttaccagcaggttttgagagagaatgctgcatcagaaaagtgtcag 
ttgccacctcattctccctgatttaggttcctgacactgattcctttctctctcgttttt 
gacccccattgggtgtatcttgtctatgtacagatattttgtaatatattaaattttttt 
ctttcagtttataaaaatggaaagtggagattggaaaattaaatatttcctgttactata 

SEQ ID NO: 318. >T60764_T2 # TY Transcript # LN 8583 # Source Gene: T60764 
ft Encoded protein: T60764_P1 

cccggcgcccgggcctgaggactggctcggcggagggagaagaggaaacagacttgagca 
gctccccgttgtctcgcaactccactgccgaggaactctcatttcttccctcgctccttc 
accccccacctcatgtagaagggtgctgaggcgtcgggagggaggaggagcctgggctac 
cgtccctgccctccccacccccttcccggggcgctttggtgggcgtggagttggggttgg 
gggggtgggtgggggttgctttttggagtgctggggaacttttttcccttcttcaggtca 
ggggaaagggaatgcccaattcagagagacatgggggcaagaaggacgggagtggaggag 
cttctggaactttgcagccgtcatcgggaggcggcagctctaacagcagagagcgtcacc 
gcttggtatcgaagcacaagcggcataagtccaaacactccaaagacatggggttggtga 
cccccgaagcagcatccctgggcacagttatcaaacctttggtggagtatgatgatatca 
gctctgattccgacaccttctccgatgacatggccttcaaactagaccgaagggagaacg 
acgaacgtcgtggatcagatcggagcgaccgcctgcacaaacatcgtcaccaccagcaca 
ggcgttcccgggacttactaaaagctaaacagaccgaaaaagaaaaaagccaagaagtct 
ccagcaagtcgggatcgatgaaggaccggatatcgggaagttcaaagcgttcgaatgagg 
agactgatgactatgggaaggcgcaggtagccaaaagcagcagcaaggaatccaggtcat 
ccaagctccacaaggagaagaccaggaaagaacgggagctgaagtctgggcacaaagacc 
ggagtaaaagtcatcgaaaaagggaaacacccaaaagttacaaaacagtggacagcccaa 
aacggagatccaggagcccccacaggaagtggtctgacagctccaaacaagatgatagcc 
cctcgggagcttcttatggccaagattatgaccttagtccctcacgatctcatacctcga 
gcaattatgactcctacaagaaaagtcctggaagtacctcgagaaggcagtcggtcagtc 
ccccttacaaggagccttcggcctaccagtccagcacccggtcaccgagcccctacagta 
ggcgacagagatctgtcagtccctatagcaggagacggtcgtccagctacgaaagaagtg 
gctcttacagcgggcgatcgcccagtccctatggtcgaaggcggtccagcagccctttcc 
tgagcaagcggtctctgagtcggagtccactccccagtaggaaatccatgaagtccagaa 
gtagaagtcctgcatattcaagacattcatcttctcatagtaaaaagaagagatccagtt 
cacgcagtcgtcattccagtatctcacctgtcaggcttccacttaattccagtctgggag 
ctgaactcagtaggaaaaagaaggaaagagcagctgctgctgctgcagcaaagatggatg 
gaaaggagtccaagggttcacctgtatttttgcctagaaaagagaacagttcagtagagg 
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ctaaggattcaggtttggagtctaaaaagttacccagaagtgtaaaattggaaaaatctg 
ccccagatactgaactggtgaatgtaacacatctaaacacagaggtaaaaaattcttcag 
atacagggaaagtaaagttggatgagaactccgagaagcatcttgttaaagatttgaaag 
cacagggaacaagagactctaaacccatagcactgaaagaggagattgttactccaaagg 
agacagaaacatcagaaaaggagacccctccacctcttcccacaattgcttctcccccac 
cccctctaccaactactacccctccacctcagacaccccctttgccacctttgcctccaa 
taccagctcttccacagcaaccacctctgcctccttctcagccagcatttagtcaggttc 
ctgcttccagtacttcaactttgcccccttctactcactcaaagacatctgctgtgtcct 
ctcaggcaaattctcagccccctgtacaggtttctgtgaagactcaagtatctgtaacag 
ctgctattccacacctgaaaacttcaacgttgcctcctttgcccctcccacccttattac 
ctggagatgatgacatggatagtccaaaagaaactcttccttcaaaacctgtgaagaaag 
agaaggaacagaggacacgtcacttactcacagaccttcctctccctccagagctccctg 
gtggagatctgtctcccccagactctccagaaccaaaggcaatcacaccacctcagcaac 
catataaaaagagaccaaaaatttgttgtcctcgttatggagaaagaagacaaacagaaa 
gcgactgggggaaacgctgtgtggacaagtttgacattattgggattattggagaaggaa 
cctatggccaagtatataaagccaaggacaaagacacaggagaactagtggctctgaaga 
aggtgagactagacaatgagaaagagggcttcccaatcacagccattcgtgaaatcaaaa 
tccttcgtcagttaatccaccgaagtgttgttaacatgaaggaaattgtcacagataaac 
aagatgcactggatttcaagaaggacaaaggtgccttttaccttgtatttgagtatatgg 
accatgacttaatgggactgctagaatctggtttggtgcacttttctgaggaccatatca 
agtcgttcatgaaacagctaatggaaggattggaatactgtcacaaaaagaatttcctgc 
atcgggatattaagtgttctaacattttgctgaataacagtgggcaaatcaaactagcag 
attttggacttgctcggctctataactctgaagagagtcgcccttacacaaacaaagtca 
ttactttgtggtaccgacctccagaactactgctaggagaggaacgttacacaccagcca 
tagatgtttggagctgtggatgtattcttggggaactattcacaaagaagcctatttttc 
aagccaatctggaactggctcagctagaactgatcagccgactttgtggtagcccttgtc 
cagctgtgtggcctgatgttatcaaactgccctacttcaacaccatgaaaccgaagaagc 
aatatcgaaggcgtctacgagaagaattctctttcattccttctgcagcacttgatttat 
tggaccacatgctgacactagatcctagtaagcggtgcacagctgaacagaccctacaga 
gcgacttccttaaagatgtcgaactcagcaaaatggctcctccagacctcccccactggc 
aggattgccatgagttgtggagtaagaaacggcgacgtcagcgacaaagtggtgttgtag 
tcgaagagccacctccatccaaaacttctcgaaaagaaactacctcagggacaagtactg 
agcctgtgaagaacagcagcccagcaccacctcagcctgctcctggcaaggtggagtctg 
gggctggggatgcaataggccttgctgacatcacacaacagctgaatcaaagtgaattgg 
cagtgttattaaacctgctgcagagccaaaccgacctgagcatccctcaaatggcacagc 
tgcttaacatccactccaacccagagatgcagcagcagctggaagccctgaaccaatcca 
tcagtgccctgacggaagctacttcccagcagcaggactcagagaccatggccccagagg 
agtctttgaaggaagcaccctctgccccagtgatcctgccttcagcagaacagacgaccc 
ttgaagcttcaagcacaccagctgacatgcagaatatattggcagttctcttgagtcagc 
tgatgaaaacccaagagccagcaggcagtctggaggaaaacaacagtgacaagaacagtg 
ggccacaggggccccgaagaactcccacaatgccacaggaggaggcagcagagaagaggc 
cccctgagccccccggacctccaccgccgccacctccaccccctctggttgaaggcgatc 
tttccagcgccccccaggagttgaacccagccgtgacagccgccttgctgcaacttttat 
cccagcctgaagcagagcctcctggccacctgccacatgagcaccaggccttgagaccaa 
tggagtactccacccgaccccgtccaaacaggacttatggaaacactgatgggcctgaaa 
cagggttcagtgccattgacactgatgaacgaaactctggtccagccttgacagaatcct 
tggtccagaccctggtgaagaacaggaccttctcaggctctctgagccaccttggggagt 
ccagcagttaccagggcacagggtcagtgcagtttccaggggaccaggacctccgttttg 
ccagggtccccttagcgttacacccggtggtcgggcaaccattcctgaaggctgagggaa 
gcagcaattctgtggtacatgcagagaccaaattgcaaaactatggggagctggggccag 
gaaccactggggccagcagctcaggagcaggccttcactgggggggcccaactcagtctt 
ctgcttatggaaaactctatcgggggcctacaagagtcccaccaagagggggaagaggga 
gaggagttccttactaacccagagacttcagtgtcctgaaagattcctttcctatccatc 
cttccatccagttctctgaatctttaatgaaatcatttgccagagcgaggtaatcatctg 
catttggctactgcaaagctgtccgttgtattccttgctcacttgctactagcaggcgac 
ttacgaaataatgatgttggcaccagttccccctggatgggctatagccagaacatttac 
ttcaactctaccttagtagatacaagtagagaatatggagaggatcattacattgaaaag 
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taaatgtttt at tagt teat tgcctgcact tact gatcggaagagagaaagaacagtttc 
agtattgagatggctcaggagaggctctttgatttttaaagttttggggtggggggttgt 
gtgtggtttctttcttttgaattttaatttaggtgttttgggtttttttcctttaaagag 
aatagtgttcacaaaatttgagctgctctttggcttttgctataagggaaacagagtggc 
ctggctgatttgaataaatgtttctttcctctccaccatctcacattttgcttttaagtg 
aacactttttccccattgagcatcttgaacatactttttttccaaataaattactcatcc 
ttaaagtttactccactttgacaaaagatacgcccttctccctgcacataaagcaggttg 
tagaacgtggcattcttgggcaagtaggtagactttacccagtctctttccttttttgct 
gatgtgtgctctctctctctctttctctctctctctctctctctctctctctctgtctgt 
ctcgcttgctcgctctcgctgtttctctctctttgaggcatttgtttggaaaaaatcgtt 
gagatgcccaagaacctgggataattctttactttttttgaaataaaggaaaggaaattc 
agactcttacattgttctctgtaactcttcaattctaaaatgttttgttttttaaaccat 
gttctgatggggaagttgatttgtaagtgtggacagcttggacattgctgctgagctgtg 
gttagagatgatgcctccattcctagagggctaataacagcatttagcatattgtttaca 
catatatttttatgtcaaaaaaaaaacaaaaacctttcaaacagagcattgtgatattgt 
caaagagaaaaacaaatcctgaagatacatggaaatgtaacctagtttagggtgggtatt 
tttctgaagatacatcaatacctgaccttttttaaaaaaataattttaaaacagcatact 
gtgaggaagaacagtattgacatacccacatcccagcatgtgtaccctgccagttctttt 
agggatttttcctccaaagagatttggatttggttttggtaaaaggggttaaattgtgct 
tccaggcaagaactttgccttatcataaacaggaaatgaaaaagggaagggctgtcagga 
tgggataatttgggaggcttctcattctggcttctatttctatgtgagtaccagcatata 
gagtgttttaaaaacagatacatgtcatataatttatctgcacagacttagaccttcagg 
aaacataggttaagcccccttttacaaagaaaaagtaaacatacttcagcatcttggagg 
gtagttttcaaaactcaagtttcatgtttcaatgccaagttcttattttaaaaaataaaa 
tctacttataagagaaaggtgcattacttaaaaaaaaaaaaactttaaagaaatgaaaga 
agaaccctcttcagatacttacttgaagactgttttcccctgttaatgagatatagctag 
atatcggtgtgtgtatttctttattattctctggtttttgatctggccttgcctccaggg 
ccaaacactgatttagaaagagagccttctagctattttggcattgatggctttttatac 
cagtgtgtccagttagatttactaggcttactgacatgctattggtaaatcgcattaaag 
ttcatctgaaccttctgtctgttgacttcttagtcctcagacatgggcctttgtgtttta 
gaatatttgaatttgagttattgggccccactccctgttttttattaaagaacgtgagcc 
tgggatactttcagaagtatctgttcaatgaaaaaaagttggtttcccatcaaatatgaa 
taaaattctctatatatttcattgtattttggttatcagcagtcatcaataatgtttttc 
cctcccctctcccacctcttatttttaattatgccaaatatcctaaataatatacttaag 
cctccattccctcatccctactagggaagggggtgagtgtatgtgtgagtgtatgtgtat 
gtatgatcccatctcacccccacccccattttgggagtcttttaaaatgaaaacaaagtt 
tggtagttttgactatttctaaaagcagaggagaaaaaaaaacttatttaaatatcctgg 
aatctgtatggaggaagaaaaggtatttgttaatttttcagttacgttatctataaacat 
gatggaagtaaaggtttggcagaatttcaccttgactatttgaaaattacagacccaatt 
aattccattcaaaagtggttttcgttttgttttaattattgtacaatgagagatattgtc 
tattaaatacattattttgaacagatgagaaatctgattctgttcatgagtgggaggcaa 
aactggtttgaccgtgatcatttttgtggttttgaaaacaaatatacttgacccagtttc 
cttagttttttcttcaactgtccataggaacgataagtatttgaaagcaacatcaaatct 
atacgtttaaagcagggcagttagcacaaatttgcaagtagaacttctattagcttatgc 
catagacatcacccaaccacttgtatgtgtgtgtgtatatataatatgcatatatagtta 
ccgtgctaaaatggttaccagcaggttttgagagagaatgctgcatcagaaaagtgtcag 
ttgccacctcattctccctgatttaggttcctgacactgattcctttctctctcgttttt 
gacccccattgggtgtatcttgtctatgtacagatattttgtaatatattaaattttttt 
ctttcagtttataaaaatggaaagtggagattggaaaattaaatatttcctgttactata 
ccacttttgctccattgcatttacttcttaatctgtaccccctgagcatatctaatcatg 
tataaaggacgtttttcctccactttatcttaggggttctctgtctcagaatcattatag 
actcattaactccccctcccagcaaaaggttatcaggatttgaagaggtgcttgaaaacg 
ctagacctaggaactagagaataaatgagttgggaaaaaccatgaaatgtgattttttta 
aagtagaaaagttatacaaataatggtaccaaaccatcaaaagagttgagcttcatgtac 
cctgactcctcctgacaggagaggtaagtgggtttgagctcaactgtcatcaagggaagt 
tggtaagaggctgtttagacccaaaggatagtcttaaaccagacttcaccacccacccta 
cctcagttcccatgttattacatgcagagtcagcatggggattagtgtacctacctttgc 
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Figure 1 (Cont'd) 



tgagatttcccgatgcgttgccaatccagaaagtgaatcaaaaagttgtttaaaagttaa 
aatctctattgtttccaaaatctttcccatctccacctgaagacagaattgcttcccctt 
etc 

SEQ ID NO: 319. >T60764_T3 # TY Transcript # LN 5724 # Source Gene: T60764 
# Encoded protein: T60764_P6 

cccggcgcccgggcctgaggactggctcggcggagggagaagaggaaacagacttgagca 
gctccccgttgtctcgcaactccactgccgaggaactctcatttcttccctcgctccttc 
accccccacctcatgtagaagggtgctgaggcgtcgggagggaggaggagcctgggctac 
cgtccctgccctccccacccccttcccggggcgctttggtgggcgtggagttggggttgg 
gggggtgggtgggggttgctttttggagtgctggggaacttttttcccttcttcaggtca 
ggggaaagggaatgcccaattcagagagacatgggggcaagaaggacgggagtggaggag 
cttctggaactttgcagccgtcatcgggaggcggcagctctaacagcagagagcgtcacc 
gcttggtatcgaagcacaagcggcataagtccaaacactccaaagacatggggttggtga 
cccccgaagcagcatccctgggcacagttatcaaacctttggtggagtatgatgatatca 
gctctgattccgacaccttctccgatgacatggccttcaaactagaccgaagggagaacg 
acgaacgtcgtggatcagatcggagcgaccgcctgcacaaacatcgtcaccaccagcaca 
ggcgttcccgggacttactaaaagctaaacagaccgaaaaagaaaaaagccaagaagtct 
ccagcaagtcgggatcgatgaaggaccggatatcgggaagttcaaagcgttcgaatgagg 
agactgatgactatgggaaggcgcaggtagccaaaagcagcagcaaggaatccaggtcat 
ccaagctccacaaggagaagaccaggaaagaacgggagctgaagtctgggcacaaagacc 
ggagtaaaagtcatcgaaaaagggaaacacccaaaagttacaaaacagtggacagcccaa 
aacggagatccaggagcccccacaggaagtggtctgacagctccaaacaagatgatagcc 
cctcgggagcttcttatggccaagattatgaccttagtccctcacgatctcatacctcga 
gcaattatgactcctacaagaaaagtcctggaagtacctcgagaaggcagtcggtcagtc 
ccccttacaaggagccttcggcctaccagtccagcacccggtcaccgagcccctacagta 
ggcgacagagatctgtcagtccctatagcaggagacggtcgtccagctacgaaagaagtg 
gctcttacagcgggcgatcgcccagtccctatggtcgaaggcggtccagcagccctttcc 
tgagcaagcggtctctgagtcggagtccactccccagtaggaaatccatgaagtccagaa 
gtagaagtcctgcatattcaagacattcatcttctcatagtaaaaagaagagatccagtt 
cacgcagtcgtcattccagtatctcacctgtcaggcttccacttaattccagtctgggag 
ctgaactcagtaggaaaaagaaggaaagagcagctgctgctgctgcagcaaagatggatg 
gaaaggagtccaagggttcacctgtatttttgcctagaaaagagaacagttcagtagagg 
ctaaggattcaggtttggagtctaaaaagttacccagaagtgtaaaattggaaaaatctg 
ccccagatactgaactggtgaatgtaacacatctaaacacagaggtaaaaaattcttcag 
atacagggaaagtaaagttggatgagaactccgagaagcatcttgttaaagatttgaaag 
cacagggaacaagagactctaaacccatagcactgaaagaggagattgttactccaaagg 
agacagaaacatcagaaaaggagacccctccacctcttcccacaattgcttctcccccac 
cccctctaccaactactacccctccacctcagacaccccctttgccacctttgcctccaa 
taccagctcttccacagcaaccacctctgcctccttctcagccagcatttagtcaggttc 
ctgcttccagtacttcaactttgcccccttctactcactcaaagacatctgctgtgtcct 
ctcaggcaaattctcagccccctgtacaggtttctgtgaagactcaagtatctgtaacag 
ctgctattccacacctgaaaacttcaacgttgcctcctttgcccctcccacccttattac 
ctggagatgatgacatggatagtccaaaagaaactcttccttcaaaacctgtgaagaaag 
agaaggaacagaggacacgtcacttactcacagaccttcctctccctccagagctccctg 
gtggagatctgtctcccccagactctccagaaccaaaggcaatcacaccacctcagcaac 
catataaaaagagaccaaaaatttgttgtcctcgttatggagaaagaagacaaacagaaa 
gcgactgggggaaacgctgtgtggacaagtttgacattattgggattattggagaaggaa 
cctatggccaagtatataaagccaaggacaaagacacaggagaactagtggctctgaaga 
aggtgagactagacaatgagaaagagggcttcccaatcacagccattcgtgaaatcaaaa 
tccttcgtcagttaatccaccgaagtgttgttaacatgaaggaaattgtcacagataaac 
aagatgcactggatttcaagaaggacaaaggtgccttttaccttgtatttgagtatatgg 
accatgacttaatgggactgctagaatctggtttggtgcacttttctgaggaccatatca 
agtcgttcatgaaacagctaatggaaggattggaatactgtcacaaaaagaatttcctgc 
atcgggatattaagtgttctaacattttgctgaataacagtgggcaaatcaaactagcag 
attttggacttgctcggctctataactctgaagagagtcgcccttacacaaacaaagtca 
ttactttgtggtaccgacctccagaactactgctaggagaggaacgttacacaccagcca 
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Figure 1 (Cont'd) 



tagatgtttggagctgtggatgtattcttggggaactattcacaaagaagcctatttttc 
aagccaatctggaactggctcagctagaactgatcagccgactttgtggtagcccttgtc 
cagctgtgtggcctgatgttatcaaactgccctacttcaacaccatgaaaccgaagaagc 
aatatcgaaggcgtctacgagaagaattctctttcattccttctgcagcacttgatttat 
tggaccacatgctgacactagatcctagtaagcggtgcacagctgaacagaccctacaga 
gcgacttccttaaagatgtcgaactcagcaaaatggctcctccagacctcccccactggc 
aggattgccatgagttgtggagtaagaaacggcgacgtcagcgacaaagtggtgttgtag 
tcgaagagccacctccatccaaaacttctcgaaaagaaactacctcagggacaagtactg 
agcctgtgaagaacagcagcccagcaccacctcagcctgctcctggcaaggtggagtctg 
gggctggggatgcaataggccttgctgacatcacacaacagctgaatcaaagtgaattgg 
cagtgttattaaacctgctgcagagccaaaccgacctgagcatccctcaaatggcacagc 
tgcttaacatccactccaacccagagatgcagcagcagctggaagccctgaaccaatcca 
tcagtgccctgacggaagctacttcccagcagcaggactcagagaccatggccccagagg 
agtctttgaaggaagcaccctctgccccagtgatcctgccttcagcagaacagacgaccc 
ttgaagcttcaagcacaccagctgacatgcagaatatattggcagttctcttgagtcagc 
tgatgaaaacccaagagccagcaggcagtctggaggaaaacaacagtgacaagaacagtg 
ggccacaggggccccgaagaactcccacaatgccacaggaggaggcagcagcatgtcctc 
ctcacattcttccaccagagaagaggccccctgagccccccggacctccaccgccgccac 
ctccaccccctctggttgaaggcgatctttccagcgccccccaggagttgaacccagccg 
tgacagccgccttgctgcaacttttatcccagcctgaagcagagcctcctggccacctgc 
cacatgagcaccaggccttgagaccaatggagtactccacccgaccccgtccaaacagga 
cttatggaaacactgatgggcctgaaacagggttcagtgccattgacactgatgaacgaa 
actctggtccagccttgacagaatccttggtccagaccctggtgaagaacaggaccttct 
caggctctctgagccaccttggggagtccagcagttaccagggcacagggtcagtgcagt 
ttccaggggaccaggacctccgttttgccagggtccccttagcgttacacccggtggtcg 
ggcaaccattcctgaaggctgagggaagcagcaattctgtggtacatgcagagaccaaat 
tgcaaaactatggggagctggggccaggaaccactggggccagcagctcaggagcaggcc 
ttcactgggggggcccaactcagtcttctgcttatggaaaactctatcgggggcctacaa 
gagtcccaccaagagggggaagagggagaggagttccttactaacccagagacttcagtg 
tcctgaaagattcctttcctatccatccttccatccagttctctgaatctttaatgaaat 
catttgccagagcgaggtaatcatctgcatttggctactgcaaagctgtccgttgtattc 
cttgctcacttgctactagcaggcgacttacgaaataatgatgttggcaccagttccccc 
tggatgggctatagccagaacatttacttcaactctaccttagtagatacaagtagagaa 
tatggagaggatcattacattgaaaagtaaatgttttattagttcattgcctgcacttac 
tgatcggaagagagaaagaacagtttcagtattgagatggctcaggagaggctctttgat 
ttttaaagttttggggtggggggttgtgtgtggtttctttcttttgaattttaatttagg 
tgttttgggtttttttcctttaaagagaatagtgttcacaaaatttgagctgctctttgg 
cttttgctataagggaaacagagtggcctggctgatttgaataaatgtttctttcctctc 
caccatctcacattttgcttttaagtgaacactttttccccattgagcatcttgaacata 
ctttttttccaaataaattactcatccttaaagtttactccactttgacaaaagatacgc 
ccttctccctgcacataaagcaggttgtagaacgtggcattcttgggcaagtaggtagac 
tttacccagtctctttccttttttgctgatgtgtgctctctctctctctttctctctctc 
tctctctctctctctctctctgtctgtctcgcttgctcgctctcgctgtttctctctctt 
tgaggcatttgtttggaaaaaatcgttgagatgcccaagaacctgggataattctttact 
ttttttgaaataaaggaaaggaaa 

SEQ ID NO: 320. >T60764_T4 # TY Transcript # LN 6522 # Source Gene: T60764 
# Encoded protein: T60764_P1 

cccggcgcccgggcctgaggactggctcggcggagggagaagaggaaacagacttgagca 
gctccccgttgtctcgcaactccactgccgaggaactctcatttcttccctcgctccttc 
accccccacctcatgtagaagggtgctgaggcgtcgggagggaggaggagcctgggctac 
cgtccctgccctccccacccccttcccggggcgctttggtgggcgtggagttggggttgg 
gggggtgggtgggggttgctttttggagtgctggggaacttttttcccttcttcaggtca 
ggggaaagggaatgcccaattcagagagacatgggggcaagaaggacgggagtggaggag 
cttctggaactttgcagccgtcatcgggaggcggcagctctaacagcagagagcgtcacc 
gcttggtatcgaagcacaagcggcataagtccaaacactccaaagacatggggttggtga 
cccccgaagcagcatccctgggcacagttatcaaacctttggtggagtatgatgatatca 
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Figure 1 (Cont'd) 



gctctgattccgacaccttctccgatgacatggccttcaaactagaccgaagggagaacg 
acgaacgtcgtggatcagatcggagcgaccgcctgcacaaacatcgtcaccaccagcaca 
ggcgttcccgggacttactaaaagctaaacagaccgaaaaagaaaaaagccaagaagtct 
ccagcaagtcgggatcgatgaaggaccggatatcgggaagttcaaagcgttcgaatgagg 
agactgatgactatgggaaggcgcaggtagccaaaagcagcagcaaggaatccaggtcat 
ccaagctccacaaggagaagaccaggaaagaacgggagctgaagtctgggcacaaagacc 
ggagtaaaagtcatcgaaaaagggaaacacccaaaagttacaaaacagtggacagcccaa 
aacggagatccaggagcccccacaggaagtggtctgacagctccaaacaagatgatagcc 
cctcgggagcttcttatggccaagattatgaccttagtccctcacgatctcatacctcga 
gcaattatgactcctacaagaaaagtcctggaagtacctcgagaaggcagtcggtcagtc 
ccccttacaaggagccttcggcctaccagtccagcacccggtcaccgagcccctacagta 
ggcgacagagatctgtcagtccctatagcaggagacggtcgtccagctacgaaagaagtg 
gctcttacagcgggcgatcgcccagtccctatggtcgaaggcggtccagcagccctttcc 
tgagcaagcggtctctgagtcggagtccactccccagtaggaaatccatgaagtccagaa 
gtagaagtcctgcatattcaagacattcatcttctcatagtaaaaagaagagatccagtt 
cacgcagtcgtcattccagtatctcacctgtcaggcttccacttaattccagtctgggag 
ctgaactcagtaggaaaaagaaggaaagagcagctgctgctgctgcagcaaagatggatg 
gaaaggagtccaagggttcacctgtatttttgcctagaaaagagaacagttcagtagagg 
ctaaggattcaggtttggagtctaaaaagttacccagaagtgtaaaattggaaaaatctg 
ccccagatactgaactggtgaatgtaacacatctaaacacagaggtaaaaaattcttcag 
atacagggaaagtaaagttggatgagaactccgagaagcatcttgttaaagatttgaaag 
cacagggaacaagagactctaaacccatagcactgaaagaggagattgttactccaaagg 
agacagaaacatcagaaaaggagacccctccacctcttcccacaattgcttctcccccac 
cccctctaccaactactacccctccacctcagacaccccctttgccacctttgcctccaa 
taccagctcttccacagcaaccacctctgcctccttctcagccagcatttagtcaggttc 
ctgcttccagtacttcaactttgcccccttctactcactcaaagacatctgctgtgtcct 
ctcaggcaaattctcagccccctgtacaggtttctgtgaagactcaagtatctgtaacag 
ctgctattccacacctgaaaacttcaacgttgcctcctttgcccctcccacccttattac 
ctggagatgatgacatggatagtccaaaagaaactcttccttcaaaacctgtgaagaaag 
agaaggaacagaggacacgtcacttactcacagaccttcctctccctccagagctccctg 
gtggagatctgtctcccccagactctccagaaccaaaggcaatcacaccacctcagcaac 
catataaaaagagaccaaaaatttgttgtcctcgttatggagaaagaagacaaacagaaa 
gcgactgggggaaacgctgtgtggacaagtttgacattattgggattattggagaaggaa 
cctatggccaagtatataaagccaaggacaaagacacaggagaactagtggctctgaaga 
aggtgagactagacaatgagaaagagggcttcccaatcacagccattcgtgaaatcaaaa 
tccttcgtcagttaatccaccgaagtgttgttaacatgaaggaaattgtcacagataaac 
aagatgcactggatttcaagaaggacaaaggtgccttttaccttgtatttgagtatatgg 
accatgacttaatgggactgctagaatctggtttggtgcacttttctgaggaccatatca 
agtcgttcatgaaacagctaatggaaggattggaatactgtcacaaaaagaatttcctgc 
atcgggatattaagtgttctaacattttgctgaataacagtgggcaaatcaaactagcag 
attttggacttgctcggctctataactctgaagagagtcgcccttacacaaacaaagtca 
ttactttgtggtaccgacctccagaactactgctaggagaggaacgttacacaccagcca 
tagatgtttggagctgtggatgtattcttggggaactattcacaaagaagcctatttttc 
aagccaatctggaactggctcagctagaactgatcagccgactttgtggtagcccttgtc 
cagctgtgtggcctgatgttatcaaactgccctacttcaacaccatgaaaccgaagaagc 
aatatcgaaggcgtctacgagaagaattctctttcattccttctgcagcacttgatttat 
tggaccacatgctgacactagatcctagtaagcggtgcacagctgaacagaccctacaga 
gcgacttccttaaagatgtcgaactcagcaaaatggctcctccagacctcccccactggc 
aggattgccatgagttgtggagtaagaaacggcgacgtcagcgacaaagtggtgttgtag 
tcgaagagccacctccatccaaaacttctcgaaaagaaactacctcagggacaagtactg 
agcctgtgaagaacagcagcccagcaccacctcagcctgctcctggcaaggtggagtctg 
gggctggggatgcaataggccttgctgacatcacacaacagctgaatcaaagtgaattgg 
cagtgttattaaacctgctgcagagccaaaccgacctgagcatccctcaaatggcacagc 
tgcttaacatccactccaacccagagatgcagcagcagctggaagccctgaaccaatcca 
tcagtgccctgacggaagctacttcccagcagcaggactcagagaccatggccccagagg 
agtctttgaaggaagcaccctctgccccagtgatcctgccttcagcagaacagacgaccc 
ttgaagcttcaagcacaccagctgacatgcagaatatattggcagttctcttgagtcagc 
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Figure 1 (Cont'd) 



tgatgaaaacccaagagccagcaggcagtctggaggaaaacaacagtgacaagaacagtg 
ggccacaggggccccgaagaactcccacaatgccacaggaggaggcagcagagaagaggc 
cccctgagccccccggacctccaccgccgccacctccaccccctctggttgaaggcgatc 
tttccagcgccccccaggagttgaacccagccgtgacagccgccttgctgcaacttttat 
cccagcctgaagcagagcctcctggccacctgccacatgagcaccaggccttgagaccaa 
tggagtactccacccgaccccgtccaaacaggacttatggaaacactgatgggcctgaaa 
cagggttcagtgccattgacactgatgaacgaaactctggtccagccttgacagaatcct 
tggtccagaccctggtgaagaacaggaccttctcaggctctctgagccaccttggggagt 
ccagcagttaccagggcacagggtcagtgcagtttccaggggaccaggacctccgttttg 
ccagggtccccttagcgttacacccggtggtcgggcaaccattcctgaaggctgagggaa 
gcagcaattctgtggtacatgcagagaccaaattgcaaaactatggggagctggggccag 
gaaccactggggccagcagctcaggagcaggccttcactgggggggcccaactcagtctt 
ctgcttatggaaaactctatcgggggcctacaagagtcccaccaagagggggaagaggga 
gaggagttccttactaacccagagacttcagtgtcctgaaagattcctttcctatccatc 
cttccatccagttctctgaatctttaatgaaatcatttgccagagcgaggtaatcatctg 
catttggctactgcaaagctgtccgttgtattccttgctcacttgctactagcaggcgac 
ttacgaaataatgatgttggcaccagttccccctggatgggctatagccagaacatttac 
ttcaactctaccttagtagatacaagtagagaatatggagaggatcattacattgaaaag 
taaatgttttattagttcattgcctgcacttactgatcggaagagagaaagaacagtttc 
agtattgagatggctcaggagaggctctttgatttttaaagttttggggtggggggttgt 
gtgtggtttctttcttttgaattttaatttaggtgttttgggtttttttcctttaaagag 
aatagtgttcacaaaatttgagctgctctttggcttttgctataagggaaacagagtggc 
ctggctgatttgaataaatgtttctttcctctccaccatctcacattttgcttttaagtg 
aacactttttccccattgagcatcttgaacatactttttttccaaataaattactcatcc 
ttaaagtttactccactttgacaaaagatacgcccttctccctgcacataaagcaggttg 
tagaacgtggcattcttgggcaagtaggtagactttacccagtctctttccttttttgct 
gatgtgtgctctctctctctctttctctctctctctctctctctctctctctctgtctgt 
ctcgcttgctcgctctcgctgtttctctctctttgaggcatttgtttggaaaaaatcgtt 
gagatgcccaagaacctgggataattctttactttttttgaaataaaggaaaggaaattc 
agactcttacattgttctctgtaactcttcaattctaaaatgttttgttttttaaaccat 
gttctgatggggaagttgatttgtaagtgtggacagcttggacattgctgctgagctgtg 
gttagagatgatgcctccattcctagagggctaataacagcatttagcatattgtttaca 
catatatttttatgtcaaaaaaaaaacaaaaacctttcaaacagagcattgtgatattgt 
caaagagaaaaacaaatcctgaagatacatggaaatgtaacctagtttagggtgggtatt 
tttctgaagatacatcaatacctgaccttttttaaaaaaataattttaaaacagcatact 
gtgaggaagaacagtattgacatacccacatcccagcatgtgtaccctgccagttctttt 
agggatttttcctccaaagagatttggatttggttttggtaaaaggggttaaattgtgct 
tccaggcaagaactttgccttatcataaacaggaaatgaaaaagggaagggctgtcagga 
tgggataatttgggaggcttctcattctggcttctatttctatgtgagtaccagcatata 
gagtgttttaaaaacagatacatgtcatataatttatctgcacagacttagaccttcagg 
aaacataggttaagcccccttttacaaagaaaaagtaaacatacttcagcatcttggagg 
gtagttttcaaaactcaagtttcatgtttcaatgccaagttcttattttaaaaaataaaa 
tctacttataagagaaaggtgcattacttaaaaaaaaaaaaa 

SEQ ID NO: 321. >T60764JT5 # TY Transcript # LN 7374 # Source Gene: T60764 
# Encoded protein: T60764_P1 

cccggcgcccgggcctgaggactggctcggcggagggagaagaggaaacagacttgagca 
gctccccgttgtctcgcaactccactgccgaggaactctcatttcttccctcgctccttc 
accccccacctcatgtagaagggtgctgaggcgtcgggagggaggaggagcctgggctac 
cgtccctgccctccccacccccttcccggggcgctttggtgggcgtggagttggggttgg 
gggggtgggtgggggttgctttttggagtgctggggaacttttttcccttcttcaggtca 
ggggaaagggaatgcccaattcagagagacatgggggcaagaaggacgggagtggaggag 
cttctggaactttgcagccgtcatcgggaggcggcagctctaacagcagagagcgtcacc 
gcttggtatcgaagcacaagcggcataagtccaaacactccaaagacatggggttggtga 
cccccgaagcagcatccctgggcacagttatcaaacctttggtggagtatgatgatatca 
gctctgattccgacaccttctccgatgacatggccttcaaactagaccgaagggagaacg 
acgaacgtcgtggatcagatcggagcgaccgcctgcacaaacatcgtcaccaccagcaca 
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Figure 1 (Cont'd) 



ggcgttcccgggacttactaaaagctaaacagaccgaaaaagaaaaaagccaagaagtct 
ccagcaagtcgggatcgatgaaggaccggatatcgggaagttcaaagcgttcgaatgagg 
agactgatgactatgggaaggcgcaggtagccaaaagcagcagcaaggaatccaggtcat 
ccaagctccacaaggagaagaccaggaaagaacgggagctgaagtctgggcacaaagacc 
ggagtaaaagtcatcgaaaaagggaaacacccaaaagttacaaaacagtggacagcccaa 
aacggagatccaggagcccccacaggaagtggtctgacagctccaaacaagatgatagcc 
cctcgggagcttcttatggccaagattatgaccttagtccctcacgatctcatacctcga 
gcaattatgactcctacaagaaaagtcctggaagtacctcgagaaggcagtcggtcagtc 
ccccttacaaggagccttcggcctaccagtccagcacccggtcaccgagcccctacagta 
ggcgacagagatctgtcagtccctatagcaggagacggtcgtccagctacgaaagaagtg 
gctcttacagcgggcgatcgcccagtccctatggtcgaaggcggtccagcagccctttcc 
tgagcaagcggtctctgagtcggagtccactccccagtaggaaatccatgaagtccagaa 
gtagaagtcctgcatattcaagacattcatcttctcatagtaaaaagaagagatccagtt 
cacgcagtcgtcattccagtatctcacctgtcaggcttccacttaattccagtctgggag 
ctgaactcagtaggaaaaagaaggaaagagcagctgctgctgctgcagcaaagatggatg 
gaaaggagtccaagggttcacctgtatttttgcctagaaaagagaacagttcagtagagg 
ctaaggattcaggtttggagtctaaaaagttacccagaagtgtaaaattggaaaaatctg 
ccccagatactgaactggtgaatgtaacacatctaaacacagaggtaaaaaattcttcag 
atacagggaaagtaaagttggatgagaactccgagaagcatcttgttaaagatttgaaag 
cacagggaacaagagactctaaacccatagcactgaaagaggagattgttactcca'aagg 
agacagaaacatcagaaaaggagacccctccacctcttcccacaattgcttctcccccac 
cccctctaccaactactacccctccacctcagacaccccctttgccacctttgcctccaa 
taccagctcttccacagcaaccacctctgcctccttctcagccagcatttagtcaggttc 
ctgcttccagtacttcaactttgcccccttctactcactcaaagacatctgctgtgtcct 
ctcaggcaaattctcagccccctgtacaggtttctgtgaagactcaagtatctgtaacag 
ctgctattccacacctgaaaacttcaacgttgcctcctttgcccctcccacccttattac 
ctggagatgatgacatggatagtccaaaagaaactcttccttcaaaacctgtgaagaaag 
agaaggaacagaggacacgtcacttactcacagaccttcctctccctccagagctccctg 
gtggagatctgtctcccccagactctccagaaccaaaggcaatcacaccacctcagcaac 
catataaaaagagaccaaaaatttgttgtcctcgttatggagaaagaagacaaacagaaa 
gcgactgggggaaacgctgtgtggacaagtttgacattattgggattattggagaaggaa 
cctatggccaagtatataaagccaaggacaaagacacaggagaactagtggctctgaaga 
aggtgagactagacaatgagaaagagggcttcccaatcacagccattcgtgaaatcaaaa 
tccttcgtcagttaatccaccgaagtgttgttaacatgaaggaaattgtcacagataaac 
aagatgcactggatttcaagaaggacaaaggtgccttttaccttgtatttgagtatatgg 
accatgacttaatgggactgctagaatctggtttggtgcacttttctgaggaccatatca 
agtcgttcatgaaacagctaatggaaggattggaatactgtcacaaaaagaatttcctgc 
atcgggatattaagtgttctaacattttgctgaataacagtgggcaaatcaaactagcag 
attttggacttgctcggctctataactctgaagagagtcgcccttacacaaacaaagtca 
ttactttgtggtaccgacctccagaactactgctaggagaggaacgttacacaccagcca 
tagatgtttggagctgtggatgtattcttggggaactattcacaaagaagcctatttttc 
aagccaatctggaactggctcagctagaactgatcagccgactttgtggtagcccttgtc 
cagctgtgtggcctgatgttatcaaactgccctacttcaacaccatgaaaccgaagaagc 
aatatcgaaggcgtctacgagaagaattctctttcattccttctgcagcacttgatttat 
tggaccacatgctgacactagatcctagtaagcggtgcacagctgaacagaccctacaga 
gcgacttccttaaagatgtcgaactcagcaaaatggctcctccagacctcccccactggc 
aggattgccatgagttgtggagtaagaaacggcgacgtcagcgacaaagtggtgttgtag 
tcgaagagccacctccatccaaaacttctcgaaaagaaactacctcagggacaagtactg 
agcctgtgaagaacagcagcccagcaccacctcagcctgctcctggcaaggtggagtctg 
gggctggggatgcaataggccttgctgacatcacacaacagctgaatcaaagtgaattgg 
cagtgttattaaacctgctgcagagccaaaccgacctgagcatccctcaaatggcacagc 
tgcttaacatccactccaacccagagatgcagcagcagctggaagccctgaaccaatcca 
tcagtgccctgacggaagctacttcccagcagcaggactcagagaccatggccccagagg 
agtctttgaaggaagcaccctctgccccagtgatcctgccttcagcagaacagacgaccc 
ttgaagcttcaagcacaccagctgacatgcagaatatattggcagttctcttgagtcagc 
tgatgaaaacccaagagccagcaggcagtctggaggaaaacaacagtgacaagaacagtg 
ggccacaggggccccgaagaactcccacaatgccacaggaggaggcagcagagaagaggc 
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Figur 1 (Cont'd) 



cccctgagccccccggacctccaccgccgccacctccaccccctctggttgaaggcgatc 
tttccagcgccccccaggagttgaacccagccgtgacagccgccttgctgcaacttttat 
cccagcctgaagcagagcctcctggccacctgccacatgagcaccaggccttgagaccaa 
tggagtactccacccgaccccgtccaaacaggacttatggaaacactgatgggcctgaaa 
cagggttcagtgccattgacactgatgaacgaaactctggtccagccttgacagaatcct 
tggtccagaccctggtgaagaacaggaccttctcaggctctctgagccaccttggggagt 
ccagcagttaccagggcacagggtcagtgcagtttccaggggaccaggacctccgttttg 
ccagggtccccttagcgttacacccggtggtcgggcaaccattcctgaaggctgagggaa 
gcagcaattctgtggtacatgcagagaccaaattgcaaaactatggggagctggggccag 
gaaccactggggccagcagctcaggagcaggccttcactgggggggcccaactcagtctt 
ctgcttatggaaaactctatcgggggcctacaagagtcccaccaagagggggaagaggga 
gaggagttccttactaacccagagacttcagtgtcctgaaagattcctttcctatccatc 
cttccatccagttctctgaatctttaatgaaatcatttgccagagcgaggtaatcatctg 
catttggctactgcaaagctgtccgttgtattccttgctcacttgctactagcaggcgac 
ttacgaaataatgatgttggcaccagttccccctggatgggctatagccagaacatttac 
ttcaactctaccttagtagatacaagtagagaatatggagaggatcattacattgaaaag 
taaatgttttattagttcattgcctgcacttactgatcggaagagagaaagaacagtttc 
agtattgagatggctcaggagaggctctttgatttttaaagttttggggtggggggttgt 
gtgtggtttctttcttttgaattttaatttaggtgttttgggtttttttcctttaaagag 
aatagtgttcacaaaatttgagctgctctttggcttttgctataagggaaacagagtggc 
ctggctgatttgaataaatgtttctttcctctccaccatctcacattttgcttttaagtg 
aacactttttccccattgagcatcttgaacatactttttttccaaataaattactcatcc 
ttaaagtttactccactttgacaaaagatacgcccttctccctgcacataaagcaggttg 
tagaacgtggcattcttgggcaagtaggtagactttacccagtctctttccttttttgct 
gatgtgtgctctctctctctctttctctctctctctctctctctctctctctctgtctgt 
ctcgcttgctcgctctcgctgtttctctctctttgaggcatttgtttggaaaaaatcgtt 
gagatgcccaagaacctgggataattctttactttttttgaaataaaggaaaggaaattc 
agactcttacattgttctctgtaactcttcaattctaaaatgttttgttttttaaaccat 
gttctgatggggaagttgatttgtaagtgtggacagcttggacattgctgctgagctgtg 
gttagagatgatgcctccattcctagagggctaataacagcatttagcatattgtttaca 
catatatttttatgtcaaaaaaaaaacaaaaacctttcaaacagagcattgtgatattgt 
caaagagaaaaacaaatcctgaagatacatggaaatgtaacctagtttagggtgggtatt 
tttctgaagatacatcaatacctgaccttttttaaaaaaataattttaaaacagcatact 
gtgaggaagaacagtattgacatacccacatcccagcatgtgtaccctgccagttctttt 
agggatttttcctccaaagagatttggatttggttttggtaaaaggggttaaattgtgct 
tccaggcaagaactttgccttatcataaacaggaaatgaaaaagggaagggctgtcagga 
tgggataatttgggaggcttctcattctggcttctatttctatgtgagtaccagcatata 
gagtgttttaaaaacagatacatgtcatataatttatctgcacagacttagaccttcagg 
aaacataggttaagcccccttttacaaagaaaaagtaaacatacttcagcatcttggagg 
gtagttttcaaaactcaagtttcatgtttcaatgccaagttcttattttaaaaaataaaa 
tctacttataagagaaaggtgcattacttaaaaaaaaaaaaactttaaagaaatgaaaga 
agaaccctcttcagatacttacttgaagactgttttcccctgttaatgagatatagctag 
atatcggtgtgtgtatttctttattattctctggtttttgatctggccttgcctccaggg 
ccaaacactgatttagaaagagagccttctagctattttggcattgatggctttttatac 
cagtgtgtccagttagatttactaggcttactgacatgctattggtaaatcgcattaaag 
ttcatctgaaccttctgtctgttgacttcttagtcctcagacatgggcctttgtgtttta 
gaatatttgaatttgagttattgggccccactccctgttttttattaaagaacgtgagcc 
tgggatactttcagaagtatctgttcaatgaaaaaaagttggtttcccatcaaatatgaa 
taaaattctctatatatttcaaaaaaaaaaaaaaaaaaaaaaaaaaaaaattttgttggc 
gcgccgcgcctcggagcccccgagaaagaagctcttttaaacacccattctggttgtggg 
cgcgccgcgcggcgccccagtgtaaggttactgtgtgaacacagtgggttccatcagccg 
tgtgttcgccttcaaggagagaactcaggctgggtcctgagaacatgaggtgcttgtggg 
attttttcccaccacatattaaaaaacaccgcccgcgggggaacaacgagcgaaggtgtt 
ttcgtgaaacaaatcttcctcagaaggagagttcccggaagggcttaacatcacggggat 
gtccttttaaaaaggagggtttccacgacagagctcaaatttttctatttaccc 
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Figure 1 (Cont'd) 



SEQ ID NO: 322. >T60764_T6 # TY Transcript # LN 6704 8 Source Gene: T60764 
f Encoded protein: T607 64_P1 

cccggcgcccgggcctgaggactggctcggcggagggagaagaggaaacagacttgagca 
gctccccgttgtctcgcaactccactgccgaggaactctcatttcttccctcgctccttc 
accccccacctcatgtagaagggtgctgaggcgtcgggagggaggaggagcctgggctac 
cgtccctgccctccccacccccttcccggggcgctttggtgggcgtggagttggggttgg 
gggggtgggtgggggttgctttttggagtgctggggaacttttttcccttcttcaggtca 
ggggaaagggaatgcccaattcagagagacatgggggcaagaaggacgggagtggaggag 
cttctggaactttgcagccgtcatcgggaggcggcagctctaacagcagagagcgtcacc 
gcttggtatcgaagcacaagcggcataagtccaaacactccaaagacatggggttggtga 
cccccgaagcagcatccctgggcacagttatcaaacctttggtggagtatgatgatatca 
gctctgattccgacaccttctccgatgacatggccttcaaactagaccgaagggagaacg 
acgaacgtcgtggatcagatcggagcgaccgcctgcacaaacatcgtcaccaccagcaca 
ggcgttcccgggacttactaaaagctaaacagaccgaaaaagaaaaaagccaagaagtct 
ccagcaagtcgggatcgatgaaggaccggatatcgggaagttcaaagcgttcgaatgagg 
agactgatgactatgggaaggcgcaggtagccaaaagcagcagcaaggaatccaggtcat 
ccaagctccacaaggagaagaccaggaaagaacgggagctgaagtctgggcacaaagacc 
ggagtaaaagtcatcgaaaaagggaaacacccaaaagttacaaaacagtggacagcccaa 
aacggagatccaggagcccccacaggaagtggtctgacagctccaaacaagatgatagcc 
cctcgggagcttcttatggccaagattatgaccttagtccctcacgatctcatacctcga 
gcaattatgactcctacaagaaaagtcctggaagtacctcgagaaggcagtcggtcagtc 
ccccttacaaggagccttcggcctaccagtccagcacccggtcaccgagcccctacagta 
ggcgacagagatctgtcagtccctatagcaggagacggtcgtccagctacgaaagaagtg 
gctcttacagcgggcgatcgcccagtccctatggtcgaaggcggtccagcagccctttcc 
tgagcaagcggtctctgagtcggagtccactccccagtaggaaatccatgaagtccagaa 
gtagaagtcctgcatattcaagacattcatcttctcatagtaaaaagaagagatccagtt 
cacgcagtcgtcattccagtatctcacctgtcaggcttccacttaattccagtctgggag 
ctgaactcagtaggaaaaagaaggaaagagcagctgctgctgctgcagcaaagatggatg 
gaaaggagtccaagggttcacctgtatttttgcctagaaaagagaacagttcagtagagg 
ctaaggattcaggtttggagtctaaaaagttacccagaagtgtaaaattggaaaaatctg 
ccccagatactgaactggtgaatgtaacacatctaaacacagaggtaaaaaattcttcag 
atacagggaaagtaaagttggatgagaactccgagaagcatcttgttaaagatttgaaag 
cacagggaacaagagactctaaacccatagcactgaaagaggagattgttactccaaagg 
agacagaaacatcagaaaaggagacccctccacctcttcccacaattgcttctcccccac 
cccctctaccaactactacccctccacctcagacaccccctttgccacctttgcctccaa 
taccagctcttccacagcaaccacctctgcctccttctcagccagcatttagtcaggttc 
ctgcttccagtacttcaactttgcccccttctactcactcaaagacatctgctgtgtcct 
ctcaggcaaattctcagccccctgtacaggtttctgtgaagactcaagtatctgtaacag 
ctgctattccacacctgaaaacttcaacgttgcctcctttgcccctcccacccttattac 
ctggagatgatgacatggatagtccaaaagaaactcttccttcaaaacctgtgaagaaag 
agaaggaacagaggacacgtcacttactcacagaccttcctctccctccagagctccctg 
gtggagatctgtctcccccagactctccagaaccaaaggcaatcacaccacctcagcaac 
catataaaaagagaccaaaaatttgttgtcctcgttatggagaaagaagacaaacagaaa 
gcgactgggggaaacgctgtgtggacaagtttgacattattgggattattggagaaggaa 
cctatggccaagtatataaagccaaggacaaagacacaggagaactagtggctctgaaga 
aggtgagactagacaatgagaaagagggcttcccaatcacagccattcgtgaaatcaaaa 
tccttcgtcagttaatccaccgaagtgttgttaacatgaaggaaattgtcacagataaac 
aagatgcactggatttcaagaaggacaaaggtgccttttaccttgtatttgagtatatgg 
accatgacttaatgggactgctagaatctggtttggtgcacttttctgaggaccatatca 
agtcgttcatgaaacagctaatggaaggattggaatactgtcacaaaaagaatttcctgc 
atcgggatattaagtgttctaacattttgctgaataacagtgggcaaatcaaactagcag 
attttggacttgctcggctctataactctgaagagagtcgcccttacacaaacaaagtca 
ttactttgtggtaccgacctccagaactactgctaggagaggaacgttacacaccagcca 
tagatgtttggagctgtggatgtattcttggggaactattcacaaagaagcctatttttc 
aagccaatctggaactggctcagctagaactgatcagccgactttgtggtagcccttgtc 
cagctgtgtggcctgatgttatcaaactgccctacttcaacaccatgaaaccgaagaagc 
aatatcgaaggcgtctacgagaagaattctctttcattccttctgcagcacttgatttat 
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Figur 1 (Cont'd) 



tggaccacatgctgacactagatcctagtaagcggtgcacagctgaacagaccctacaga 
gcgacttccttaaagatgtcgaactcagcaaaatggctcctccagacctcccccactggc 
aggattgccatgagttgtggagtaagaaacggcgacgtcagcgacaaagtggtgttgtag 
tcgaagagccacctccatccaaaacttctcgaaaagaaactacctcagggacaagtactg 
agcctgtgaagaacagcagcccagcaccacctcagcctgctcctggcaaggtggagtctg 
gggctggggatgcaataggccttgctgacatcacacaacagctgaatcaaagtgaattgg 
cagtgttattaaacctgctgcagagccaaaccgacctgagcatccctcaaatggcacagc 
tgcttaacatccactccaacccagagatgcagcagcagctggaagccctgaaccaatcca 
tcagtgccctgacggaagctacttcccagcagcaggactcagagaccatggccccagagg 
agtctttgaaggaagcaccctctgccccagtgatcctgccttcagcagaacagacgaccc 
ttgaagcttcaagcacaccagctgacatgcagaatatattggcagttctcttgagtcagc 
tgatgaaaacccaagagccagcaggcagtctggaggaaaacaacagtgacaagaacagtg 
ggccacaggggccccgaagaactcccacaatgccacaggaggaggcagcagagaagaggc 
cccctgagccccccggacctccaccgccgccacctccaccccctctggttgaaggcgatc 
tttccagcgccccccaggagttgaacccagccgtgacagccgccttgctgcaacttttat 
cccagcctgaagcagagcctcctggccacctgccacatgagcaccaggccttgagaccaa 
tggagtactccacccgaccccgtccaaacaggacttatggaaacactgatgggcctgaaa 
cagggttcagtgccattgacactgatgaacgaaactctggtccagccttgacagaatcct 
tggtccagaccctggtgaagaacaggaccttctcaggctctctgagccaccttggggagt 
ccagcagttaccagggcacagggtcagtgcagtttccaggggaccaggacctccgttttg 
ccagggtccccttagcgttacacccggtggtcgggcaaccattcctgaaggctgagggaa 
gcagcaattctgtggtacatgcagagaccaaattgcaaaactatggggagctggggccag 
gaaccactggggccagcagctcaggagcaggccttcactgggggggcccaactcagtctt 
ctgcttatggaaaactctatcgggggcctacaagagtcccaccaagagggggaagaggga 
gaggagttccttactaacccagagacttcagtgtcctgaaagattcctttcctatccatc 
cttccatccagttctctgaatctttaatgaaatcatttgccagagcgaggtaatcatctg 
catttggctactgcaaagctgtccgttgtattccttgctcacttgctactagcaggcgac 
ttacgaaataatgatgttggcaccagttccccctggatgggctatagccagaacatttac 
ttcaactctaccttagtagatacaagtagagaatatggagaggatcattacattgaaaag 
taaatgttttattagttcattgcctgcacttactgatcggaagagagaaagaacagtttc 
agtattgagatggctcaggagaggctctttgatttttaaagttttggggtggggggttgt 
gtgtggtttctttcttttgaattttaatttaggtgttttgggtttttttcctttaaagag 
aatagtgttcacaaaatttgagctgctctttggcttttgctataagggaaacagagtggc 
ctggctgatttgaataaatgtttctttcctctccaccatctcacattttgcttttaagtg 
aacactttttccccattgagcatcttgaacatactttttttccaaataaat tact ca tec 
ttaaagtttactccactttgacaaaagatacgcccttctccctgcacataaagcaggttg 
tagaacgtggcattcttgggcaagtaggtagactttacccagtctctttccttttttgct 
gatgtgtgctctctctctctctttctctctctctctctctctctctctctctctgtctgt 
ctcgcttgctcgctctcgctgtttctctctctttgaggcatttgtttggaaaaaatcgtt 
gagatgcccaagaacctgggataattctttactttttttgaaataaaggaaaggaaattc 
agactcttacattgttctctgtaactcttcaattctaaaatgttttgttttttaaaccat 
gttctgatggggaagttgatttgtaagtgtggacagcttggacattgctgctgagctgtg 
gttagagatgatgcctccattcctagagggctaataacagcatttagcatattgtttaca 
catatatttttatgtcaaaaaaaaaacaaaaacctttcaaacagagcattgtgatattgt 
caaagagaaaaacaaatcctgaagatacatggaaatgtaacctagtttagggtgggtatt 
tttctgaagatacatcaatacctgaccttttttaaaaaaataattttaaaacagcatact 
gtgaggaagaacagtattgacatacccacatcccagcatgtgtaccctgccagttctttt 
agggatttttcctccaaagagatttggatttggttttggtaaaaggggttaaattgtgct 
tccaggcaagaactttgccttatcataaacaggaaatgaaaaagggaagggctgtcagga 
tgggataatttgggaggcttctcattctggcttctatttctatgtgagtaccagcatata 
gagtgttttaaaaacagatacatgtcatataatttatctgcacagacttagaccttcagg 
aaacataggttaagcccccttttacaaagaaaaagtaaacatacttcagcatcttggagg 
gtagttttcaaaactcaagtttcatgtttcaatgccaagttcttattttaaaaaataaaa 
tctacttataagagaaaggtgcattacttaaaaaaaaaaaaaaaaaaagggcatctggct 
gcagtgaggcaggcataggtgcaccctctgcagagaagctctcagcttcaatcctcttat 
cctcaatgtaggccacggccccctcagaagcccatgggaatctctgcaaggtggctttgt 
ctcatctgcggaggtggctgatgcgatcggatcatgctggggcc 
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Figur 1 (Cont'd) 



SEQ ID NO: 323. >T60764_T7 # TY Transcript # LN 7803 # Source Gene: T60764 
# Encoded protein: T60764_P3 

cccggcgcccgggcctgaggactggctcggcggagggagaagaggaaacagacttgagca 

gctccccgttgtctcgcaactccactgccgaggaactctcatttcttccctcgctccttc 

accccccacctcatgtagaagggtgctgaggcgtcgggagggaggaggagcctgggctac 

cgtccctgccctccccacccccttcccggggcgctttggtgggcgtggagttggggttgg „ 

gggggtgggtgggggttgctttttggagtgctggggaacttttttcccttcttcaggtca 

ggggaaagggaatgcccaattcagagagacatgggggcaagaaggacgggagtggaggag 

cttctggaactttgcagccgtcatcgggaggcggcagctctaacagcagagagcgtcacc 

gcttggtatcgaagcacaagcggcataagtccaaacactccaaagacatggggttggtga 

cccccgaagcagcatccctgggcacagttatcaaacctttggtggagtatgatgatatca 

gctctgattccgacaccttctccgatgacatggccttcaaactagaccgaagggagaacg 

acgaacgtcgtggatcagatcggagcgaccgcctgcacaaacatcgtcaccaccagcaca 

ggcgttcccgggacttactaaaagctaaacagaccgaaaaagaaaaaagccaagaagtct 

ccagcaagtcgggatcgatgaaggaccggatatcgggaagttcaaagcgttcgaatgagg 

agactgatgactatgggaaggcgcaggtagccaaaagcagcagcaaggaatccaggtcat 

ccaagctccacaaggagaagaccaggaaagaacgggagctgaagtctgggcacaaagacc 

ggagtaaaagtcatcgaaaaagggaaacacccaaaagttacaaaacagtggacagcccaa 

aacggagatccaggagcccccacaggaagtggtctgacagctccaaacaagatgatagcc 

cctcgggagcttcttatggccaagattatgaccttagtccctcacgatctcatacctcga 

gcaattatgactcctacaagaaaagtcctggaagtacctcgagaaggcagtcggtcagtc 

ccccttacaaggagccttcggcctaccagtccagcacccggtcaccgagcccctacagta 

ggcgacagagatctgtcagtccctatagcaggagacggtcgtccagctacgaaagaagtg 

gctcttacagcgggcgatcgcccagtccctatggtcgaaggcggtccagcagccctttcc 

tgagcaagcggtctctgagtcggagtccactccccagtaggaaatccatgaagtccagaa 

gtagaagtcctgcatattcaagacattcatcttctcatagtaaaaagaagagatccagtt 

cacgcagtcgtcattccagtatctcacctgtcaggcttccacttaattccagtctgggag 

ctgaactcagtaggaaaaagaaggaaagagcagctgctgctgctgcagcaaagatggatg 

gaaaggagtccaagggttcacctgtatttttgcctagaaaagagaacagttcagtagagg 

ctaaggattcaggtttggagtctaaaaagttacccagaagtgtaaaattggaaaaatctg 

ccccagatactgaactggtgaatgtaacacatctaaacacagaggtaaaaaattcttcag 

atacagggaaagtaaagttggatgagaactccgagaagcatcttgttaaagatttgaaag 

cacagggaacaagagactctaaacccatagcactgaaagaggagattgttactccaaagg 

agacagaaacatcagaaaaggagacccctccacctcttcccacaattgcttctcccccac 

cccctctaccaactactacccctccacctcagacaccccctttgccacctttgcctccaa 

taccagctcttccacagcaaccacctctgcctccttctcagccagcatttagtcaggttc 

ctgcttccagtacttcaactttgcccccttctactcactcaaagacatctgctgtgtcct 

ctcaggcaaattctcagccccctgtacaggtttctgtgaagactcaagtatctgtaacag 

ctgctattccacacctgaaaacttcaacgttgcctcctttgcccctcccacccttattac 

ctggagatgatgacatggatagaatttgttgtcctcgttatggagaaagaagacaaacag 

aaagcgactgggggaaacgctgtgtggacaagtttgacattattgggattattggagaag 

gaacctatggccaagtatataaagccaaggacaaagacacaggagaactagtggctctga 

agaaggtgagactagacaatgagaaagagggcttcccaatcacagccattcgtgaaatca 

aaatccttcgtcagttaatccaccgaagtgttgttaacatgaaggaaattgtcacagata 

aacaagatgcactggatttcaagaaggacaaaggtgccttttaccttgtatttgagtata 

tggaccatgacttaatgggactgctagaatctggtttggtgcacttttctgaggaccata 

tcaagtcgttcatgaaacagctaatggaaggattggaatactgtcacaaaaagaatttcc 

tgcatcgggatattaagtgttctaacattttgctgaataacagtgggcaaatcaaactag 

cagattttggacttgctcggctctataactctgaagagagtcgcccttacacaaacaaag 

tcattactttgtggtaccgacctccagaactactgctaggagaggaacgttacacaccag 

ccatagatgtttggagctgtggatgtattcttggggaactattcacaaagaagcctattt 

ttcaagccaatctggaactggctcagctagaactgatcagccgactttgtggtagccctt 

gtccagctgtgtggcctgatgttatcaaactgccctacttcaacaccatgaaaccgaaga 

agcaatatcgaaggcgtctacgagaagaattctctttcattccttctgcagcacttgatt 

tattggaccacatgctgacactagatcctagtaagcggtgcacagctgaacagaccctac 

agagcgacttccttaaagatgtcgaactcagcaaaatggctcctccagacctcccccact 
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ggcaggattgccatgagttgtggagtaagaaacggcgacgtcagcgacaaagtggtgttg 
tagtcgaagagccacctccatccaaaacttctcgaaaagaaactacctcagggacaagta 
ctgagcctgtgaagaacagcagcccagcaccacctcagcctgctcctggcaaggtggagt 
ctggggctggggatgcaataggccttgctgacatcacacaacagctgaatcaaagtgaat 
tggcagtgttattaaacctgctgcagagccaaaccgacctgagcatccctcaaatggcac 
agctgcttaacatccactccaacccagagatgcagcagcagctggaagccctgaaccaat 
ccatcagtgccctgacggaagctacttcccagcagcaggactcagagaccatggccccag 
aggagtctttgaaggaagcaccctctgccccagtgatcctgccttcagcagaacagacga 
cccttgaagcttcaagcacaccagctgacatgcagaatatattggcagttctcttgagtc 
agctgatgaaaacccaagagccagcaggcagtctggaggaaaacaacagtgacaagaaca 
gtgggccacaggggccccgaagaactcccacaatgccacaggaggaggcagcagagaaga 
ggccccctgagccccccggacctccaccgccgccacctccaccccctctggttgaaggcg 
atctttccagcgccccccaggagttgaacccagccgtgacagccgccttgctgcaacttt 
tatcccagcctgaagcagagcctcctggccacctgccacatgagcaccaggccttgagac 
caatggagtactccacccgaccccgtccaaacaggacttatggaaacactgatgggcctg 
aaacagggttcagtgccattgacactgatgaacgaaactctggtccagccttgacagaat 
ccttggtccagaccctggtgaagaacaggaccttctcaggctctctgagccaccttgggg 
agtccagcagttaccagggcacagggtcagtgcagtttccaggggaccaggacctccgtt 
ttgccagggtccccttagcgttacacccggtggtcgggcaaccattcctgaaggctgagg 
gaagcagcaattctgtggtacatgcagagaccaaattgcaaaactatggggagctggggc 
caggaaccactggggccagcagctcaggagcaggccttcactgggggggcccaactcagt 
cttctgcttatggaaaactctatcgggggcctacaagagtcccaccaagagggggaagag 
ggagaggagttccttactaacccagagacttcagtgtcctgaaagattcctttcctatcc 
atccttccatccagttctctgaatctttaatgaaatcatttgccagagcgaggtaatcat 
ctgcatttggctactgcaaagctgtccgttgtattccttgctcacttgctactagcaggc 
gacttacgaaataatgatgttggcaccagttccccctggatgggctatagccagaacatt 
tacttcaactctaccttagtagatacaagtagagaatatggagaggatcattacattgaa 
aagtaaatgttttattagttcattgcctgcacttactgatcggaagagagaaagaacagt 
ttcagtattgagatggctcaggagaggctctttgatttttaaagttttggggtggggggt 
tgtgtgtggtttctttcttttgaattttaatttaggtgttttgggtttttttcctttaaa 
gagaatagtgttcacaaaatttgagctgctctttggcttttgctataagggaaacagagt 
ggcctggctgatttgaataaatgtttctttcctctccaccatctcacattttgcttttaa 
gtgaacactttttccccattgagcatcttgaacatactttttttccaaataaattactca 
tccttaaagtttactccactttgacaaaagatacgcccttctccctgcacataaagcagg 
ttgtagaacgtggcattcttgggcaagtaggtagactttacccagtctctttcctttttt 
gctgatgtgtgctctctctctctctttctctctctctctctctctctctctctctctgtc 
tgtctcgcttgctcgctctcgctgtttctctctctttgaggcatttgtttggaaaaaatc 
gttgagatgcccaagaacctgggataattctttactttttttgaaataaaggaaaggaaa 
ttcagactcttacattgttctctgtaactcttcaattctaaaatgttttgttttttaaac 
catgttctgatggggaagttgatttgtaagtgtggacagcttggacattgctgctgagct 
gtggttagagatgatgcctccattcctagagggctaataacagcatttagcatattgttt 
acacatatatttttatgtcaaaaaaaaaacaaaaacctttcaaacagagcattgtgatat 
tgtcaaagagaaaaacaaatcctgaagatacatggaaatgtaacctagtttagggtgggt 
atttttctgaagatacatcaatacctgaccttttttaaaaaaataattttaaaacagcat 
actgtgaggaagaacagtattgacatacccacatcccagcatgtgtaccctgccagttct 
tttagggatttttcctccaaagagatttggatttggttttggtaaaaggggttaaattgt 
gcttccaggcaagaactttgccttatcataaacaggaaatgaaaaagggaagggctgtca 
ggatgggataatttgggaggcttctcattctggcttctatttctatgtgagtaccagcat 
atagagtgttttaaaaacagatacatgtcatataatttatctgcacagacttagaccttc 
aggaaacataggttaagcccccttttacaaagaaaaagtaaacatacttcagcatcttgg 
agggtagttttcaaaactcaagtttcatgtttcaatgccaagttcttattttaaaaaata 
aaatctacttataagagaaaggtgcattacttaaaaaaaaaaaaactttaaagaaatgaa 
agaagaaccctcttcagatacttacttgaagactgttttcccctgttaatgagatatagc 
tagatatcggtgtgtgtatttctttattattctctggtttttgatctggccttgcctcca 
gggccaaacactgatttagaaagagagccttctagctattttggcattgatggcttttta 
taccagtgtgtccagttagatttactaggcttactgacatgctattggtaaatcgcatta 
aagttcatctgaaccttctgtctgttgacttcttagtcctcagacatgggcctttgtgtt 
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ttagaatatttgaatttgagttattgggccccactccctgttttttattaaagaacgtga 
gcctgggatactttcagaagtatctgttcaatgaaaaaaagttggtttcccatcaaatat 
gaataaaattctctatatatttcattgtattttggttatcagcagtcatcaataatgttt 
ttccctcccctctcccacctcttatttttaattatgccaaatatcctaaataatatactt 
aagcctccattccctcatccctactagggaagggggtgagtgtatgtgtgagtgtatgtg 
tatgtatgatcccatctcacccccacccccattttgggagtcttttaaaatgaaaacaaa 
gtttggtagttttgactatttctaaaagcagaggagaaaaaaaaacttatttaaatatcc 
tggaatctgtatggaggaagaaaaggtatttgttaatttttcagttacgttatctataaa 
catgatggaagtaaaggtttggcagaatttcaccttgactatttgaaaattacagaccca 
attaattccattcaaaagtggttttcgttttgttttaattattgtacaatgagagatatt 
gtctattaaatacattattttgaacagatgagaaatctgattctgttcatgagtgggagg 
caaaactggtttgaccgtgatcatttttgtggttttgaaaacaaatatacttgacccagt 
ttccttagttttttcttcaactgtccataggaacgataagtatttgaaagcaacatcaaa 
tctatacgtttaaagcagggcagttagcacaaatttgcaagtagaacttctattagctta 
tgccatagacatcacccaaccacttgtatgtgtgtgtgtatatataatatgcatatatag 
ttaccgtgctaaaatggttaccagcaggttttgagagagaatgctgcatcagaaaagtgt 
cagttgccacctcattctccctgatttaggttcctgacactgattcctttctctctcgtt 
tttgacccccattgggtgtatcttgtctatgtacagatattttgtaatatattaaatttt 
tttctttcagtttataaaaatggaaagtggagattggaaaattaaatatttcctgttact 
ata 

SEQ ID NO:324. >T60764_T8 # TY Transcript # LN 6220 # Source Gene: T60764 
# Encoded protein: T60764_P1 

cccggcgcccgggcctgaggactggctcggcggagggagaagaggaaacagacttgagca 
gctccccgttgtctcgcaactccactgccgaggaactctcatttcttccctcgctccttc 
accccccacctcatgtagaagggtgctgaggcgtcgggagggaggaggagcctgggctac 
cgtccctgccctccccacccccttcccggggcgctttggtgggcgtggagttggggttgg 
gggggtgggtgggggttgctttttggagtgctggggaacttttttcccttcttcaggtca 
ggggaaagggaatgcccaattcagagagacatgggggcaagaaggacgggagtggaggag 
cttctggaactttgcagccgtcatcgggaggcggcagctctaacagcagagagcgtcacc 
gcttggtatcgaagcacaagcggcataagtccaaacactccaaagacatggggttggtga 
cccccgaagcagcatccctgggcacagttatcaaacctttggtggagtatgatgatatca 
gctctgattccgacaccttctccgatgacatggccttcaaactagaccgaagggagaacg 
acgaacgtcgtggatcagatcggagcgaccgcctgcacaaacatcgtcaccaccagcaca 
ggcgttcccgggacttactaaaagctaaacagaccgaaaaagaaaaaagccaagaagtct 
ccagcaagtcgggatcgatgaaggaccggatatcgggaagttcaaagcgttcgaatgagg 
agactgatgactatgggaaggcgcaggtagccaaaagcagcagcaaggaatccaggtcat 
ccaagctccacaaggagaagaccaggaaagaacgggagctgaagtctgggcacaaagacc 
ggagtaaaagtcatcgaaaaagggaaacacccaaaagttacaaaacagtggacagcccaa 
aacggagatccaggagcccccacaggaagtggtctgacagctccaaacaagatgatagcc 
cctcgggagcttcttatggccaagattatgaccttagtccctcacgatctcatacctcga 
gcaattatgactcctacaagaaaagtcctggaagtacctcgagaaggcagtcggtcagtc 
ccccttacaaggagccttcggcctaccagtccagcacccggtcaccgagcccctacagta 
ggcgacagagatctgtcagtccctatagcaggagacggtcgtccagctacgaaagaagtg 
gctcttacagcgggcgatcgcccagtccctatggtcgaaggcggtccagcagccctttcc 
tgagcaagcggtctctgagtcggagtccactccccagtaggaaatccatgaagtccagaa 
gtagaagtcctgcatattcaagacattcatcttctcatagtaaaaagaagagatccagtt 
cacgcagtcgtcattccagtatctcacctgtcaggcttccacttaattccagtctgggag 
ctgaactcagtaggaaaaagaaggaaagagcagctgctgctgctgcagcaaagatggatg 
gaaaggagtccaagggttcacctgtatttttgcctagaaaagagaacagttcagtagagg 
ctaaggattcaggtttggagtctaaaaagttacccagaagtgtaaaattggaaaaatctg 
ccccagatactgaactggtgaatgtaacacatctaaacacagaggtaaaaaattcttcag 
atacagggaaagtaaagttggatgagaactccgagaagcatcttgttaaagatttgaaag 
cacagggaacaagagactctaaacccatagcactgaaagaggagattgttactccaaagg 
agacagaaacatcagaaaaggagacccctccacctcttcccacaattgcttctcccccac 
cccctctaccaactactacccctccacctcagacaccccctttgccacctttgcctccaa 
taccagctcttccacagcaaccacctctgcctccttctcagccagcatttagtcaggttc 
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ctgcttccagtacttcaactttgcccccttctactcactcaaagacatctgctgtgtcct 
ctcaggcaaattctcagccccctgtacaggtttctgtgaagactcaagtatctgtaacag 
ctgctattccacacctgaaaacttcaacgttgcctcctttgcccctcccacccttattac 
ctggagatgatgacatggatagtccaaaagaaactcttccttcaaaacctgtgaagaaag 
agaaggaacagaggacacgtcacttactcacagaccttcctctccctccagagctccctg 
gtggagatctgtctcccccagactctccagaaccaaaggcaatcacaccacctcagcaac 
catataaaaagagaccaaaaatttgttgtcctcgttatggagaaagaagacaaacagaaa 
gcgactgggggaaacgctgtgtggacaagtttgacattattgggattattggagaaggaa 
cctatggccaagtatataaagccaaggacaaagacacaggagaactagtggctctgaaga 
aggtgagactagacaatgagaaagagggcttcccaatcacagccattcgtgaaatcaaaa 
tccttcgtcagttaatccaccgaagtgttgttaacatgaaggaaattgtcacagataaac 
aagatgcactggatttcaagaaggacaaaggtgccttttaccttgtatttgagtatatgg 
accatgacttaatgggactgctagaatctggtttggtgcacttttctgaggaccatatca 
agtcgttcatgaaacagctaatggaaggattggaatactgtcacaaaaagaatttcctgc 
atcgggatattaagtgttctaacattttgctgaataacagtgggcaaatcaaactagcag 
attttggacttgctcggctctataactctgaagagagtcgcccttacacaaacaaagtca 
ttactttgtggtaccgacctccagaactactgctaggagaggaacgttacacaccagcca 
tagatgtttggagctgtggatgtattcttggggaactattcacaaagaagcctatttttc 
aagccaatctggaactggctcagctagaactgatcagccgactttgtggtagcccttgtc 
cagctgtgtggcctgatgttatcaaactgccctacttcaacaccatgaaaccgaagaagc 
aatatcgaaggcgtctacgagaagaattctctttcattccttctgcagcacttgatttat 
tggaccacatgctgacactagatcctagtaagcggtgcacagctgaacagaccctacaga 
gcgacttccttaaagatgtcgaactcagcaaaatggctcctccagacctcccccactggc 
aggattgccatgagttgtggagtaagaaacggcgacgtcagcgacaaagtggtgttgtag 
tcgaagagccacctccatccaaaacttctcgaaaagaaactacctcagggacaagtactg 
agcctgtgaagaacagcagcccagcaccacctcagcctgctcctggcaaggtggagtctg 
gggctggggatgcaataggccttgctgacatcacacaacagctgaatcaaagtgaattgg 
cagtgttattaaacctgctgcagagccaaaccgacctgagcatccctcaaatggcacagc 
tgcttaacatccactccaacccagagatgcagcagcagctggaagccctgaaccaatcca 
tcagtgccctgacggaagctacttcccagcagcaggactcagagaccatggccccagagg 
agtctttgaaggaagcaccctctgccccagtgatcctgccttcagcagaacagacgaccc 
ttgaagcttcaagcacaccagctgacatgcagaatatattggcagttctcttgagtcagc 
tgatgaaaacccaagagccagcaggcagtctggaggaaaacaacagtgacaagaacagtg 
ggccacaggggccccgaagaactcccacaatgccacaggaggaggcagcagagaagaggc 
cccctgagccccccggacctccaccgccgccacctccaccccctctggttgaaggcgatc 
tttccagcgccccccaggagttgaacccagccgtgacagccgccttgctgcaacttttat 
cccagcctgaagcagagcctcctggccacctgccacatgagcaccaggccttgagaccaa 
tggagtactccacccgaccccgtccaaacaggacttatggaaacactgatgggcctgaaa 
cagggttcagtgccattgacactgatgaacgaaactctggtccagccttgacagaatcct 
tggtccagaccctggtgaagaacaggaccttctcaggctctctgagccaccttggggagt 
ccagcagttaccagggcacagggtcagtgcagtttccaggggaccaggacctccgttttg 
ccagggtccccttagcgttacacccggtggtcgggcaaccattcctgaaggctgagggaa 
gcagcaattctgtggtacatgcagagaccaaattgcaaaactatggggagctggggccag 
gaaccactggggccagcagctcaggagcaggccttcactgggggggcccaactcagtctt 
ctgcttatggaaaactctatcgggggcctacaagagtcccaccaagagggggaagaggga 
gaggagttccttactaacccagagacttcagtgtcctgaaagattcctttcctatccatc 
cttccatccagttctctgaatctttaatgaaatcatttgccagagcgaggtaatcatctg 
catttggctactgcaaagctgtccgttgtattccttgctcacttgctactagcaggcgac 
ttacgaaataatgatgttggcaccagttccccctggatgggctatagccagaacatttac 
ttcaactctaccttagtagatacaagtagagaatatggagaggatcattacattgaaaag 
taaatgttttattagttcattgcctgcacttactgatcggaagagagaaagaacagtttc 
agtattgagatggctcaggagaggctctttgatttttaaagttttggggtggggggttgt 
gtgtggtttctttcttttgaattttaatttaggtgttttgggtttttttcctttaaagag 
aatagtgttcacaaaatttgagctgctctttggcttttgctataagggaaacagagtggc 
ctggctgatttgaataaatgtttctttcctctccaccatctcacattttgcttttaagtg 
aacactttttccccattgagcatcttgaacatactttttttccaaataaattactcatcc 
ttaaagtttactccactttgacaaaagatacgcccttctccctgcacataaagcaggttg 
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tagaacgtggcattcttgggcaagtaggtagactttacccagtctctttccttttttgct 
gatgtgtgctctctctctctctttctctctctctctctctctctctctctctctgtctgt 
ctcgcttgctcgctctcgctgtttctctctctctctgtctgtctcgcgcgcgctctctct 
tgttctctctctggagggtgtgtgtggaaagatctcggagagatgcgcgaagagcgagga 
gatatttactatagttagtgagaatataagaggaaaagatataccagactcttagtattg 
ttctgctgtatctcgtaaaactactagagtggttgtgttttagaacagctatgttgtgat 
ggggatgtttgttgtgtgatgtgtggagcgctgtggatattgatgccagactgaggatag 
agaagatgctatcatatagagagggggaacgaacgattgcagatagtggacacatatatt 
tttgttgtgtaaaaaaaaaaaaggttttaaaaacaaggcgagttgggattgttgcaagag 
aaaaaccatcgtcgaatcttcgggaatgtaccatcggcgggggtattggaaagaacaaac 
tgcgggttgtaaaacttttacacggggtggggagacaatttttgttcctctgagttgccg 
gagcttgagggttcccaaaatgggttgtgaagggatatgtcacacacctcttttaaaaaa 
gggggggggtatgggctcttctttgaaatgagaattcacg 

SEQ ID NO: 325, >T60764_T9 # TY Transcript # LN 5816 # Source Gene: T60764 
# Encoded protein: T60764_P1 

cccggcgcccgggcctgaggactggctcggcggagggagaagaggaaacagacttgagca 
gctccccgttgtctcgcaactccactgccgaggaactctcatttcttccctcgctccttc 
accccccacctcatgtagaagggtgctgaggcgtcgggagggaggaggagcctgggctac 
cgtccctgccctccccacccccttcccggggcgctttggtgggcgtggagttggggttgg 
gggggtgggtgggggttgctttttggagtgctggggaacttttttcccttcttcaggtca 
ggggaaagggaatgcccaattcagagagacatgggggcaagaaggacgggagtggaggag 
cttctggaactttgcagccgtcatcgggaggcggcagctctaacagcagagagcgtcacc 
gcttggtatcgaagcacaagcggcataagtccaaacactccaaagacatggggttggtga 
cccccgaagcagcatccctgggcacagttatcaaacctttggtggagtatgatgatatca 
gctctgattccgacaccttctccgatgacatggccttcaaactagaccgaagggagaacg 
acgaacgtcgtggatcagatcggagcgaccgcctgcacaaacatcgtcaccaccagcaca 
ggcgttcccgggacttactaaaagctaaacagaccgaaaaagaaaaaagccaagaagtct 
ccagcaagtcgggatcgatgaaggaccggatatcgggaagttcaaagcgttcgaatgagg 
agactgatgactatgggaaggcgcaggtagccaaaagcagcagcaaggaatccaggtcat 
ccaagctccacaaggagaagaccaggaaagaacgggagctgaagtctgggcacaaagacc 
ggagtaaaagtcatcgaaaaagggaaacacccaaaagttacaaaacagtggacagcccaa 
aacggagatccaggagcccccacaggaagtggtctgacagctccaaacaagatgatagcc 
cctcgggagcttcttatggccaagattatgaccttagtccctcacgatctcatacctcga 
gcaattatgactcctacaagaaaagtcctggaagtacctcgagaaggcagtcggtcagtc 
ccccttacaaggagccttcggcctaccagtccagcacccggtcaccgagcccctacagta 
ggcgacagagatctgtcagtccctatagcaggagacggtcgtccagctacgaaagaagtg 
gctcttacagcgggcgatcgcccagtccctatggtcgaaggcggtccagcagccctttcc 
tgagcaagcggtctctgagtcggagtccactccccagtaggaaatccatgaagtccagaa 
gtagaagtcctgcatattcaagacattcatcttctcatagtaaaaagaagagatccagtt 
cacgcagtcgtcattccagtatctcacctgtcaggcttccacttaattccagtctgggag 
ctgaactcagtaggaaaaagaaggaaagagcagctgctgctgctgcagcaaagatggatg 
gaaaggagtccaagggttcacctgtatttttgcctagaaaagagaacagttcagtagagg 
ctaaggattcaggtttggagtctaaaaagttacccagaagtgtaaaattggaaaaatctg 
ccccagatactgaactggtgaatgtaacacatctaaacacagaggtaaaaaattcttcag 
atacagggaaagtaaagttggatgagaactccgagaagcatcttgttaaagatttgaaag 
cacagggaacaagagactctaaacccatagcactgaaagaggagattgttactccaaagg 
agacagaaacatcagaaaaggagacccctccacctcttcccacaattgcttctcccccac 
cccctctaccaactactacccctccacctcagacaccccctttgccacctttgcctccaa 
taccagctcttccacagcaaccacctctgcctccttctcagccagcatttagtcaggttc 
ctgcttccagtacttcaactttgcccccttctactcactcaaagacatctgctgtgtcct 
ctcaggcaaattctcagccccctgtacaggtttctgtgaagactcaagtatctgtaacag 
ctgctattccacacctgaaaacttcaacgttgcctcctttgcccctcccacccttattac 
ctggagatgatgacatggatagtccaaaagaaactcttccttcaaaacctgtgaagaaag 
agaaggaacagaggacacgtcacttactcacagaccttcctctccctccagagctccctg 
gtggagatctgtctcccccagactctccagaaccaaaggcaatcacaccacctcagcaac 
catataaaaagagaccaaaaatttgttgtcctcgttatggagaaagaagacaaacagaaa 
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gcgactgggggaaacgctgtgtggacaagtttgacattattgggattattggagaaggaa 
cctatggccaagtatataaagccaaggacaaagacacaggagaactagtggctctgaaga 
aggtgagactagacaatgagaaagagggcttcccaatcacagccattcgtgaaatcaaaa 
tccttcgtcagttaatccaccgaagtgttgttaacatgaaggaaattgtcacagataaac 
aagatgcactggatttcaagaaggacaaaggtgccttttaccttgtatttgagtatatgg 
accatgacttaatgggactgctagaatctggtttggtgcacttttctgaggaccatatca 
agtcgttcatgaaacagctaatggaaggattggaatactgtcacaaaaagaatttcctgc 
atcgggatattaagtgttctaacattttgctgaataacagtgggcaaatcaaactagcag 
attttggacttgctcggctctataactctgaagagagtcgcccttacacaaacaaagtca 
ttactttgtggtaccgacctccagaactactgctaggagaggaacgttacacaccagcca 
tagatgtttggagctgtggatgtattcttggggaactattcacaaagaagcctatttttc 
aagccaatctggaactggctcagctagaactgatcagccgactttgtggtagcccttgtc 
cagctgtgtggcctgatgttatcaaactgccctacttcaacaccatgaaaccgaagaagc 
aatatcgaaggcgtctacgagaagaattctctttcattccttctgcagcacttgatttat 
tggaccacatgctgacactagatcctagtaagcggtgcacagctgaacagaccctacaga 
gcgacttccttaaagatgtcgaactcagcaaaatggctcctccagacctcccccactggc 
aggattgccatgagttgtggagtaagaaacggcgacgtcagcgacaaagtggtgttgtag 
tcgaagagccacctccatccaaaacttctcgaaaagaaactacctcagggacaagtactg 
agcctgtgaagaacagcagcccagcaccacctcagcctgctcctggcaaggtggagtctg 
gggctggggatgcaataggccttgctgacatcacacaacagctgaatcaaagtgaattgg 
cagtgttattaaacctgctgcagagccaaaccgacctgagcatccctcaaatggcacagc 
tgcttaacatccactccaacccagagatgcagcagcagctggaagccctgaaccaatcca 
tcagtgccctgacggaagctacttcccagcagcaggactcagagaccatggccccagagg 
agtctttgaaggaagcaccctctgccccagtgatcctgccttcagcagaacagacgaccc 
ttgaagcttcaagcacaccagctgacatgcagaatatattggcagttctcttgagtcagc 
tgatgaaaacccaagagccagcaggcagtctggaggaaaacaacagtgacaagaacagtg 
ggccacaggggccccgaagaactcccacaatgccacaggaggaggcagcagagaagaggc 
cccctgagccccccggacctccaccgccgccacctccaccccctctggttgaaggcgatc 
tttccagcgccccccaggagttgaacccagccgtgacagccgccttgctgcaacttttat 
cccagcctgaagcagagcctcctggccacctgccacatgagcaccaggccttgagaccaa 
tggagtactccacccgaccccgtccaaacaggacttatggaaacactgatgggcctgaaa 
cagggttcagtgccattgacactgatgaacgaaactctggtccagccttgacagaatcct 
tggtccagaccctggtgaagaacaggaccttctcaggctctctgagccaccttggggagt 
ccagcagttaccagggcacagggtcagtgcagtttccaggggaccaggacctccgttttg 
ccagggtccccttagcgttacacccggtggtcgggcaaccattcctgaaggctgagggaa 
gcagcaattctgtggtacatgcagagaccaaattgcaaaactatggggagctggggccag 
gaaccactggggccagcagctcaggagcaggccttcactgggggggcccaactcagtctt 
ctgcttatggaaaactctatcgggggcctacaagagtcccaccaagagggggaagaggga 
gaggagttccttactaacccagagacttcagtgtcctgaaagattcctttcctatccatc 
cttccatccagttctctgaatctttaatgaaatcatttgccagagcgaggtaatcatctg 
catttggctactgcaaagctgtccgttgtattccttgctcacttgctactagcaggcgac 
ttacgaaataatgatgttggcaccagttccccctggatgggctatagccagaacatttac 
ttcaactctaccttagtagatacaagtagagaatatggagaggatcattacattgaaaag 
taaatgttttattagttcattgcctgcacttactgatcggaagagagaaagaacagtttc 
agtattgagatggctcaggagaggctctttgatttttaaagttttggggtggggggttgt 
gtgtggtttctttcttttgaattttaatttaggtgttttgggtttttttcctttaaagag 
aatagtgttcacaaaatttgagctgctctttggcttttgctataagggaaacagagtggc 
ctggctgatttgaataaatgtttctttcctctccaccatctcacattttgcttttaagtg 
aacactttttccccattgagcatcttgaacatactttttttccaaataaattactcatcc 
ttaaagtttactccactttgacaaaagatacgcccttctccctgcacataaagcaggttg 
tagaacgtggcattcttgggcaagtaggtagactttacccagtctctttccttttttgct 
gatgtgtgctctctctctctctttctctctctctctctctctctctctctctctgtctgt 
ctcgcttgctcgctctcgctgtttctctctctttgaggcatttgtttggaaaaaattact 
catccttaaagtttactccacttggacaaagataggcccttctccctggacataaagcag 
gttgacaccgtggctttcttggggccagtagtacgatttaaccaagttcttacttatggc 
ggagggggcacgccaccacattgcgaaactaacactcaaaacaggagagagatgcc 



212/365 



Figure 1 (Cont'd) 



SEQ ID NO:326. >T60764JT10 # TY Transcript ft LN 4034 # Source Gene: T60764 
# Encoded protein: T60764_P4 

cccggcgcccgggcctgaggactggctcggcggagggagaagaggaaacagacttgagca 
gctccccgttgtctcgcaactccactgccgaggaactctcatttcttccctcgctccttc 
accccccacctcatgtagaagggtgctgaggcgtcgggagggaggaggagcctgggctac 
cgtccctgccctccccacccccttcccggggcgctttggtgggcgtggagttggggttgg 
gggggtgggtgggggttgctttttggagtgctggggaacttttttcccttcttcaggtca 
ggggaaagggaatgcccaattcagagagacatgggggcaagaaggacgggagtggaggag 
cttctggaactttgcagccgtcatcgggaggcggcagctctaacagcagagagcgtcacc 
gcttggtatcgaagcacaagcggcataagtccaaacactccaaagacatggggttggtga 
cccccgaagcagcatccctgggcacagttatcaaacctttggtggagtatgatgatatca 
gctctgattccgacaccttctccgatgacatggccttcaaactagaccgaagggagaacg 
acgaacgtcgtggatcagatcggagcgaccgcctgcacaaacatcgtcaccaccagcaca 
ggcgttcccgggacttactaaaagctaaacagaccgaaaaagaaaaaagccaagaagtct 
ccagcaagtcgggatcgatgaaggaccggatatcgggaagttcaaagcgttcgaatgagg 
agactgatgactatgggaaggcgcaggtagccaaaagcagcagcaaggaatccaggtcat 
ccaagctccacaaggagaagaccaggaaagaacgggagctgaagtctgggcacaaagacc 
ggagtaaaagtcatcgaaaaagggaaacacccaaaagttacaaaacagtggacagcccaa 
aacggagatccaggagcccccacaggaagtggtctgacagctccaaacaagatgatagcc 
cctcgggagcttcttatggccaagattatgaccttagtccctcacgatctcatacctcga 
gcaattatgactcctacaagaaaagtcctggaagtacctcgagaaggcagtcggtcagtc 
ccccttacaaggagccttcggcctaccagtccagcacccggtcaccgagcccctacagta 
ggcgacagagatctgtcagtccctatagcaggagacggtcgtccagctacgaaagaagtg 
gctcttacagcgggcgatcgcccagtccctatggtcgaaggcggtccagcagccctttcc 
tgagcaagcggtctctgagtcggagtccactccccagtaggaaatccatgaagtccagaa 
gtagaagtcctgcatattcaagacattcatcttctcatagtaaaaagaagagatccagtt 
cacgcagtcgtcattccagtatctcacctgtcaggcttccacttaattccagtctgggag 
ctgaactcagtaggaaaaagaaggaaagagcagctgctgctgctgcagcaaagatggatg 
gaaaggagtccaagggttcacctgtatttttgcctagaaaagagaacagttcagtagagg 
ctaaggattcaggtttggagtctaaaaagttacccagaagtgtaaaattggaaaaatctg 
ccccagatactgaactggtgaatgtaacacatctaaacacagaggtaaaaaattcttcag 
atacagggaaagtaaagttggatgagaactccgagaagcatcttgttaaagatttgaaag 
cacagggaacaagagactctaaacccatagcactgaaagaggagattgttactccaaagg 
agacagaaacatcagaaaaggagacccctccacctcttcccacaattgcttctcccccac 
cccctctaccaactactacccctccacctcagacaccccctttgccacctttgcctccaa 
taccagctcttccacagcaaccacctctgcctccttctcagccagcatttagtcaggttc 
ctgcttccagtacttcaactttgcccccttctactcactcaaagacatctgctgtgtcct 
ctcaggcaaattctcagccccctgtacaggtttctgtgaagactcaagtatctgtaacag 
ctgctattccacacctgaaaacttcaacgttgcctcctttgcccctcccacccttattac 
ctggagatgatgacatggatagtccaaaagaaactcttccttcaaaacctgtgaagaaag 
agaaggaacagaggacacgtcacttactcacagaccttcctctccctccagagctccctg 
gtggagatctgtctcccccagactctccagaaccaaaggcaatcacaccacctcagcaac 
catataaaaagagaccaaaaatttgttgtcctcgttatggagaaagaagacaaacagaaa 
gcgactgggggaaacgctgtgtggacaagtttgacattattgggattattggagaaggaa 
cctatggccaagtatataaagccaaggacaaagacacaggagaactagtggctctgaaga 
aggtgagactagacaatgagaaagagggcttcccaatcacagccattcgtgaaatcaaaa 
tccttcgtcagttaatccaccgaagtgttgttaacatgaaggaaattgtcacagataaac 
aagatgcactggatttcaagaaggacaaaggtgccttttaccttgtatttgagtatatgg 
accatgacttaatgggactgctagaatctggtttggtgcacttttctgaggaccatatca 
agtcgttcatgaaacagctaatggaaggattggaatactgtcacaaaaagaatttcctgc 
atcgggatattaagtgttctaacattttgctgaataacagtgggcaaatcaaactagcag 
attttggacttgctcggctctataactctgaagagagtcgcccttacacaaacaaagtca 
ttactttgtggtaccgacctccagaactactgctaggagaggaacgttacacaccagcca 
tagatgtttggagctgtggatgtattcttggggaactattcacaaagaagcctatttttc 
aagccaatctggaactggctcagctagaactgatcagccgactttgtggtagcccttgtc 
cagctgtgtggcctgatgttatcaaactgccctacttcaacaccatgaaaccgaagaagc 
aatatcgaaggcgtctacgagaagaattctctttcattccttctgcagcacttgatttat 
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tggaccacatgctgacactagatcctagtaagcggtgcacagctgaacagaccctacaga 
gcgacttccttaaagatgtcgaactcagcaaaatggctcctccagacctcccccactggc 
aggattgccatgagttgtggagtaagaaacggcgacgtcagcgacaaagtggtgttgtag 
tcgaagagccacctccatccaaaacttctcgaaaagaaactacctcagggacaagtactg 
agcctgtgaagaacagcagcccagcaccacctcagcctgctcctggcaaggtggagtctg 
gggctggggatgcaataggccttgctgacatcacacaacagctgaatcaaagtgaattgg 
cagtgttattaaacctgctgcagagccaaaccgacctgagcatccctcaaatggcacagc 
tgcttaacatccactccaacccagagatgcagcagcagctggaagccctgaaccaatcca 
tcagtgccctgacggaagctacttcccagcagcaggactcagagaccatggccccagagg 
agtctttgaaggaagcaccctctgccccagtgatcctgccttcagcagaacagacgaccc 
ttgaagcttcaagcacaccagctgacatgcagaatatattggcagttctcttgagtcagc 
tgatgaaaacccaagagccagcaggcagtctggagttggaggcctctggagttgggcagg 
cctctggagttggg 

SEQ ID NO: 327. >T60764_T11 # TY Transcript # LN 3046 # Source Gene: T60764 
# Encoded protein: T60764_P5 

cccggcgcccgggcctgaggactggctcggcggagggagaagaggaaacagacttgagca 
gctccccgttgtctcgcaactccactgccgaggaactctcatttcttccctcgctccttc 
accccccacctcatgtagaagggtgctgaggcgtcgggagggaggaggagcctgggctac 
cgtccctgccctccccacccccttcccggggcgctttggtgggcgtggagttggggttgg 
gggggtgggtgggggttgctttttggagtgctggggaacttttttcccttcttcaggtca 
ggggaaagggaatgcccaattcagagagacatgggggcaagaaggacgggagtggaggag 
cttctggaactttgcagccgtcatcgggaggcggcagctctaacagcagagagcgtcacc 
gcttggtatcgaagcacaagcggcataagtccaaacactccaaagacatggggttggtga 
cccccgaagcagcatccctgggcacagttatcaaacctttggtggagtatgatgatatca 
gctctgattccgacaccttctccgatgacatggccttcaaactagaccgaagggagaacg 
acgaacgtcgtggatcagatcggagcgaccgcctgcacaaacatcgtcaccaccagcaca 
ggcgttcccgggacttactaaaagctaaacagaccgaaaaagaaaaaagccaagaagtct 
ccagcaagtcgggatcgatgaaggaccggatatcgggaagttcaaagcgttcgaatgagg 
agactgatgactatgggaaggcgcaggtagccaaaagcagcagcaaggaatccaggtcat 
ccaagctccacaaggagaagaccaggaaagaacgggagctgaagtctgggcacaaagacc 
ggagtaaaagtcatcgaaaaagggaaacacccaaaagttacaaaacagtggacagcccaa 
aacggagatccaggagcccccacaggaagtggtctgacagctccaaacaagatgatagcc 
cctcgggagcttcttatggccaagattatgaccttagtccctcacgatctcatacctcga 
gcaattatgactcctacaagaaaagtcctggaagtacctcgagaaggcagtcggtcagtc 
ccccttacaaggagccttcggcctaccagtccagcacccggtcaccgagcccctacagta 
ggcgacagagatctgtcagtccctatagcaggagacggtcgtccagctacgaaagaagtg 
gctcttacagcgggcgatcgcccagtccctatggtcgaaggcggtccagcagccctttcc 
tgagcaagcggtctctgagtcggagtccactccccagtaggaaatccatgaagtccagaa 
gtagaagtcctgcatattcaagacattcatcttctcatagtaaaaagaagagatccagtt 
cacgcagtcgtcattccagtatctcacctgtcaggcttccacttaattccagtctgggag 
ctgaactcagtaggaaaaagaaggaaagagcagctgctgctgctgcagcaaagatggatg 
gaaaggagtccaagggttcacctgtatttttgcctagaaaagagaacagttcagtagagg 
ctaaggattcaggtttggagtctaaaaagttacccagaagtgtaaaattggaaaaatctg 
ccccagatactgaactggtgaatgtaacacatctaaacacagaggtaaaaaattcttcag 
atacagggaaagtaaagttggatgagaactccgagaagcatcttgttaaagatttgaaag 
cacagggaacaagagactctaaacccatagcactgaaagaggagattgttactccaaagg 
agacagaaacatcagaaaaggagacccctccacctcttcccacaattgcttctcccccac 
cccctctaccaactactacccctccacctcagacaccccctttgccacctttgcctccaa 
taccagctcttccacagcaaccacctctgcctccttctcagccagcatttagtcaggttc 
ctgcttccagtacttcaactttgcccccttctactcactcaaagacatctgctgtgtcct 
ctcaggcaaattctcagccccctgtacaggtttctgtgaagactcaagtatctgtaacag 
ctgctattccacacctgaaaacttcaacgttgcctcctttgcccctcccacccttattac 
ctggagatgatgacatggatagtccaaaagaaactcttccttcaaaacctgtgaagaaag 
agaaggaacagaggacacgtcacttactcacagaccttcctctccctccagagctccctg 
gtggagatctgtctcccccagactctccagaaccaaaggcaatcacaccacctcagcaac 
catataaaaagagaccaaaaatttgttgtcctcgttatggagaaagaagacaaacagaaa 
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gcgactgggggaaacgctgtgtggacaagtttgacattattgggattattggagaaggaa 
cctatggccaagtatataaagccaaggacaaagacacaggagaactagtggctctgaaga 
aggtgagactagacaatgagaaagagggcttcccaatcacagccattcgtgaaatcaaaa 
tccttcgtcagttaatccaccgaagtgttgttaacatgaaggaaattgtcacagataaac 
aagatgcactggatttcaagaaggacaaaggtgccttttaccttgtatttgagtatatgg 
accatgacttaatgggactgctagaatctggtttggtgcacttttctgaggaccatatca 
agtcgttcatgaaacagctaatggaaggattggaatactgtcacaaaaagaatttcctgc 
atcgggatattaagtgttctaacattttgctgaataacaggtaacatagtaaccaaataa 
gattaagcactttcctcttctcctctgacctttttagtttcaaatggttaattggtatta 
taattagacctaatagtgcagtattcacacactatttagtgatatt 

SEQ ID NO: 328 . >T60764JT12 # TY Transcript # LN 499 # Source Gene: T60764 

# Encoded protein: 

gtttattattaggaaggactaattattcaactactcttcgtttcagtttcttcatctgta 
aaattcaagggttttttctagagaacctgtggtcaactccaaaattatattaggacttga 
ggcattgttatttcagcattcttttttttttcttgcttttaattttttttcgtctttatg 
taggtgccttttaccttgtatttgagtatatggaccatgacttaatgggactgctagaat 
ctggtttggtgcacttttctgaggaccatatcaagtcgttcatgaaacagctaatggaag 
gattggaatactgtcacaaaaagaatttcctgcatcgggatattaagtgttctaacattt 
tgctgaataacaggtaacatagtaaccaaataagattaagcactttcctcttctcctctg 
acctttttagtttcaaatggttaattggtattataattagacctaatagtgcagtattca 
cacactatttagtgatatt 

SEQ ID NO: 329. >T60764_T13 # TY Transcript # LN 732 # Source Gene: T60764 

# Encoded protein: 

gctgctgatgaagaaatagatcccagtacatatcctttacgttttgttcatcaactttga 
tagacagcatagaggaaatgtggaactggtctccatgtaaatttgtctctagggaatgta 
tgatagacctacttgtccagattacattaagtttgctgtcctgcaaacctcctttctcac 
agtcaggatccttttttgccttagagtccttcctgaaggagctgttatggatgagagagc 
cacaagtccaacctcatgatctgatctaaaactgggtggcctcctcccagtcaccttctt 
ttattagccttatattaaccattaaccttcattgatctttatgattctgggtttcacagc 
ttctgtttctgttgaattccctaactttaagaaccatgtttttgcttttatcagcctggg 
agctaggtccctggagacaagaattttagaacagttatgactctctctgggagtgtacta 
aggttagtctgaatccttctggggaattcaggtttgaaggagttagaagagtgctaacat 
caccgtcactgttacctaccaatggatgcttcctgggtgcaagaccaaagatgttctttg 
gtgtctagtcccttgtctcatgtctaatgtaacctgccattttcctttacaggtaggagc 
aacggtgggaatgccctctgaggaatgcagtgatgtccatcagtcacctctcacccagag 
cctggcaaattc 

SEQ ID NO: 330. >T60764_P1 # TY Protein # CC #LN 1481 # Source Gene: 

T607 64 # Encoding Transcript: 1 

MPNSERHGGKKDGSGGASGTLQPSSGGGSSNSRERHRLVSKHKRHKSKHSKDMGLVTPEA 
ASLGTVIKPLVEYDDISSDSDTFSDDMAFKLDRRENDERRGSDRSDRLHKHRHHQHRRSR 
DLLKAKQTEKEKSQEVSSKSGSMKDRISGSSKRSNEETDDYGKAQVAKSSSKESRSSKLH 
KEKTRKERELKSGHKDRSKSHRKRETPKSYKTVDSPKRRSRSPHRKWSDSSKQDDSPSGA 
SYGQDYDLSPSRSHTSSNYDSYKKSPGSTSRRQSVSPPYKEPSAYQSSTRSPSPYSRRQR 
SVSPYSRRRSSSYERSGSYSGRSPSPYGRRRSSSPFLSKRSLSRSPLPSRKSMKSRSRSP 
AYSRHSSSHSKKKRSSSRSRHSSISPVRLPLNSSLGAELSRKKKERAAAAAAAKMDGKES 
KGSPVFLPRKENSSVEAKDSGLESKKLPRSVKLEKSAPDTELVNVTHLNTEVKNSSDTGK 
VKLDENSEKHLVKDLBCAQGTRDSKPIALKEEIVTPKETETSEKETPPPLPTIASPPPPLP 
TTTPPPQTPPLPPLPPIPALPQQPPLPPSQPAFSQVPASSTSTLPPSTHSKTSAVSSQAN 
SQPPVQVSVKTQVSVTAAIPHLKTSTLPPLPLPPLLPGDDDMDSPKETLPSKPVKKEKEQ 
RTRHLLTDLPLPPELPGGDLSPPDSPEPKAITPPQQPYKKRPKICCPRYGERRQTESDWG 
KRCVDKFDIIGIIGEGTYGQVYKAKDKDTGELVALKKVRLDNEKEGFPITAIREIKILRQ 
LIHRSWNMKEIVTDKQDALDFKKDKGAFYLVFEYMDHDLMGLLESGLVHFSEDHIKSFM 
KQLMEGLEYCHKKNFLHRDIKCSNILLNNSGQIKLADFGLARLYNSEESRPYTNKVITLW 
YRPPELLLGEERYTPAI DVWSCGCI LGELFTKKPI FQANLELAQLELI SRLCGSPCPAVW 
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PDVIKLPYFNTMKPKKQYRRRLREEFSFIPSAALDLLDHMLTLDPSKRCTAEQTLQSDFL 
KDVELSKMAPPDLPHWQDCHELWSKKRRRQRQSGVWEEPPPSKTSRKETTSGTSTEPVK 
NSS PAPPQPAPGKVESGAGDAIGLADI TQQLNQSELAVLLNLLQSQTDLSI PQMAQLLNI 
HSNPEMQQQLEALNQSISALTEATSQQQDSETMAPEESLKEAPSAPVILPSAEQTTLEAS 
STPADMQNILAVLLSQLMKTQEPAGSLEENNSDKNSGPQGPRRTPTMPQEEAAEKRPPEP 
PGPPPPPPPPPLVEGDLSSAPQELNPAVTAALLQLLSQPEAEPPGHLPHEHQALRPMEYS 
TRPRPNRTYGNTDGPETGFSAIDTDERNSGPALTESLVQTLVKNRTFSGSLSHLGESSSY 
QGTGSVQFPGDQDLRFARVPLALHPWGQPFLKAEGSSNSVVHAETKLQNYGELGPGTTG 
ASSSGAGLHWGGPTQSSAYGKLYRGPTRVPPRGGRGRGVPY 

SEQ ID NO:331. >T60764_P6 # TY Protein # CC #LN 1490 # Source Gene: 

T607 64 # Encoding Transcript: 3 

MPNSERHGGKKDGSGGASGTLQPSSGGGSSNSRERHRLVSKHKRHKSKHSKDMGLVTPEA 
ASLGTVIKPLVEYDDISSDSDTFSDDMAFKLDRRENDERRGSDRSDRLHKHRHHQHRRSR 
DLLKAKQTEKEKSQEVSSKSGSMKDRISGSSKRSNEETDDYGKAQVAKSSSKESRSSKLH 
KEKTRKERELKSGHKDRSKSHRKRETPKSYKTVDSPKRRSRSPHRKWSDSSKQDDSPSGA 
SYGQDYDLSPSRSHTSSNYDSYKKSPGSTSRRQSVSPPYKEPSAYQSSTRSPSPYSRRQR 
SVSPYSRRRSSSYERSGSYSGRSPSPYGRRRSSSPFLSKRSLSRSPLPSRKSMKSRSRSP 
AYSRHSSSHSKKKRSSSRSRHSSISPVRLPLNSSLGAELSRKKKERAAAAAAAKMDGKES 
KGSPVFLPRKENSSVEAKDSGLESKKLPRSVKLEKSAPDTELVNVTHLNTEVKNSSDTGK 
VKLDENSEKHLVKDLKAQGTRDSKPIALKEEIVTPKETETSEKETPPPLPTIASPPPPLP 
TTTPPPQTPPLPPLPPIPALPQQPPLPPSQPAFSQVPASSTSTLPPSTHSKTSAVSSQAN 
SQPPVQVSVKTQVSVTAAIPHLKTSTLPPLPLPPLLPGGDDMDSPKETLPSKPVKKEKEQ 
RTRHLLTDLPLPPELPGGDLSPPDSPEPKAITPPQQPYKKRPKICCPRYGERRQTESDWG 
KRCVDKFDIIGIIGEGTYGQVYKARDKDTGELVALKKVRLDNEKEGFPITAIREIKILRQ 
LIHRSVVNMKEIVTDKQDALDFKKDKGAFYLVFEYMDHDLMGLLESGLVHFSEDHIKSFM 
KQLMEGLEYCHKKNFLHRDIKCSNILLNNSGQIKLADFGLARLYNSEESRPYTNKVITLW 
Y RPPELLLGEER YT PAI DVWSCGC I LGELFTKKP I FQANLELAQLEL I S RLCGS PCPAVW 
PDVIKLPYFNTMKPKKQYRRRLREEFSFIPSAALDLLDHMLTLDPSKRCTAEQTLQSDFL 
KDVELSKMAPPDLPHWQDCHELWSKKRRRQRQSGWVEEPPPSKTSRKETTSGTSTEPVK 
NSS PAPPQPAPGKVESGAGDAIGLADI TQQLNQSELAVLLNLLQSQTDLSI PQMAQLLNI 
HSNPEMQQQLEALNQSISALTEATSQQQDSETMAPEESLKEAPSAPVILPSAEQMTLEAS 
STPADMQNILAVLLSQLMKTQEPAGSLEENNSDKNSGPQGPRRTPTMPQEEAAACPPHIL 
PPEKRPPEPPGPPPPPPPPPLVEGDLSSAPQELNPAVTAALLQLLSQPEAEPPGHLPHEH 
QALRPMEYSTRPRPNRTYGNTDGPETGFSAIDTDERNSGPALTESLVQTLVKNRTFSGSL 
SHLGESSSYQGTGSVQFPGDQDLRFARVPLALHPWGQPFLKAEGSSNSWHAETKLQNY 
GELGPGTTGASSSGAGLHWGGPTQSSAYGKLYRGPTRVPPRGGRGRGVPY 

SEQ ID NO: 332. >T60764_P3 # TY Protein # CC #LN 1422 # Source Gene: 

T607 64 # Encoding Transcript: 7 

MPNSERHGGKKDGSGGASGTLQPSSGGGSSNSRERHRLVSKHKRHKSKHSKDMGLVTPEA 
ASLGTVIKPLVEYDDISSDSDTFSDDMAFKLDRRENDERRGSDRSDRLHKHRHHQHRRSR 
DLLKAKQTEKEKSQEVSSKSGSMKDRISGSSKRSNEETDDYGKAQVAKSSSKESRSSKLH 
KEKTRKERELKSGHKDRSKSHRKRETPKSYKTVDSPKRRSRSPHRKWSDSSKQDDSPSGA 
SYGQDYDLSPSRSHTSSNYDSYKKSPGSTSRRQSVSPPYKEPSAYQSSTRSPSPYSRRQR 
SVSPYSRRRSSSYERSGSYSGRSPSPYGRRRSSSPFLSKRSLSRSPLPSRKSMKSRSRSP 
AYSRHSSSHSKKKRSSSRSRHSSISPVRLPLNSSLGAELSRKKKERAAAAAAAKMDGKES 
KGSPVFLPRKENSSVEAKDSGLESKKLPRSVKLEKSAPDTELVNVTHLNTEVKNSSDTGK 
VKLDENSEKHLVKDLKAQGTRDSKPIALKEEIVTPKETETSEKETPPPLPTIASPPPPLP 
TTTPPPQTPPLPPLPPIPALPQQPPLPPSQPAFSQVPASSTSTLPPSTHSKTSAVSSQAN 
SQPPVQVSVKTQVSVTAAIPHLKTSTLPPLPLPPLLPGDDDMDRICCPRYGERRQTESDW 
GKRCVDKFDI IGI IGEGT YGQVYKAKDKDTGELVALKKVRLDNEKEGFP I TAI RE I K I LR 
QLIHRSVVNMKEIVTDKQDALDFKKDKGAFYLVFEYMDHDLMGLLESGLVHFSEDHIKSF 
MKQLMEGLEYCHKKNFLHRDIKCSNILLNNSGQIKLADFGLARLYNSEESRPYTNKVITL 
WYRPPELLLGEERYTPAIDVWSCGCILGELFTKKPIFQANLELAQLELISRLCGSPCPAV 
WPDVIKLPYFNTMKPKKQYRRRLREEFSFIPSAALDLLDHMLTLDPSKRCTAEQTLQSDF 
LKDVELSKMAPPDLPHWQDCHELWSKKRRRQRQSGVWEEPPPSKTSRKETTSGTSTEPV 
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KNSSPAPPQPAPGKVESGAGDAIGLADITQQLNQSELAVLLNLLQSQTDLSIPQMAQLLN 
IHSNPEMQQQLEALNQSISALTEATSQQQDSETMAPEESLKEAPSAPVILPSAEQTTLEA 
SST PADMQNI LAVLLSQLMKTQE PAGSLEENNSDKNSGPQG PRRT PTMPQEEAAEKRPPE 
PPGPPPPPPPPPLVEGDLSSAPQELNPAVTAALLQLLSQPEAEPPGHLPHEHQALRPMEY 
STRPRPNRTYGNTDGPETGFSAIDTDERNSGPALTESLVQTLVKNRTFSGSLSHLGESSS 
YQGTGSVQFPGDQDLRFARVPLALHPWGQPFLKAEGSSNSWHAETKLQNYGELGPGTT 
GASSSGAGLHWGGPTQSSAYGKLYRGPTRVPPRGGRGRGVPY 

SEQ ID NO: 333. >T60764_P4 # TY Protein # CC #LN 1241 # Source Gene: 

T607 64 # Encoding Transcript: 10 

MPNSERHGGKKDGSGGASGTLQPSSGGGSSNSRERHRLVSKHKRHKSKHSKDMGLVTPEA 
ASLGTVIKPLVEYDDISSDSDTFSDDMAFKLDRRENDERRGSDRSDRLHKHRHHQHRRSR 
DLLKAKQTEKEKSQEVSSKSGSMKDRISGSSKRSNEETDDYGKAQVAKSSSKESRSSKLH 
KEKTRKERELKSGHKDRSKSHRKRETPKSYKTVDSPKRRSRSPHRECWSDSSKQDDSPSGA 
SYGQDYDLSPSRSHTSSNYDSYKKSPGSTSRRQSVSPPYKEPSAYQSSTRSPSPYSRRQR 
SVSPYSRRRSSSYERSGSYSGRSPSPYGRRRSSSPFLSKRSLSRSPLPSRKSMKSRSRSP 
AYSRHSSSHSKKKRSSSRSRHSSISPVRLPLNSSLGAELSRKKKERAAAAAAAKMDGKES 
KGSPVFLPRKENSSVEAKDSGLESKKLPRSVKLEKSAPDTELVNVTHLNTEVKNSSDTGK 
VKLDENSEKHLVKDLKAQGTRDSKPIALKEEIVTPKETETSEKETPPPLPTIASPPPPLP 
TTTPPPQTPPLPPLPPIPALPQQPPLPPSQPAFSQVPASSTSTLPPSTHSKTSAVSSQAN 
SQPPVQVSVKTQVSVTAAIPHLKTSTLPPLPLPPLLPGDDDMDSPKETLPSKPVKKEKEQ 
RTRHLLTDLPLPPELPGGDLSPPDSPEPKAITPPQQPYKKRPKICCPRYGERRQTESDWG 
KRCVDKFDI IGI IGEGTYGQV YKAKDKDTGELVALKKVRLDNEKEGFP I TAI REI KI LRQ 
LIHRSVVNMKEIVTDKQDALDFKKDKGAFYLVFEYMDHDLMGLLESGLVHFSEDHIKSFM 
KQLMEGLEYCHKKNFLHRDIKCSNILLNNSGQIKLADFGLARLYNSEESRPYTNKVITLW 
YRPPELLLGEERYTPAIDVWSCGCILGELFTKKPIFQANLELAQLELISRLCGSPCPAVW 
PDVIKLPYFNTMKPKKQYRRRLREEFSFIPSAALDLLDHMLTLDPSKRCTAEQTLQSDFL 
KDVELSKMAPPDLPHWQDCHELWSKKRRRQRQSGVVVEEPPPSKTSRKSTTSGTSTEPVK 
NSSPAPPQPAPGKVESGAGDAIGLADITQQLNQSELAVLLNLLQSQTDLSIPQMAQLLNI 
HSNPEMQQQLEALNQSISALTEATSQQQDSETMAPEESLKEAPSAPVILPSAEQTTLEAS 
STPADMQNILAVLLSQLMKTQEPAGSLELEASGVGQASGVG 

SEQ ID NO: 334 . >T60764_P5 # TY Protein # CC #LN 870 « Source Gene: T60764 
# Encoding Transcript: 11 

MPNSERHGGECKDGSGGASGTLQPSSGGGSSNSRERHRLVSKHKRHKSKHSKDMGLVTPEA 
ASLGTVIKPLVEYDDISSDSDTFSDDMAFKLDRRENDERRGSDRSDRLHKHRHHQHRRSR 
DLLKAKQTEKEKSQEVSSKSGSMKDRISGSSKRSNEETDDYGKAQVAKSSSKESRSSKLH 
KEKTRKERELKSGHKDRSKSHRKRETPKSYKTVDSPKRRSRSPHRKWSDSSKQDDSPSGA 
SYGQDYDLSPSRSHTSSNYDSYKKS PGSTSRRQSVSPPYKEPSAYQSSTRSPSPYSRRQR 
SVSPYSRRRSSSYERSGSYSGRSPSPYGRRRSSSPFLSKRSLSRSPLPSRKSMKSRSRSP 
AYSRHSSSHSKKKRSSSRSRHSSISPVRLPLNSSLGAELSRKKKERAAAAAAAKMDGKES 
KGSPVFLPRKENSSVEAKDSGLESKKLPRSVKLEKSAPDTELVNVTHLNTEVKNSSDTGK 
VKLDENSEKHLVKDLKAQGTRDSKPIALKEEIVTPKETETSEKETPPPLPTIASPPPPLP 
TTTPPPQTPPLPPLPPIPALPQQPPLPPSQPAFSQVPASSTSTLPPSTHSKTSAVSSQAN 
SQPPVQVSVKTQVSVTAAIPHLKTSTLPPLPLPPLLPGDDDMDSPKETLPSKPVKKEKEQ 
RTRHLLT DL PL PPEL PGGDLSPPDSPE PKAIT P PQQPYFGCRPKI CCPRYGERRQTES DWG 
KRCVDKFDI IGI IGEGT YGQV YKAKDKDTGEL VALKKVRLDNEKEG FP I TAI RE I KI LRQ 
LIHRSWNMKEIVTDKQDALDFKKDKGAFYLVFEYMDHDLMGLLESGLVHFSEDHIKSFM 
KQLMEGLEYCHKKNFLHRDIKCSNILLNNR 



SEQ ID NO:335. >T62520 # TY Consensus # Length 4925 # Number of exons 39 
caccccaaaacatayggctaggtacgaggctgggtggtgcgagcgaggcctgggtggcra 
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ttcggacgaggcgcaatttgagtttccatttctcggatttgggaactggtataagcattg 
tctgtgatgtaaacaaagtcttcaatatttggagaaaacatctcctcatacttgagagca 
caagaggaagagagagaccctcactgctggggagtccctgccacactcagtcccccacca 
cactgaatcggaattccgagaggggcgtcaaggggcggtgcctcgggtggcgggaagaag 
aggaggcgcgagaatggaggtggaggccgtctgtggtggcgcgggcgaggtggaggccca 
ggactctgaccctgcccctgccttcagcaaggcccccggcagcgccggccactacgaact 
gccgtgggttgaaaaatataggccagtaaagctgaatgaaattgtcgggaatgaagacac 
cgtgagcaggctagagtcatccagaagcaagttttaaccggttttaatgatcattttcgc 
cgtaggtctttgcaagggaaggaaatgtgcccaacatcatcattgcggggtcagaggtga 
cggagatgcaaacagcagtgaacataactcaagagctacctgaggtccttctggtggttt 
gaagtctttactggagatctcattcccagggccctccaggaaccggcaagaccacaagca 
ttctgtgcttggcccgggccctgctgggcccagcactcaaagatgccatgttggaactca 
atgcttcaaatgacaggggcattgacgttgtgaggaataaaattaaaatgtttgctcaac 
aaaaagtcactcttcccaaaggccgacataagatcatcattctggatgaagcagacagct 
gaatttgtgtcatgaaggactggagagaaccaattcccacaagtgtgatggcagcccttg 
tagatcgtggttttcgctgtggtgtcgctgggctgaggtggaggcccaggactctgaccc 
tgaccccatgctcttcgagacaacggctccccaggcatgcctgtcgcaggtcctatcctt 
agctgaaatcgttgcccgctaggtggtttgcacaagacatatcacgcggcgcacgtacta 
tgtcttgaatagtgaatagttagtcctngtgagagtgatatcgactatcgcagatgaagg 
tctatgggtcatagtagggctcatatctgcgaacgggacatcgcaaactgtgctcaaaat 
cat teat tgcgggtccctccagagtaacctggccaagaatgcaacaaaagcgaattcctt 
gctgactgtagcgaccacgtgtgcacctgacttggtgcccagacactccgaaagatgcca 
tgttgtacactctatgcttcatatgactggggccacttgcgattgtgagggaatatgctt 
atgatgtatgcctctggcggacatggtctatctcttcccaacaggtgctgacaactcatg 
aatcgactctacttgctgttattgcaaggctaaggatcatgccacttgatgcagtaatga 
tgcgccagttgagagccgttgtcagtggattaacctattggaaattctatctctaaacaa 
tcatctcgtcgttgctgcttcattgcccatgaccgacggagcccagcaagccttgaggag 
aaccatggaaatctactctaaaaccactcgcttcgcccttgcttgtaatgcttcggataa 
gatcatcgagcccattcagtcccgctgtgcagtcctccggtacacaaagctgaccgacgc 
ccagatcctcaccaggctgatgaatgttatcgagaaggagagggtaccctacactgatga 
cggcctagaagccatcatcttcacggcccagggagacatgaggcagggtatgtgggctgc 
taccattcaggcgtgggcatcctgtgcctgccatgccccttccatggggaaggggaggga 
agtgacagctgcacaaggtacaacaagagaaagagacagagacctggcaccctcctggga 
cagtgggctgtctagctcggactccaagaagagaggcagggaagggcagagctttaaaag 
tcaccccggaggctgaggcaggaggatcacctgaggtcaggagttcaagaccagcctggc 
caacatggtgaaatcccatctctactaattgtgccactacactccatgctgggtgacaga 
gtgagactctgtctcaaaaaaaaaaaaaaaaaaaaaaagtcacccttggcccggtgcggt 
ggctcatgcctgtaatcccagcacttgggaggcggaggtgggagaatcgcttaagcccag 
gagttcgagaccagcctgggcaacatagcgagactcctgtctctactcaaaaagaaataa 
atataaaaatcatccttgacatgtgtttgctcctgaacaggcgctgaacaacctgcagtc 
caccttctcaggatttggcttcattaacagtgagaacgtgttcaagcaggaaaccatggc 
ttccaccgggaaaaggcaagggcaggtgtgggtgacctctggagccatcaggaggctttc 
attgggaagatgtaggggtcgacattgtaccagaggcagagcctcatgccactgcggaga 
atgctgcagggaggacggagtccccagcaccatgcacctgcctgctggagtagcctgacc 
ccttgtttagggtgtcaggctgaggctgggggaccttttgtagggaaggccagttcctac 
tccggggtggaggtgggaattggcccaaaagacctcccagttctaaagtgaattccaagt 
ggttctgggcaactgaccccccaagtgatctaagacagatgacttgtgactttaaagctg 
tccccaagtgagcatccttcacaggcccttggttgctctcttgttcatctgtcttatgtt 
tgggaaatttatcttcctggtgccggaagcacttctctttgatggtgagaggagcattcc 
ccaaactcatcctggttgctggagaatttgatgaaaacaacaggttgagaagttttaccc 
atagattgcctggaagtcggtctgttggggcctccccgctcacattgtcccttctagccc 
ctgaccgcctttcctttccctttggccaggtctgtgacgagccccacccactgctggtaa 
aggagatgatccagcactgtgtgaatgccaacattgacgaagcctacaagggttggcctc 
cttccccccgtcctggcagaggaaccgctcacttcttgggcagaggagagcgggagccct 
tctgattcttgctcacttgtggcatctgggctactcaccagaagatatcattggcaacat 
ctttcgagtgtgtaaaactttccaaatggcagaatacctgaaactggagtttatcaagga 
actgccaaacggttttccgaagtggctgcagcacgtgcattcccaccagcattccggagg 
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atcctcaacctagccaccacatggagtgctcagtttggtttttaagccattctaatagga 
aattggatacactcacatgaaaatagcggaaggagtgaactctcttttgcagatggcagg 
cctcctggcaaggctgtgtcagaagacaatggccccggtggccagttagagcagagactt 
cactgactgacttacaggtgccctattctgaggtacaggagccgcggctttctgatgggg 
gaaaatgccgccttaggctggagccaacatgactgtcctttaaactccagtggctggcca 
ggcacggtagctcacgcctgtaatcccaacactttgggaggccgaggcaggtggatcacc 
tgaggtcagaagttcaagaccagcctggccaacatggggaaaccctgtctttactaaaaa 
tataaaaattagctgggtgtggtggcgggcacctgtaatcccagctactcgggaggctgt 
ggcaggagaatcgcttgaacccaggaggtggaggttgcagtgagccaagatcacaccatt 
gcactccagcctgggcgacagagtctccatctggggaaaaaaattaaataaataaactcc 
cgtgacttgcatgtttgcttctgggacgtctgtgccccgcaagtgtttgagtttggcctc 
caccccattgatgcggtgacggggcggaaaggcgcagagaagctgagggcggtctctgat 
ctgtgtgtgggttgacattttagctaataaagccttgcagtgtttgttggcaaaaaaaaa 
carcacaaamaagcaaggcaatcaaaataaacggaatgcaggaacaaaagacggcaacgg 
acacatggcacaagtggcaaaaggcggaagcccggaacccaagttacttgtgcacagacg 
tggaaaaaaccgacggagaccacaaagagaaagcgaataactagcgaaaaggccagataa 
accggctagagacaggacaagtctgtgagagrccmcccamtggtggtcacggcgatgatc 
cagactgtgtgacatgcaaaattgacgaagctacaaaatatggtcaactgtggcatctgg 
gatactcacagaaaatatctggcaacatttacgatgtttaacatgagaatcgtgaaccag 
agtggggttgatgagcagaaacatgatcaggctgggagagtcttgggaaataatataatc 
gggtgaaaaaaaaacggctttggcgaaggtagcggagtcccctttagggaggctggtatc 
cttgsytcmctttgkgggcatytggggctactctcccccgaaaaaattcctttgggcaaa 
cttcttttttcagggggggtaaaaggcattgacgttgtgaggaataaaattaaaatgttt 
gctcaacaaaaagtcactcttcccaaaggccgacataagatcatcattctggatgaagca 
gacag 

SEQ ID NO: 336 . >T62520_T1 # TY Transcript # LN 1714 # Source Gene: T62520 
# Encoded protein: T62520_P14 

caatttgagtttccatttctcggatttgggaactggtataagcattgtctgtgatgtaaa 
caaagtcttcaatatttggagaaaacatctcctcatacttgagagcacaagaggaagaga 
gagaccctcactgctggggagtccctgccacactcagtcccccaccacactgaatcggaa 
ttccgagagggaagaggaggcgcgagaatggaggtggaggccgtctgtggtggcgcgggc 
gaggtggaggcccaggactctgaccctgcccctgccttcagcaaggcccccggcagcgcc 
ggccactacgaactgccgtgggttgaaaaatataggccagtaaagctgaatgaaattgtc 
gggaatgaagacaccgtgagcaggctagaggtctttgcaagggaaggaaatgtgcccaac 
atcatcattgcgggccctccaggaaccggcaagaccacaagcattctgtgcttggcccgg 
gccctgctgggcccagcactcaaagatgccatgttggaactcaatgcttcaaatgacagg 
ggcattgacgttgtgaggaataaaattaaaatgtttgctcaacaaaaagtcactcttccc 
aaaggccgacataagatcatcattctggatgaagcagacagcatgaccgacggagcccag 
caagccttgaggagaaccatggaaatctactctaaaaccactcgcttcgcccttgcttgt 
aatgcttcggataagatcatcgagcccattcagtcccgctgtgcagtcctccggtacaca 
aagctgaccgacgcccagatcctcaccaggctgatgaatgttatcgagaaggagagggta 
ccctacactgatgacggcctagaagccatcatcttcacggcccagggagacatgaggcag 
gcgctgaacaacctgcagtccaccttctcaggatttggcttcattaacagtgagaacgtg 
ttcaaggtctgtgacgagccccacccactgctggtaaaggagatgatccagcactgtgtg 
aatgccaacattgacgaagcctacaagattcttgctcacttgtggcatctgggctactca 
ccagaagatatcattggcaacatctttcgagtgtgtaaaactttccaaatggcagaatac 
ctgaaactggagtttatcaaggaaattggatacactcacatgaaaatagcggaaggagtg 
aactctcttttgcagatggcaggcctcctggcaaggctgtgtcagaagacaatggccccg 
gtggccagttagagcagagacttcactgactgacttacaggtgccctattctgaggtaca 
ggagccgcggctttctgatgggggaaaatgccgccttaggctggagccaacatgactgtc 
ctttaaactccagtggctggccaggcacggtagctcacgcctgtaatcccaacactttgg 
gaggccgaggcaggtggatcacctgaggtcagaagttcaagaccagcctggccaacatgg 
ggaaaccctgtctttactaaaaatataaaaattagctgggtgtggtggcgggcacctgta 
atcccagctactcgggaggctgtggcaggagaatcgcttgaacccaggaggtggaggttg 
cagtgagccaagatcacaccattgcactccagcctgggcgacagagtctccatctgggga 
aaaaaattaaataaataaactcccgtgacttgca 
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SEQ ID NO: 337 , >T62520JT2 # TY Transcript ft LN 1557 # Source Gene: T62520 

# Encoded protein: T62520_P2 

gagcgaggcctgggtggcrattcggacgaggcggaagaggaggcgcgagaatggaggtgg 
aggccgtctgtggtggcgcgggcgaggtggaggcccaggactctgaccctgcccctgcct 
tcagcaaggcccccggcagcgccggccactacgaactgccgtgggttgaaaaatataggc 
cagtaaagctgaatgaaattgtcgggaatgaagacaccgtgagcaggctagaggtctttg 
caagggaaggaaatgtgcccaacatcatcattgcgggccctccaggaaccggcaagacca 
caagcattctgtgcttggcccgggccctgctgggcccagcactcaaagatgccatgttgg 
aactcaatgcttcaaatgacaggggcattgacgttgtgaggaataaaattaaaatgtttg 
ctcaacaaaaagtcactcttcccaaaggccgacataagatcatcattctggatgaagcag 
acagcatgaccgacggagcccagcaagccttgaggagaaccatggaaatctactctaaaa 
ccactcgcttcgcccttgcttgtaatgcttcggataagatcatcgagcccattcagtccc 
gctgtgcagtcctccggtacacaaagctgaccgacgcccagatcctcaccaggctgatga 
atgttatcgagaaggagagggtaccctacactgatgacggcctagaagccatcatcttca 
cggcccagggagacatgaggcaggcgctgaacaacctgcagtccaccttctcaggatttg 
gcttcattaacagtgagaacgtgttcaaggtctgtgacgagccccacccactgctggtaa 
aggagatgatccagcactgtgtgaatgccaacattgacgaagcctacaagattcttgctc 
acttgtggcatctgggctactcaccagaagatatcattggcaacatctttcgagtgtgta 
aaactttccaaatggcagaatacctgaaactggagtttatcaaggaaattggatacactc 
acatgaaaatagcggaaggagtgaactctcttttgcagatggcaggcctcctggcaaggc 
tgtgtcagaagacaatggccccggtggccagttagagcagagacttcactgactgactta 
caggtgccctattctgaggtacaggagccgcggctttctgatgggggaaaatgccgcctt 
aggctggagccaacatgactgtcctttaaactccagtggctggccaggcacggtagctca 
cgcctgtaatcccaacactttgggaggccgaggcaggtggatcacctgaggtcagaagtt 
caagaccagcctggccaacatggggaaaccctgtctttactaaaaatataaaaattagct 
gggtgtggtggcgggcacctgtaatcccagctactcgggaggctgtggcaggagaatcgc 
ttgaacccaggaggtggaggttgcagtgagccaagatcacaccattgcactccagcctgg 
gcgacagagtctccatctggggaaaaaaattaaataaataaactcccgtgacttgca 

SEQ ID NO: 338. >T62520_J3 # TY Transcript # LN 1557 # Source Gene: T62520 

# Encoded protein: T62520_P1 

ggcgtcaaggggcggtgcctcgggtggcgggaagaagaggaggcgcgagaatggaggtgg 
aggccgtctgtggtggcgcgggcgaggtggaggcccaggactctgaccctgcccctgcct 
tcagcaaggcccccggcagcgccggccactacgaactgccgtgggttgaaaaatataggc 
cagtaaagctgaatgaaattgtcgggaatgaagacaccgtgagcaggctagaggtctttg 
caagggaaggaaatgtgcccaacatcatcattgcgggccctccaggaaccggcaagacca 
caagcattctgtgcttggcccgggccctgctgggcccagcactcaaagatgccatgttgg 
aactcaatgcttcaaatgacaggggcattgacgttgtgaggaataaaattaaaatgtttg 
ctcaacaaaaagtcactcttcccaaaggccgacataagatcatcattctggatgaagcag 
acagcatgaccgacggagcccagcaagccttgaggagaaccatggaaatctactctaaaa 
ccactcgcttcgcccttgcttgtaatgcttcggataagatcatcgagcccattcagtccc 
gctgtgcagtcctccggtacacaaagctgaccgacgcccagatcctcaccaggctgatga 
atgttatcgagaaggagagggtaccctacactgatgacggcctagaagccatcatcttca 
cggcccagggagacatgaggcaggcgctgaacaacctgcagtccaccttctcaggatttg 
gcttcattaacagtgagaacgtgttcaaggtctgtgacgagccccacccactgctggtaa 
aggagatgatccagcactgtgtgaatgccaacattgacgaagcctacaagattcttgctc 
acttgtggcatctgggctactcaccagaagatatcattggcaacatctttcgagtgtgta 
aaactttccaaatggcagaatacctgaaactggagtttatcaaggaaattggatacactc 
acatgaaaatagcggaaggagtgaactctcttttgcagatggcaggcctcctggcaaggc 
tgtgtcagaagacaatggccccggtggccagttagagcagagacttcactgactgactta 
caggtgccctattctgaggtacaggagccgcggctttctgatgggggaaaatgccgcctt 
aggctggagccaacatgactgtcctttaaactccagtggctggccaggcacggtagctca 
cgcctgtaatcccaacactttgggaggccgaggcaggtggatcacctgaggtcagaagtt 
caagaccagcctggccaacatggggaaaccctgtctttactaaaaatataaaaattagct 
gggtgtggtggcgggcacctgtaatcccagctactcgggaggctgtggcaggagaatcgc 
ttgaacccaggaggtggaggttgcagtgagccaagatcacaccattgcactccagcctgg 
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gcgacagagtctccatctggggaaaaaaattaaataaataaactcccgtgacttgca 

SEQ ID NO: 339 . >T62520JT4 # TY Transcript 8 LN 1922 # Source Gene: T62520 

# Encoded protein: T62520_P14 

caatttgagtttccatttctcggatttgggaactggtataagcattgtctgtgatgtaaa 
caaagtcttcaatatttggagaaaacatctcctcatacttgagagcacaagaggaagaga 
gagaccctcactgctggggagtccctgccacactcagtcccccaccacactgaatcggaa 
ttccgagagggaagaggaggcgcgagaatggaggtggaggccgtctgtggtggcgcgggc 
gaggtggaggcccaggactctgaccctgcccctgccttcagcaaggcccccggcagcgcc 
ggccactacgaactgccgtgggttgaaaaatataggccagtaaagctgaatgaaattgtc 
gggaatgaagacaccgtgagcaggctagaggtctttgcaagggaaggaaatgtgcccaac 
atcatcattgcgggccctccaggaaccggcaagaccacaagcattctgtgcttggcccgg 
gccctgctgggcccagcactcaaagatgccatgttggaactcaatgcttcaaatgacagg 
ggcattgacgttgtgaggaataaaattaaaatgtttgctcaacaaaaagtcactcttccc 
aaaggccgacataagatcatcattctggatgaagcagacagcatgaccgacggagcccag 
caagccttgaggagaaccatggaaatctactctaaaaccactcgcttcgcccttgcttgt 
aatgcttcggataagatcatcgagcccattcagtcccgctgtgcagtcctccggtacaca 
aagctgaccgacgcccagatcctcaccaggctgatgaatgttatcgagaaggagagggta 
ccctacactgatgacggcctagaagccatcatcttcacggcccagggagacatgaggcag 
gcgctgaacaacctgcagtccaccttctcaggatttggcttcattaacagtgagaacgtg 
ttcaaggtctgtgacgagccccacccactgctggtaaaggagatgatccagcactgtgtg 
aatgccaacattgacgaagcctacaagattcttgctcacttgtggcatctgggctactca 
ccagaagatatcattggcaacatctttcgagtgtgtaaaactttccaaatggcagaatac 
ctgaaactggagtttatcaaggaaattggatacactcacatgaaaatagcggaaggagtg 
aactctcttttgcagatggcaggcctcctggcaaggctgtgtcagaagacaatggccccg 
gtggccagttagagcagagacttcactgactgacttacaggtgccctattctgaggtaca 
ggagccgcggctttctgatgggggaaaatgccgccttaggctggagccaacatgactgtc 
ctttaaactccagtggctggccaggcacggtagctcacgcctgtaatcccaacactttgg 
gaggccgaggcaggtggatcacctgaggtcagaagttcaagaccagcctggccaacatgg 
ggaaaccctgtctttactaaaaatataaaaattagctgggtgtggtggcgggcacctgta 
atcccagctactcgggaggctgtggcaggagaatcgcttgaacccaggaggtggaggttg 
cagtgagccaagatcacaccattgcactccagcctgggcgacagagtctccatctgggga 
aaaaaattaaataaataaactcccgtgacttgcaaaaaaaaacarcacaaamaagcaagg 
caatcaaaataaacggaatgcaggaacaaaagacggcaacggacacatggcacaagtggc 
aaaaggcggaagcccggaacccaagttacttgtgcacagacgtggaaaaaaccgacggag 
accacaaagagaaagcgaataactagcgaaaaggccagataaaccggctagagacaggac 
aa 

SEQ ID NO: 340. >T62520_T5 # TY Transcript # LN 1875 # Source Gene: T62520 

# Encoded protein: T62520_P14 

caatttgagtttccatttctcggatttgggaactggtataagcattgtctgtgatgtaaa 
caaagtcttcaatatttggagaaaacatctcctcatacttgagagcacaagaggaagaga 
gagaccctcactgctggggagtccctgccacactcagtcccccaccacactgaatcggaa 
ttccgagagggaagaggaggcgcgagaatggaggtggaggccgtctgtggtggcgcgggc 
gaggtggaggcccaggactctgaccctgcccctgccttcagcaaggcccccggcagcgcc 
ggccactacgaactgccgtgggttgaaaaatataggccagtaaagctgaatgaaattgtc 
gggaatgaagacaccgtgagcaggctagaggtctttgcaagggaaggaaatgtgcccaac 
atcatcattgcgggccctccaggaaccggcaagaccacaagcattctgtgcttggcccgg 
gccctgctgggcccagcactcaaagatgccatgttggaactcaatgcttcaaatgacagg 
ggcattgacgttgtgaggaataaaattaaaatgtttgctcaacaaaaagtcactcttccc 
aaaggccgacataagatcatcattctggatgaagcagacagcatgaccgacggagcccag 
caagccttgaggagaaccatggaaatctactctaaaaccactcgcttcgcccttgcttgt 
aatgcttcggataagatcatcgagcccattcagtcccgctgtgcagtcctccggtacaca 
aagctgaccgacgcccagatcctcaccaggctgatgaatgttatcgagaaggagagggta 
ccctacactgatgacggcctagaagccatcatcttcacggcccagggagacatgaggcag 
gcgctgaacaacctgcagtccaccttctcaggatttggcttcattaacagtgagaacgtg 
ttcaaggtctgtgacgagccccacccactgctggtaaaggagatgatccagcactgtgtg 
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aatgccaacattgacgaagcctacaagattcttgctcacttgtggcatctgggctactca 
ccagaagatatcattggcaacatctttcgagtgtgtaaaactttccaaatggcagaatac 
ctgaaactggagtttatcaaggaaattggatacactcacatgaaaatagcggaaggagtg 
aactctcttttgcagatggcaggcctcctggcaaggctgtgtcagaagacaatggccccg 
gtggccagttagagcagagacttcactgactgacttacaggtgccctattctgaggtaca 
ggagccgcggctttctgatgggggaaaatgccgccttaggctggagccaacatgactgtc 
ctttaaactccagtggctggccaggcacggtagctcacgcctgtaatcccaacactttgg 
gaggccgaggcaggtggatcacctgaggtcagaagttcaagaccagcctggccaacatgg 
ggaaaccctgtctttactaaaaatataaaaattagctgggtgtggtggcgggcacctgta 
atcccagctactcgggaggctgtggcaggagaatcgcttgaacccaggaggtggaggttg 
cagtgagccaagatcacaccattgcactccagcctgggcgacagagtctccatctgggga 
aaaaaattaaataaataaactcccgtgacttgcatgtttgcttctgggacgtctgtgccc 
cgcaagtgtttgagtttggcctccaccccattgatgcggtgacggggcggaaaggcgcag 
agaagctgagggcggtctctgatctgtgtgtgggttgacattttagctaataaagccttg 
cagtgtttgttggca 

SEQ ID NO: 341 . >T62520_T6 # TY Transcript # LN 1718 # Source Gene: T62520 

# Encoded protein: T62520_P1 

caatttgagtttccatttctcggatttgggaactggtataagcattgtctgtgatgtaaa 
caaagtcttcaatatttggagaaaacatctcctcatacttgagagcacaagaggaagaga 
gagaccctcactgctggggagtccctgccacactcagtcccccaccacactgaatcggaa 
ttccgagagggaagaggaggcgcgagaatggaggtggaggccgtctgtggtggcgcgggc 
gaggtggaggcccaggactctgaccctgcccctgccttcagcaaggcccccggcagcgcc 
ggccactacgaactgccgtgggttgaaaaatataggccagtaaagctgaatgaaattgtc 
gggaatgaagacaccgtgagcaggctagaggtctttgcaagggaaggaaatgtgcccaac 
atcatcattgcgggccctccaggaaccggcaagaccacaagcattctgtgcttggcccgg 
gccctgctgggcccagcactcaaagatgccatgttggaactcaatgcttcaaatgacagg 
ggcattgacgttgtgaggaataaaattaaaatgtttgctcaacaaaaagtcactcttccc 
aaaggccgacataagatcatcattctggatgaagcagacagcatgaccgacggagcccag 
caagccttgaggagaaccatggaaatctactctaaaaccactcgcttcgcccttgcttgt 
aatgcttcggataagatcatcgagcccattcagtcccgctgtgcagtcctccggtacaca 
aagctgaccgacgcccagatcctcaccaggctgatgaatgttatcgagaaggagagggta 
ccctacactgatgacggcctagaagccatcatcttcacggcccagggagacatgaggcag 
gcgctgaacaacctgcagtccaccttctcaggatttggcttcattaacagtgagaacgtg 
ttcaaggtctgtgacgagccccacccactgctggtaaaggagatgatccagcactgtgtg 
aatgccaacattgacgaagcctacaagattcttgctcacttgtggcatctgggctactca 
ccagaagatatcattggcaacatctttcgagtgtgtaaaactttccaaatggcagaatac 
ctgaaactggagtttatcaaggaaattggatacactcacatgaaaatagcggaaggagtg 
aactctcttttgcagatggcaggcctcctggcaaggctgtgtcagaagacaatggccccg 
gtggccagttagagcagagacttcactgactgacttacaggtgccctattctgaggtaca 
ggagccgcggctttctgatgggggaaaatgccgccttaggctggagccaacatgactgtc 
ctttaaactccagtggctggccaggcacggtagctcacgcctgtaatcccaacactttgg 
gaggccgaggcaggtggatcacctgaggtcagaagttcaagaccagcctggccaacatgg 
ggaaaccctgtctttactaaaaatataaaaattagctgggtgtggtggcgggcacctgta 
atcccagctactcgggaggctgtgtgagaatcgtgaaccagagtggggttgatgagcaga 
aacatgatcaggctgggagagtcttgggaaataatataatcgggtgaaaaaaaaacggct 
ttggcgaaggtagcggagtcccctttagggaggctggt 

SEQ ID NO:342. >T62520_T7 # TY Transcript # LN 1415 # Source Gene: T62520 

# Encoded protein: T62520_P1 

caatttgagtttccatttctcggatttgggaactggtataagcattgtctgtgatgtaaa 
caaagtcttcaatatttggagaaaacatctcctcatacttgagagcacaagaggaagaga 
gagaccctcactgctggggagtccctgccacactcagtcccccaccacactgaatcggaa 
ttccgagagggaagaggaggcgcgagaatggaggtggaggccgtctgtggtggcgcgggc 
gaggtggaggcccaggactctgaccctgcccctgccttcagcaaggcccccggcagcgcc 
ggccactacgaactgccgtgggttgaaaaatataggccagtaaagctgaatgaaattgtc 
gggaatgaagacaccgtgagcaggctagaggtctttgcaagggaaggaaatgtgcccaac 



222/365 



Figure 1 (Cont'd) 



atcatcattgcgggccctccaggaaccggcaagaccacaagcattctgtgcttggcccgg 
gccctgctgggcccagcactcaaagatgccatgttggaactcaatgcttcaaatgacagg 
ggcattgacgttgtgaggaataaaattaaaatgtttgctcaacaaaaagtcactcttccc 
aaaggccgacataagatcatcattctggatgaagcagacagcatgaccgacggagcccag 
caagccttgaggagaaccatggaaatctactctaaaaccactcgcttcgcccttgcttgt 
aatgcttcggataagatcatcgagcccattcagtcccgctgtgcagtcctccggtacaca 
aagctgaccgacgcccagatcctcaccaggctgatgaatgttatcgagaaggagagggta 
ccctacactgatgacggcctagaagccatcatcttcacggcccagggagacatgaggcag 
gcgctgaacaacctgcagtccaccttctcaggatttggcttcattaacagtgagaacgtg 
ttcaaggtctgtgacgagccccacccactgctggtaaaggagatgatccagcactgtgtg 
aatgccaacattgacgaagcctacaagattcttgctcacttgtggcatctgggctactca 
ccagaagatatcattggcaacatctttcgagtgtgtaaaactttccaaatggcagaatac 
ctgaaactggagtttatcaaggaaattggatacactcacatgaaaatagcggaaggagtg 
aactctcttttgcagatggcaggcctcctggcaaggctgtgtcagaagacaatggccccg 
gtggccagttagagcagagacttcactgactgacttacaggtgccctattctgaggtaca 
ggagccgcggctttctgatgggggaaaatgccgcctgggcgacagagtctccatctgggg 
aaaaaaattaaataaataaactcccgtgacttgca 

SEQ ID NO:343. >T62520_T8 # TY Transcript # LN 1834 # Source Gene: T62520 

# Encoded protein: T62520_P1 

caatttgagtttccatttctcggatttgggaactggtataagcattgtctgtgatgtaaa 
caaagtcttcaatatttggagaaaacatctcctcatacttgagagcacaagaggaagaga 
gagaccctcactgctggggagtccctgccacactcagtcccccaccacactgaatcggaa 
ttccgagagggaagaggaggcgcgagaatggaggtggaggccgtctgtggtggcgcgggc 
gaggtggaggcccaggactctgaccctgcccctgccttcagcaaggcccccggcagcgcc 
ggccactacgaactgccgtgggttgaaaaatataggccagtaaagctgaatgaaattgtc 
gggaatgaagacaccgtgagcaggctagaggtctttgcaagggaaggaaatgtgcccaac 
atcatcattgcgggccctccaggaaccggcaagaccacaagcattctgtgcttggcccgg 
gccctgctgggcccagcactcaaagatgccatgttggaactcaatgcttcaaatgacagg 
ggcattgacgttgtgaggaataaaattaaaatgtttgctcaacaaaaagtcactcttccc 
aaaggccgacataagatcatcattctggatgaagcagacagcatgaccgacggagcccag 
caagccttgaggagaaccatggaaatctactctaaaaccactcgcttcgcccttgcttgt 
aatgcttcggataagatcatcgagcccattcagtcccgctgtgcagtcctccggtacaca 
aagctgaccgacgcccagatcctcaccaggctgatgaatgttatcgagaaggagagggta 
ccctacactgatgacggcctagaagccatcatcttcacggcccagggagacatgaggcag 
gcgctgaacaacctgcagtccaccttctcaggatttggcttcattaacagtgagaacgtg 
ttcaaggtctgtgacgagccccacccactgctggtaaaggagatgatccagcactgtgtg 
aatgccaacattgacgaagcctacaagattcttgctcacttgtggcatctgggctactca 
ccagaagatatcattggcaacatctttcgagtgtgtaaaactttccaaatggcagaatac 
ctgaaactggagtttatcaaggaactgccaaacggttttccgaagtggctgcagcacgtg 
cattcccaccagcattccggaggatcctcaacctagccaccacatggagtgctcagtttg 
gtttttaagccattctaataggaaattggatacactcacatgaaaatagcggaaggagtg 
aactctcttttgcagatggcaggcctcctggcaaggctgtgtcagaagacaatggccccg 
gtggccagttagagcagagacttcactgactgacttacaggtgccctattctgaggtaca 
ggagccgcggctttctgatgggggaaaatgccgccttaggctggagccaacatgactgtc 
ctttaaactccagtggctggccaggcacggtagctcacgcctgtaatcccaacactttgg 
gaggccgaggcaggtggatcacctgaggtcagaagttcaagaccagcctggccaacatgg 
ggaaaccctgtctttactaaaaatataaaaattagctgggtgtggtggcgggcacctgta 
atcccagctactcgggaggctgtggcaggagaatcgcttgaacccaggaggtggaggttg 
cagtgagccaagatcacaccattgcactccagcctgggcgacagagtctccatctgggga 
aaaaaattaaataaataaactcccgtgacttgca 

SEQ ID NO: 344. >T62520JT9 # TY Transcript # LN 14 62 # Source Gene: T62520 

# Encoded protein: T62520_P13 

caccccaaaacatayggctaggtacgaggctgggtggtgcgaagaggaggcgcgagaatg 
gaggtggaggccgtctgtggtggcgcgggcgaggtggaggcccaggactctgaccctgcc 
cctgccttcagcaaggcccccggcagcgccggccactacgaactgccgtgggttgaaaaa 
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tataggccagtaaagctgaatgaaattgtcgggaatgaagacaccgtgagcaggctagag 
gtctttgcaagggaaggaaatgtgcccaacatcatcattgcgggccctccaggaaccggc 
aagaccacaagcattctgtgcttggcccgggccctgctgggcccagcactcaaagatgcc 
atgttggaactcaatgcttcaaatgacagcatgaccgacggagcccagcaagccttgagg 
agaaccatggaaatctactctaaaaccactcgcttcgcccttgcttgtaatgcttcggat 
aagatcatcgagcccattcagtcccgctgtgcagtcctccggtacacaaagctgaccgac 
gcccagatcctcaccaggctgatgaatgttatcgagaaggagagggtaccctacactgat 
gacggcctagaagccatcatcttcacggcccagggagacatgaggcaggcgctgaacaac 
ctgcagtccaccttctcaggatttggcttcattaacagtgagaacgtgttcaaggtctgt 
gacgagccccacccactgctggtaaaggagatgatccagcactgtgtgaatgccaacatt 
gacgaagcctacaagattcttgctcacttgtggcatctgggctactcaccagaagatatc 
attggcaacatctttcgagtgtgtaaaactttccaaatggcagaatacctgaaactggag 
tttatcaaggaaattggatacactcacatgaaaatagcggaaggagtgaactctcttttg 
cagatggcaggcctcctggcaaggctgtgtcagaagacaatggccccggtggccagttag 
agcagagacttcactgactgacttacaggtgccctattctgaggtacaggagccgcggct 
ttctgatgggggaaaatgccgccttaggctggagccaacatgactgtcctttaaactcca 
gtggctggccaggcacggtagctcacgcctgtaatcccaacactttgggaggccgaggca 
ggtggatcacctgaggtcagaagttcaagaccagcctggccaacatggggaaaccctgtc 
tttactaaaaatataaaaattagctgggtgtggtggcgggcacctgtaatcccagctact 
cgggaggctgtggcaggagaatcgcttgaacccaggaggtggaggttgcagtgagccaag 
atcacaccattgcactccagcctgggcgacagagtctccatctggggaaaaaaattaaat 
aaataaactcccgtgacttgca 

SEQ ID NO: 345. >T62520_T10 # TY Transcript # LN 1455 # Source Gene: T62520 

# Encoded protein: T6252Q_P13 

ggcgtcaaggggcggtgcctcgggtggcgggaagaagaggaggcgcgagaatggaggtgg 
aggccgtctgtggtggcgcgggcgaggtggaggcccaggactctgaccctgcccctgcct 
tcagcaaggcccccggcagcgccggccactacgaactgccgtgggttgaaaaatataggc 
cagtaaagctgaatgaaattgtcgggaatgaagacaccgtgagcaggctagaggtctttg 
caagggaaggaaatgtgcccaacatcatcattgcgggccctccaggaaccggcaagacca 
caagcattctgtgcttggcccgggccctgctgggcccagcactcaaagatgccatgttgg 
aactcaatgcttcaaatgacagcatgaccgacggagcccagcaagccttgaggagaacca 
tggaaatctactctaaaaccactcgcttcgcccttgcttgtaatgcttcggataagatca 
tcgagcccattcagtcccgctgtgcagtcctccggtacacaaagctgaccgacgcccaga 
tcctcaccaggctgatgaatgttatcgagaaggagagggtaccctacactgatgacggcc 
tagaagccatcatcttcacggcccagggagacatgaggcaggcgctgaacaacctgcagt 
ccaccttctcaggatttggcttcattaacagtgagaacgtgttcaaggtctgtgacgagc 
cccacccactgctggtaaaggagatgatccagcactgtgtgaatgccaacattgacgaag 
cctacaagattcttgctcacttgtggcatctgggctactcaccagaagatatcattggca 
acatctttcgagtgtgtaaaactttccaaatggcagaatacctgaaactggagtttatca 
aggaaattggatacactcacatgaaaatagcggaaggagtgaactctcttttgcagatgg 
caggcctcctggcaaggctgtgtcagaagacaatggccccggtggccagttagagcagag 
acttcactgactgacttacaggtgccctattctgaggtacaggagccgcggctttctgat 
gggggaaaatgccgccttaggctggagccaacatgactgtcctttaaactccagtggctg 
gccaggcacggtagctcacgcctgtaatcccaacactttgggaggccgaggcaggtggat 
cacctgaggtcagaagttcaagaccagcctggccaacatggggaaaccctgtctttacta 
aaaatataaaaattagctgggtgtggtggcgggcacctgtaatcccagctactcgggagg 
ctgtggcaggagaatcgcttgaacccaggaggtggaggttgcagtgagccaagatcacac 
cattgcactccagcctgggcgacagagtctccatctggggaaaaaaattaaataaataaa 
ctcccgtgacttgca 

SEQ ID NO: 346. >T62520_T11 # TY Transcript > LN 1631 # Source Gene: T62520 

# Encoded protein: T62520_P2 

gagcgaggcctgggtggcrattcggacgaggcggaagaggaggcgcgagaatggaggtgg 
aggccgtctgtggtggcgcgggcgaggtggaggcccaggactctgaccctgcccctgcct 
tcagcaaggcccccggcagcgccggccactacgaactgccgtgggttgaaaaatataggc 
cagtaaagctgaatgaaattgtcgggaatgaagacaccgtgagcaggctagaggtctttg 
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caagggaaggaaatgtgcccaacatcatcattgcgggccctccaggaaccggcaagacca 
caagcattctgtgcttggcccgggccctgctgggcccagcactcaaagatgccatgttgg 
aactcaatgcttcaaatgacaggggcattgacgttgtgaggaataaaattaaaatgtttg 
ctcaacaaaaagtcactcttcccaaaggccgacataagatcatcattctggatgaagcag 
acagcatgaccgacggagcccagcaagccttgaggagaaccatggaaatctactctaaaa 
ccactcgcttcgcccttgcttgtaatgcttcggataagatcatcgagcccattcagtccc 
gctgtgcagtcctccggtacacaaagctgaccgacgcccagatcctcaccaggctgatga 
atgttatcgagaaggagagggtaccctacactgatgacggcctagaagccatcatcttca 
cggcccagggagacatgaggcaggcgctgaacaacctgcagtccaccttctcaggatttg 
gcttcattaacagtgagaacgtgttcaaggtctgtgacgagccccacccactgctggtaa 
aggagatgatccagcactgtgtgaatgccaacattgacgaagcctacaagggttggcctc 
cttccccccgtcctggcagaggaaccgctcacttcttgggcagaggagagcgggagccct 
tctgattcttgctcacttgtggcatctgggctactcaccagaagatatcattggcaacat 
ctttcgagtgtgtaaaactttccaaatggcagaatacctgaaactggagtttatcaagga 
aattggatacactcacatgaaaatagcggaaggagtgaactctcttttgcagatggcagg 
cctcctggcaaggctgtgtcagaagacaatggccccggtggccagttagagcagagactt 
cactgactgacttacaggtgccctattctgaggtacaggagccgcggctttctgatgggg 
gaaaatgccgccttaggctggagccaacatgactgtcctttaaactccagtggctggcca 
ggcacggtagctcacgcctgtaatcccaacactttgggaggccgaggcaggtggatcacc 
tgaggtcagaagttcaagaccagcctggccaacatggggaaaccctgtctttactaaaaa 
tataaaaattagctgggtgtggtggcgggcacctgtaatcccagctactcgggaggctgt 
ggcaggagaatcgcttgaacccaggaggtggaggttgcagtgagccaagatcacaccatt 
gcactccagcctgggcgacagagtctccatctggggaaaaaaattaaataaataaactcc 
cgtgacttgca 

SEQ ID NO: 347. >T62520_T12 # TY Transcript § LN 1613 # Source Gene: T62520 

# Encoded protein: T62520_P4 

caccccaaaacatayggctaggtacgaggctgggtggtgcgaagaggaggcgcgagaatg 
gaggtggaggccgtctgtggtggcgcgggcgaggtggaggcccaggactctgaccctgcc 
cctgccttcagcaaggcccccggcagcgccggccactacgaactgccgtgggttgaaaaa 
tataggccagtaaagctgaatgaaattgtcgggaatgaagacaccgtgagcaggctagag 
tcatccagaagcaagttttaaccggttttaatgatcattttcgccgtaggtctttgcaag 
ggaaggaaatgtgcccaacatcatcattgcgggccctccaggaaccggcaagaccacaag 
cattctgtgcttggcccgggccctgctgggcccagcactcaaagatgccatgttggaact 
caatgcttcaaatgacaggggcattgacgttgtgaggaataaaattaaaatgtttgctca 
acaaaaagtcactcttcccaaaggccgacataagatcatcattctggatgaagcagacag 
catgaccgacggagcccagcaagccttgaggagaaccatggaaatctactctaaaaccac 
tcgcttcgcccttgcttgtaatgcttcggataagatcatcgagcccattcagtcccgctg 
tgcagtcctccggtacacaaagctgaccgacgcccagatcctcaccaggctgatgaatgt 
tatcgagaaggagagggtaccctacactgatgacggcctagaagccatcatcttcacggc 
ccagggagacatgaggcaggcgctgaacaacctgcagtccaccttctcaggatttggctt 
cattaacagtgagaacgtgttcaaggtctgtgacgagccccacccactgctggtaaagga 
gatgatccagcactgtgtgaatgccaacattgacgaagcctacaagattcttgctcactt 
gtggcatctgggctactcaccagaagatatcattggcaacatctttcgagtgtgtaaaac 
tttccaaatggcagaatacctgaaactggagtttatcaaggaaattggatacactcacat 
gaaaatagcggaaggagtgaactctcttttgcagatggcaggcctcctggcaaggctgtg 
tcagaagacaatggccccggtggccagttagagcagagacttcactgactgacttacagg 
tgccctattctgaggtacaggagccgcggctttctgatgggggaaaatgccgccttaggc 
tggagccaacatgactgtcctttaaactccagtggctggccaggcacggtagctcacgcc 
tgtaatcccaacactttgggaggccgaggcaggtggatcacctgaggtcagaagttcaag 
accagcctggccaacatggggaaaccctgtctttactaaaaatataaaaattagctgggt 
gtggtggcgggcacctgtaatcccagctactcgggaggctgtggcaggagaatcgcttga 
acccaggaggtggaggttgcagtgagccaagatcacaccattgcactccagcctgggcga 
cagagtctccatctggggaaaaaaattaaataaataaactcccgtgacttgca 

SEQ ID NO: 348. >T62520_T13 # TY Transcript # LN 1494 # Source Gene: T62520 

# Encoded protein: T62520_P4 
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caccccaaaacatayggctaggtacgaggctgggtggtgcgaagaggaggcgcgagaatg 
gaggtggaggccgtctgtggtggcgcgggcgaggtggaggcccaggactctgaccctgcc 
cctgccttcagcaaggcccccggcagcgccggccactacgaactgccgtggtctttgcaa 
gggaaggaaatgtgcccaacatcatcattgcgggccctccaggaaccggcaagaccacaa 
gcattctgtgcttggcccgggccctgctgggcccagcactcaaagatgccatgttggaac 
tcaatgcttcaaatgacaggggcattgacgttgtgaggaataaaattaaaatgtttgctc 
aacaaaaagtcactcttcccaaaggccgacataagatcatcattctggatgaagcagaca 
gcatgaccgacggagcccagcaagccttgaggagaaccatggaaatctactctaaaacca 
ctcgcttcgcccttgcttgtaatgcttcggataagatcatcgagcccattcagtcccgct 
gtgcagtcctccggtacacaaagctgaccgacgcccagatcctcaccaggctgatgaatg 
ttatcgagaaggagagggtaccctacactgatgacggcctagaagccatcatcttcacgg 
cccagggagacatgaggcaggcgctgaacaacctgcagtccaccttctcaggatttggct 
tcattaacagtgagaacgtgttcaaggtctgtgacgagccccacccactgctggtaaagg 
agatgatccagcactgtgtgaatgccaacattgacgaagcctacaagattcttgctcact 
tgtggcatctgggctactcaccagaagatatcattggcaacatctttcgagtgtgtaaaa 
ctttccaaatggcagaatacctgaaactggagtttatcaaggaaattggatacactcaca 
tgaaaatagcggaaggagtgaactctcttttgcagatggcaggcctcctggcaaggctgt 
gtcagaagacaatggccccggtggccagttagagcagagacttcactgactgacttacag 
gtgccctattctgaggtacaggagccgcggctttctgatgggggaaaatgccgccttagg 
ctggagccaacatgactgtcctttaaactccagtggctggccaggcacggtagctcacgc 
ctgtaatcccaacactttgggaggccgaggcaggtggatcacctgaggtcagaagttcaa 
gaccagcctggccaacatggggaaaccctgtctttactaaaaatataaaaattagctggg 
tgtggtggcgggcacctgtaatcccagctactcgggaggctgtggcaggagaatcgcttg 
aacccaggaggtggaggttgcagtgagccaagatcacaccattgcactccagcctgggcg 
acagagtctccatctggggaaaaaaattaaataaataaactcccgtgacttgca 

SEQ ID NO: 349. >T62520_T14 # TY Transcript # LN 1666 # Source Gene: T62520 
# Encoded protein: T62520_P4 

caccccaaaacatayggctaggtacgaggctgggtggtgcgaagaggaggcgcgagaatg 
gaggtggaggccgtctgtggtggcgcgggcgaggtggaggcccaggactctgaccctgcc 
cctgccttcagcaaggcccccggcagcgccggccactacgaactgccgtgggttgaaaaa 
tataggccagtaaagctgaatgaaattgtcgggaatgaagacaccgtgagcaggctagag 
gtctttgcaagggaaggaaatgtgcccaacatcatcattgcggggtcagaggtgacggag 
atgcaaacagcagtgaacataactcaagagctacctgaggtccttctggtggtttgaagt 
ctttactggagatctcattcccagggccctccaggaaccggcaagaccacaagcattctg 
tgcttggcccgggccctgctgggcccagcactcaaagatgccatgttggaactcaatgct 
tcaaatgacaggggcattgacgttgtgaggaataaaattaaaatgtttgctcaacaaaaa 
gtcactcttcccaaaggccgacataagatcatcattctggatgaagcagacagcatgacc 
gacggagcccagcaagccttgaggagaaccatggaaatctactctaaaaccactcgcttc 
gcccttgcttgtaatgcttcggataagatcatcgagcccattcagtcccgctgtgcagtc 
ctccggtacacaaagctgaccgacgcccagatcctcaccaggctgatgaatgttatcgag 
aaggagagggtaccctacactgatgacggcctagaagccatcatcttcacggcccaggga 
gacatgaggcaggcgctgaacaacctgcagtccaccttctcaggatttggcttcattaac 
agtgagaacgtgttcaaggtctgtgacgagccccacccactgctggtaaaggagatgatc 
cagcactgtgtgaatgccaacattgacgaagcctacaagattcttgctcacttgtggcat 
ctgggctactcaccagaagatatcattggcaacatctttcgagtgtgtaaaactttccaa 
atggcagaatacctgaaactggagtttatcaaggaaattggatacactcacatgaaaata 
gcggaaggagtgaactctcttttgcagatggcaggcctcctggcaaggctgtgtcagaag 
acaatggccccggtggccagttagagcagagacttcactgactgacttacaggtgcccta 
ttctgaggtacaggagccgcggctttctgatgggggaaaatgccgccttaggctggagcc 
aacatgactgtcctttaaactccagtggctggccaggcacggtagctcacgcctgtaatc 
ccaacactttgggaggccgaggcaggtggatcacctgaggtcagaagttcaagaccagcc 
tggccaacatggggaaaccctgtctttactaaaaatataaaaattagctgggtgtggtgg 
cgggcacctgtaatcccagctactcgggaggctgtggcaggagaatcgcttgaacccagg 
aggtggaggttgcagtgagccaagatcacaccattgcactccagcctgggcgacagagtc 
tccatctggggaaaaaaattaaataaataaactcccgtgacttgca 
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SEQ ID NO:350. >T62520_T15 # TY Transcript # LN 1457 # Source Gene: T62520 

# Encoded protein: T62520_P5 

caccccaaaacatayggctaggtacgaggctgggtggtgcgaagaggaggcgcgagaatg 
gaggtggaggccgtctgtggtggcgcgggcgaggtggaggcccaggactctgaccctgcc 
cctgccttcagcaaggcccccggcagcgccggccactacgaactgccgtgggttgaaaaa 
tataggccagtaaagctgaatgaaattgtcgggaatgaagacaccgtgagcaggctagag 
gtctttgcaagggaaggaaatgtgcccaacatcatcattgcggggcattgacgttgtgag 
gaataaaattaaaatgtttgctcaacaaaaagtcactcttcccaaaggccgacataagat 
catcattctggatgaagcagacagcatgaccgacggagcccagcaagccttgaggagaac 
catggaaatctactctaaaaccactcgcttcgcccttgcttgtaatgcttcggataagat 
catcgagcccattcagtcccgctgtgcagtcctccggtacacaaagctgaccgacgccca 
gatcctcaccaggctgatgaatgttatcgagaaggagagggtaccctacactgatgacgg 
cctagaagccatcatcttcacggcccagggagacatgaggcaggcgctgaacaacctgca 
gtccaccttctcaggatttggcttcattaacagtgagaacgtgttcaaggtctgtgacga 
gccccacccactgctggtaaaggagatgatccagcactgtgtgaatgccaacattgacga 
agcctacaagattcttgctcacttgtggcatctgggctactcaccagaagatatcattgg 
caacatctttcgagtgtgtaaaactttccaaatggcagaatacctgaaactggagtttat 
caaggaaattggatacactcacatgaaaatagcggaaggagtgaactctcttttgcagat 
ggcaggcctcctggcaaggctgtgtcagaagacaatggccccggtggccagttagagcag 
agacttcactgactgacttacaggtgccctattctgaggtacaggagccgcggctttctg 
atgggggaaaatgccgccttaggctggagccaacatgactgtcctttaaactccagtggc 
tggccaggcacggtagctcacgcctgtaatcccaacactttgggaggccgaggcaggtgg 
atcacctgaggtcagaagttcaagaccagcctggccaacatggggaaaccctgtctttac 
taaaaatataaaaattagctgggtgtggtggcgggcacctgtaatcccagctactcggga 
ggctgtggcaggagaatcgcttgaacccaggaggtggaggttgcagtgagccaagatcac 
accattgcactccagcctgggcgacagagtctccatctggggaaaaaaattaaataaata 
aactcccgtgacttgca 

SEQ ID NO: 351. >T62520_T16 # TY Transcript # LN 1715 8 Source Gene: T62520 

# Encoded protein: T62520_P4 

caccccaaaacatayggctaggtacgaggctgggtggtgcgaagaggaggcgcgagaatg 
gaggtggaggccgtctgtggtggcgcgggcgaggtggaggcccaggactctgaccctgcc 
cctgccttcagcaaggcccccggcagcgccggccactacgaactgccgtgggttgaaaaa 
tataggccagtaaagctgaatgaaattgtcgggaatgaagacaccgtgagcaggctagag 
tcatccagaagcaagttttaaccggttttaatgatcattttcgccgtaggtctttgcaag 
ggaaggaaatgtgcccaacatcatcattgcggggtcagaggtgacggagatgcaaacagc 
agtgaacataactcaagagctacctgaggtccttctggtggtttgaagtctttactggag 
atctcattcccagggccctccaggaaccggcaagaccacaagcattctgtgcttggcccg 
ggccctgctgggcccagcactcaaagatgccatgttggaactcaatgcttcaaatgacag 
gggcattgacgttgtgaggaataaaattaaaatgtttgctcaacaaaaagtcactcttcc 
caaaggccgacataagatcatcattctggatgaagcagacagcatgaccgacggagccca 
gcaagccttgaggagaaccatggaaatctactctaaaaccactcgcttcgcccttgcttg 
taatgcttcggataagatcatcgagcccattcagtcccgctgtgcagtcctccggtacac 
aaagctgaccgacgcccagatcctcaccaggctgatgaatgttatcgagaaggagagggt 
accctacactgatgacggcctagaagccatcatcttcacggcccagggagacatgaggca 
ggcgctgaacaacctgcagtccaccttctcaggatttggcttcattaacagtgagaacgt 
gttcaaggtctgtgacgagccccacccactgctggtaaaggagatgatccagcactgtgt 
gaatgccaacattgacgaagcctacaagattcttgctcacttgtggcatctgggctactc 
accagaagatatcattggcaacatctttcgagtgtgtaaaactttccaaatggcagaata 
cctgaaactggagtttatcaaggaaattggatacactcacatgaaaatagcggaaggagt 
gaactctcttttgcagatggcaggcctcctggcaaggctgtgtcagaagacaatggcccc 
ggtggccagttagagcagagacttcactgactgacttacaggtgccctattctgaggtac 
aggagccgcggctttctgatgggggaaaatgccgccttaggctggagccaacatgactgt 
cctttaaactccagtggctggccaggcacggtagctcacgcctgtaatcccaacactttg 
ggaggccgaggcaggtggatcacctgaggtcagaagttcaagaccagcctggccaacatg 
gggaaaccctgtctttactaaaaatataaaaattagctgggtgtggtggcgggcacctgt 
aatcccagctactcgggaggctgtggcaggagaatcgcttgaacccaggaggtggaggtt 
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gcagtgagccaagatcacaccattgcactccagcctgggcgacagagtctccatctgggg 
aaaaaaattaaataaataaactcccgtgacttgca 

SEQ ID NO: 352 . >T62520_T17 8 TY Transcript # LN 1564 # Source Gene: T62520 

# Encoded protein: T62520_P6 

caccccaaaacatayggctaggtacgaggctgggtggtgcgaagaggaggcgcgagaatg 
gaggtggaggccgtctgtggtggcgcgggcgaggtggaggcccaggactctgaccctgcc 
cctgccttcagcaaggcccccggcagcgccggccactacgaactgccgtgggttgaaaaa 
tataggccagtaaagctgaatgaaattgtcgggaatgaagacaccgtgagcaggctagag 
gtctttgcaagggaaggaaatgtgcccaacatcatcattgcggggtcagaggtgacggag 
atgcaaacagcagtgaacataactcaagagctacctgaggtccttctggtggtttgaagt 
ctttactggagatctcattcccagggccctccaggaaccggcaagaccacaagcattctg 
tgcttggcccgggccctgctgggcccagcactcaaagatgccatgttggaactcaatgct 
tcaaatgacagcatgaccgacggagcccagcaagccttgaggagaaccatggaaatctac 
tctaaaaccactcgcttcgcccttgcttgtaatgcttcggataagatcatcgagcccatt 
cagtcccgctgtgcagtcctccggtacacaaagctgaccgacgcccagatcctcaccagg 
ctgatgaatgttatcgagaaggagagggtaccctacactgatgacggcctagaagccatc 
atcttcacggcccagggagacatgaggcaggcgctgaacaacctgcagtccaccttctca 
ggatttggcttcattaacagtgagaacgtgttcaaggtctgtgacgagccccacccactg 
ctggtaaaggagatgatccagcactgtgtgaatgccaacattgacgaagcctacaagatt 
cttgctcacttgtggcatctgggctactcaccagaagatatcattggcaacatctttcga 
gtgtgtaaaactttccaaatggcagaatacctgaaactggagtttatcaaggaaattgga 
tacactcacatgaaaatagcggaaggagtgaactctcttttgcagatggcaggcctcctg 
gcaaggctgtgtcagaagacaatggccccggtggccagttagagcagagacttcactgac 
tgacttacaggtgccctattctgaggtacaggagccgcggctttctgatgggggaaaatg 
ccgccttaggctggagccaacatgactgtcctttaaactccagtggctggccaggcacgg 
tagctcacgcctgtaatcccaacactttgggaggccgaggcaggtggatcacctgaggtc 
agaagttcaagaccagcctggccaacatggggaaaccctgtctttactaaaaatataaaa 
attagctgggtgtggtggcgggcacctgtaatcccagctactcgggaggctgtggcagga 
gaatcgcttgaacccaggaggtggaggttgcagtgagccaagatcacaccattgcactcc 
agcctgggcgacagagtctccatctggggaaaaaaattaaataaataaactcccgtgact 
tgca 

SEQ ID NO: 353. >T62520_T18 # TY Transcript # LN 1355 # Source Gene: T62520 

# Encoded protein: T62520_P7 

caccccaaaacatayggctaggtacgaggctgggtggtgcgaagaggaggcgcgagaatg 
gaggtggaggccgtctgtggtggcgcgggcgaggtggaggcccaggactctgaccctgcc 
cctgccttcagcaaggcccccggcagcgccggccactacgaactgccgtgggttgaaaaa 
tataggccagtaaagctgaatgaaattgtcgggaatgaagacaccgtgagcaggctagag 
gtctttgcaagggaaggaaatgtgcccaacatcatcattgcgcatgaccgacggagccca 
gcaagccttgaggagaaccatggaaatctactctaaaaccactcgcttcgcccttgcttg 
taatgcttcggataagatcatcgagcccattcagtcccgctgtgcagtcctccggtacac 
aaagctgaccgacgcccagatcctcaccaggctgatgaatgttatcgagaaggagagggt 
accctacactgatgacggcctagaagccatcatcttcacggcccagggagacatgaggca 
ggcgctgaacaacctgcagtccaccttctcaggatttggcttcattaacagtgagaacgt 
gttcaaggtctgtgacgagccccacccactgctggtaaaggagatgatccagcactgtgt 
gaatgccaacattgacgaagcctacaagattcttgctcacttgtggcatctgggctactc 
accagaagatatcattggcaacatctttcgagtgtgtaaaactttccaaatggcagaata 
cctgaaactggagtttatcaaggaaattggatacactcacatgaaaatagcggaaggagt 
gaactctcttttgcagatggcaggcctcctggcaaggctgtgtcagaagacaatggcccc 
ggtggccagttagagcagagacttcactgactgacttacaggtgccctattctgaggtac 
aggagccgcggctttctgatgggggaaaatgccgccttaggctggagccaacatgactgt 
cctttaaactccagtggctggccaggcacggtagctcacgcctgtaatcccaacactttg 
ggaggccgaggcaggtggatcacctgaggtcagaagttcaagaccagcctggccaacatg 
gggaaaccctgtctttactaaaaatataaaaattagctgggtgtggtggcgggcacctgt 
aatcccagctactcgggaggctgtggcaggagaatcgcttgaacccaggaggtggaggtt 
gcagtgagccaagatcacaccattgcactccagcctgggcgacagagtctccatctgggg 
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aaaaaaattaaataaataaactcccgtgacttgca 

SEQ ID NO:354. >T62520_T19 # TY Transcript ft LN 1670 # Source Gene: T62520 

# Encoded protein: T62520_P8 

gctgtggtgtcgctgggctgaggtggaggcccaggactctgaccctgaccccatgctctt 
cgagacaacggctccccaggcatgcctgtcgcaggtcctatccttagctgaaatcgttgc 
ccgctaggtggtttgcacaagacatatcacgcggcgcacgtactatgtcttgaatagtga 
atagttagtcctngtgagagtgatatcgactatcgcagatgaaggtctatgggtcatagt 
agggctcatatctgcgaacgggacatcgcaaactgtgctcaaaatcattcattgcgggtc 
cctccagagtaacctggccaagaatgcaacaaaagcgaattccttgctgactgtagcgac 
cacgtgtgcacctgacttggtgcccagacactccgaaagatgccatgttgtacactctat 
gcttcatatgactggggccacttgcgattgtgagggaatatgcttatgatgtatgcctct 
ggcggacatggtctatctcttcccaacaggtgctgacaactcatgaatcgactctacttg 
ctgttattgcaaggctaaggatcatgccacttgatgcagtaatgatgcgccagttgagag 
ccgttgtcagtggattaacctattggaaattctatctctaaacaatcatctcgtcgttgc 
tgcttcattgccttgtaatgcttcggataagatcatcgagcccattcagtcccgctgtgc 
agtcctccggtacacaaagctgaccgacgcccagatcctcaccaggctgatgaatgttat 
cgagaaggagagggtaccctacactgatgacggcctagaagccatcatcttcacggccca 
gggagacatgaggcaggcgctgaacaacctgcagtccaccttctcaggatttggcttcat 
taacagtgagaacgtgttcaaggtctgtgacgagccccacccactgctggtaaaggagat 
gatccagcactgtgtgaatgccaacattgacgaagcctacaagattcttgctcacttgtg 
gcatctgggctactcaccagaagatatcattggcaacatctttcgagtgtgtaaaacttt 
ccaaatggcagaatacctgaaactggagtttatcaaggaaattggatacactcacatgaa 
aatagcggaaggagtgaactctcttttgcagatggcaggcctcctggcaaggctgtgtca 
gaagacaatggccccggtggccagttagagcagagacttcactgactgacttacaggtgc 
cctattctgaggtacaggagccgcggctttctgatgggggaaaatgccgccttaggctgg 
agccaacatgactgtcctttaaactccagtggctggccaggcacggtagctcacgcctgt 
aatcccaacactttgggaggccgaggcaggtggatcacctgaggtcagaagttcaagacc 
agcctggccaacatggggaaaccctgtctttactaaaaatataaaaattagctgggtgtg 
gtggcgggcacctgtaatcccagctactcgggaggctgtggcaggagaatcgcttgaacc 
caggaggtggaggttgcagtgagccaagatcacaccattgcactccagcctgggcgacag 
agtctccatctggggaaaaaaattaaataaataaactcccgtgacttgca 

SEQ ID NO: 355. >T62520_T20 # TY Transcript # LN 956 # Source Gene: T62520 

# Encoded protein: T62520_P9 

caccccaaaacatayggctaggtacgaggctgggtggtgcatcctcaccaggctgatgaa 
tgttatcgagaaggagagggtaccctacactgatgacggcctagaagccatcatcttcac 
ggcccagggagacatgaggcaggcgctgaacaacctgcagtccaccttctcaggatttgg 
cttcattaacagtgagaacgtgttcaaggtctgtgacgagccccacccactgctggtaaa 
ggagatgatccagcactgtgtgaatgccaacattgacgaagcctacaagattcttgctca 
cttgtggcatctgggctactcaccagaagatatcattggcaacatctttcgagtgtgtaa 
aactttccaaatggcagaatacctgaaactggagtttatcaaggaaattggatacactca 
catgaaaatagcggaaggagtgaactctcttttgcagatggcaggcctcctggcaaggct 
gtgtcagaagacaatggccccggtggccagttagagcagagacttcactgactgacttac 
aggtgccctattctgaggtacaggagccgcggctttctgatgggggaaaatgccgcctta 
ggctggagccaacatgactgtcctttaaactccagtggctggccaggcacggtagctcac 
gcctgtaatcccaacactttgggaggccgaggcaggtggatcacctgaggtcagaagttc 
aagaccagcctggccaacatggggaaaccctgtctttactaaaaatataaaaattagctg 
ggtgtggtggcgggcacctgtaatcccagctactcgggaggctgtggcaggagaatcgct 
tgaacccaggaggtggaggttgcagtgagccaagatcacaccattgcactccagcctggg 
cgacagagtctccatctggggaaaaaaattaaataaataaactcccgtgacttgca 

SEQ ID NO: 356. >T62520_T21 # TY Transcript # LN 1109 # Source Gene: T62520 

# Encoded protein: T62520_P1 

caatttgagtttccatttctcggatttgggaactggtataagcattgtctgtgatgtaaa 
caaagtcttcaatatttggagaaaacatctcctcatacttgagagcacaagaggaagaga 
gagaccctcactgctggggagtccctgccacactcagtcccccaccacactgaatcggaa 
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ttccgagagggaagaggaggcgcgagaatggaggtggaggccgtctgtggtggcgcgggc 
gaggtggaggcccaggactctgaccctgcccctgccttcagcaaggcccccggcagcgcc 
ggccactacgaactgccgtgggttgaaaaatataggccagtaaagctgaatgaaattgtc 
gggaatgaagacaccgtgagcaggctagaggtctttgcaagggaaggaaatgtgcccaac 
atcatcattgcgggccctccaggaaccggcaagaccacaagcattctgtgcttggcccgg 
gccctgctgggcccagcactcaaagatgccatgttggaactcaatgcttcaaatgacagg 
ggcattgacgttgtgaggaataaaattaaaatgtttgctcaacaaaaagtcactcttccc 
aaaggccgacataagatcatcattctggatgaagcagacagcatgaccgacggagcccag 
caagccttgaggagaaccatggaaatctactctaaaaccactcgcttcgcccttgcttgt 
aatgcttcggataagatcatcgagcccattcagtcccgctgtgcagtcctccggtacaca 
aagctgaccgacgcccagatcctcaccaggctgatgaatgttatcgagaaggagagggta 
ccctacactgatgacggcctagaagccatcatcttcacggcccagggagacatgaggcag 
gcgctgaacaacctgcagtccaccttctcaggatttggcttcattaacagtgagaacgtg 
ttcaaggtctgtgagagrccmcccamtggtggtcacggcgatgatccagactgtgtgaca 
tgcaaaattgacgaagctacaaaatatggtcaactgtggcatctgggatactcacagaaa 
atatctggcaacatttacgatgtttaaca 

SEQ ID NO: 357. >T62520_T22 # TY Transcript # LN 1556 # Source Gene: T62520 

# Encoded protein: T62520_P10 

caatttgagtttccatttctcggatttgggaactggtataagcattgtctgtgatgtaaa 
caaagtcttcaatatttggagaaaacatctcctcatacttgagagcacaagaggaagaga 
gagaccctcactgctggggagtccctgccacactcagtcccccaccacactgaatcggaa 
ttccgagagggaagaggaggcgcgagaatggaggtggaggccgtctgtggtggcgcgggc 
gaggtggaggcccaggactctgaccctgcccctgccttcagcaaggcccccggcagcgcc 
ggccactacgaactgccgtgggttgaaaaatataggccagtaaagctgaatgaaattgtc 
gggaatgaagacaccgtgagcaggctagaggtctttgcaagggaaggaaatgtgcccaac 
atcatcattgcgggccctccaggaaccggcaagaccacaagcattctgtgcttggcccgg 
gccctgctgggcccagcactcaaagatgccatgttggaactcaatgcttcaaatgacagg 
ggcattgacgttgtgaggaataaaattaaaatgtttgctcaacaaaaagtcactcttccc 
aaaggccgacataagatcatcattctggatgaagcagacagcatgaccgacggagcccag 
caagccttgaggagaaccatggaaatctactctaaaaccactcgcttcgcccttgcttgt 
aatgcttcggataagatcatcggcgctgaacaacctgcagtccaccttctcaggatttgg 
cttcattaacagtgagaacgtgttcaaggtctgtgacgagccccacccactgctggtaaa 
ggagatgatccagcactgtgtgaatgccaacattgacgaagcctacaagattcttgctca 
cttgtggcatctgggctactcaccagaagatatcattggcaacatctttcgagtgtgtaa 
aactttccaaatggcagaatacctgaaactggagtttatcaaggaaattggatacactca 
catgaaaatagcggaaggagtgaactctcttttgcagatggcaggcctcctggcaaggct 
.gtgtcagaagacaatggccccggtggccagttagagcagagacttcactgactgacttac 
aggtgccctattctgaggtacaggagccgcggctttctgatgggggaaaatgccgcctta 
ggctggagccaacatgactgtcctttaaactccagtggctggccaggcacggtagctcac 
gcctgtaatcccaacactttgggaggccgaggcaggtggatcacctgaggtcagaagttc 
aagaccagcctggccaacatggggaaaccctgtctttactaaaaatataaaaattagctg 
ggtgtggtggcgggcacctgtaatcccagctactcgggaggctgtggcaggagaatcgct 
tgaacccaggaggtggaggttgcagtgagccaagatcacaccattgcactccagcctggg 
cgacagagtctccatctggggaaaaaaattaaataaataaactcccgtgacttgca 

SEQ ID NO: 358. >T62520_T23 # TY Transcript # LN 847 # Source Gene: T62520 

# Encoded protein: 

gagcgaggcctgggtggcrattcggacgaggcggcgctgaacaacctgcagtccaccttc 
tcaggatttggcttcattaacagtgagaacgtgttcaaggtctgtgacgagccccaccca 
ctgctggtaaaggagatgatccagcactgtgtgaatgccaacattgacgaagcctacaag 
attcttgctcacttgtggcatctgggctactcaccagaagatatcattggcaacatcttt 
cgagtgtgtaaaactttccaaatggcagaatacctgaaactggagtttatcaaggaaatt 
ggatacactcacatgaaaatagcggaaggagtgaactctcttttgcagatggcaggcctc 
ctggcaaggctgtgtcagaagacaatggccccggtggccagttagagcagagacttcact 
gactgacttacaggtgccctattctgaggtacaggagccgcggctttctgatgggggaaa 
atgccgccttaggctggagccaacatgactgtcctttaaactccagtggctggccaggca 
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cggtagctcacgcctgtaatcccaacactttgggaggccgaggcaggtggatcacctgag 
gtcagaagttcaagaccagcctggccaacatggggaaaccctgtctttactaaaaatata 
aaaattagctgggtgtggtggcgggcacctgtaatcccagctactcgggaggctgtggca 
ggagaatcgcttgaacccaggaggtggaggttgcagtgagccaagatcacaccattgcac 
tccagcctgggcgacagagtctccatctggggaaaaaaattaaataaataaactcccgtg 
acttgca 

SEQ ID NO:359. >T62520JT24 # TY Transcript # LN 1067 # Source Gene: T62520 

# Encoded protein: 

attgtgccactacactccatgctgggtgacagagtgagactctgtctcaaaaaaaaaaaa 
aaaaaaaaaaagtcacccttggcccggtgcggtggctcatgcctgtaatcccagcacttg 
ggaggcggaggtgggagaatcgcttaagcccaggagttcgagaccagcctgggcaacata 
gcgagactcctgtctctactcaaaaagaaataaatataaaaatcatccttgacatgtgtt 
tgctcctgaacaggcgctgaacaacctgcagtccaccttctcaggatttggcttcattaa 
cagtgagaacgtgttcaaggtctgtgacgagccccacccactgctggtaaaggagatgat 
ccagcactgtgtgaatgccaacattgacgaagcctacaagattcttgctcacttgtggca 
tctgggctactcaccagaagatatcattggcaacatctttcgagtgtgtaaaactttcca 
aatggcagaatacctgaaactggagtttatcaaggaaattggatacactcacatgaaaat 
agcggaaggagtgaactctcttttgcagatggcaggcctcctggcaaggctgtgtcagaa 
gacaatggccccggtggccagttagagcagagacttcactgactgacttacaggtgccct 
attctgaggtacaggagccgcggctttctgatgggggaaaatgccgccttaggctggagc 
caacatgactgtcctttaaactccagtggctggccaggcacggtagctcacgcctgtaat 
cccaacactttgggaggccgaggcaggtggatcacctgaggtcagaagttcaagaccagc 
ctggccaacatggggaaaccctgtctttactaaaaatataaaaattagctgggtgtggtg 
gcgggcacctgtaatcccagctactcgggaggctgtggcaggagaatcgcttgaacccag 
gaggtggaggttgcagtgagccaagatcacaccattgcactccagcctgggcgacagagt 
ctccatctggggaaaaaaattaaataaataaactcccgtgacttgca 

SEQ ID NO:360. >T62520_T25 # TY Transcript # LN 1391 # Source Gene: T62520 

# Encoded protein: 

caggaaaccatggcttccaccgggaaaaggcaagggcaggtgtgggtgacctctggagcc 
atcaggaggctttcattgggaagatgtaggggtcgacattgtaccagaggcagagcctca 
tgccactgcggagaatgctgcagggaggacggagtccccagcaccatgcacctgcctgct 
ggagtagcctgaccccttgtttagggtgtcaggctgaggctgggggaccttttgtaggga 
aggccagttcctactccggggtggaggtgggaattggcccaaaagacctcccagttctaa 
agtgaattccaagtggttctgggcaactgaccccccaagtgatctaagacagatgacttg 
tgactttaaagctgtccccaagtgagcatccttcacaggcccttggttgctctcttgttc 
atctgtcttatgtttgggaaatttatcttcctggtgccggaagcacttctctttgatggt 
gagaggagcattccccaaactcatcctggttgctggagaatttgatgaaaacaacaggtt 
gagaagttttacccatagattgcctggaagtcggtctgttggggcctccccgctcacatt 
gtcccttctagcccctgaccgcctttcctttccctttggccaggtctgtgacgagcccca 
cccactgctggtaaaggagatgatccagcactgtgtgaatgccaacattgacgaagccta 
caagattcttgctcacttgtggcatctgggctactcaccagaagatatcattggcaacat 
ctttcgagtgtgtaaaactttccaaatggcagaatacctgaaactggagtttatcaagga 
aattggatacactcacatgaaaatagcggaaggagtgaactctcttttgcagatggcagg 
cctcctggcaaggctgtgtcagaagacaatggccccggtggccagttagagcagagactt 
cactgactgacttacaggtgccctattctgaggtacaggagccgcggctttctgatgggg 
gaaaatgccgccttaggctggagccaacatgactgtcctttaaactccagtggctggcca 
ggcacggtagctcacgcctgtaatcccaacactttgggaggccgaggcaggtggatcacc 
tgaggtcagaagttcaagaccagcctggccaacatggggaaaccctgtctttactaaaaa 
tataaaaattagctgggtgtggtggcgggcacctgtaatcccagctactcgggaggctgt 
ggcaggagaatcgcttgaacccaggaggtggaggttgcagtgagccaagatcacaccatt 
gcactccagcctgggcgacagagtctccatctggggaaaaaaattaaataaataaactcc 
cgtgacttgca 

SEQ ID NO: 361. >T62520JT26 # TY Transcript # LN 960 # Source Gene: T62520 

# Encoded protein: T62520_P9 
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caccccaaaacatayggctaggtacgaggctgggtggtgcatcctcaccaggctgatgaa 
tgttatcgagaaggagagggtaccctacactgatgacggcctagaagccatcatcttcac 
ggcccagggagacatgaggcaggcgctgaacaacctgcagtccaccttctcaggatttgg 
cttcattaacagtgagaacgtgttcaaggtctgtgacgagccccacccactgctggtaaa 
ggagatgatccagcactgtgtgaatgccaacattgacgaagcctacaagattcttgctca 
cttgtggcatctgggctactcaccagaagatatcattggcaacatctttcgagtgtgtaa 
aactttccaaatggcagaatacctgaaactggagtttatcaaggaaattggatacactca 
catgaaaatagcggaaggagtgaactctcttttgcagatggcaggcctcctggcaaggct 
gtgtcagaagacaatggccccggtggccagttagagcagagacttcactgactgacttac 
aggtgccctattctgaggtacaggagccgcggctttctgatgggggaaaatgccgcctta 
ggctggagccaacatgactgtcctttaaactccagtggctggccaggcacggtagctcac 
gcctgtaatcccaacactttgggaggccgaggcaggtggatcacctgaggtcagaagttc 
aagaccagcctggccaacatggggaaaccctgtctttactaaaaatataaaaattagctg 
ggtgtggtggcgggcacctgtaatcccagctactcgggaggctgtgtgagaatcgtgaac 
cagagtggggttgatgagcagaaacatgatcaggctgggagagtcttgggaaataatata 
atcgggtgaaaaaaaaacggctttggcgaaggtagcggagtcccctttagggaggctggt 

SEQ ID NO: 362. >T62520_T27 # TY Transcript # LN 821 # Source Gene: T62520 

# Encoded protein: T62520__P11 

gagcgaggcctgggtggcrattcggacgaggcgatatcattggcaacatctttcgagtgt 
gtaaaactttccaaatggcagaatacctgaaactggagtttatcaaggaaattggataca 
ctcacatgaaaatagcggaaggagtgaactctcttttgcagatggcaggcctcctggcaa 
ggctgtgtcagaagacaatggccccggtggccagttagagcagagacttcactgactgac 
ttacaggtgccctattctgaggtacaggagccgcggctttctgatgggggaaaatgccgc 
cttaggctggagccaacatgactgtcctttaaactccagtggctggccaggcacggtagc 
tcacgcctgtaatcccaacactttgggaggccgaggcaggtggatcacctgaggtcagaa 
gttcaagaccagcctggccaacatggggaaaccctgtctttactaaaaatataaaaatta 
gctgggtgtggtggcgggcacctgtaatcccagctactcgggaggctgtggcaggagaat 
cgcttgaacccaggaggtggaggttgcagtgagccaagatcacaccattgcactccagcc 
tgggcgacagagtctccatctggggaaaaaaattaaataaataaactcccgtgacttgca 
tgtttgcttctgggacgtctgtgccccgcaagtgtttgagtttggcctccaccccattga 
tgcggtgacggggcggaaaggcgcagagaagctgagggcggtctctgatctgtgtgtggg 
ttgacattttagctaataaagccttgcagtgtttgttggca 

SEQ ID NO: 363. >T62520_T28 # TY Transcript # LN 984 # Source Gene: T62520 
< Encoded protein: T62520_P6 

caccccaaaacatayggctaggtacgaggctgggtggtgcgaagaggaggcgcgagaatg 
gaggtggaggccgtctgtggtggcgcgggcgaggtggaggcccaggactctgaccctgcc 
cctgccttcagcaaggcccccggcagcgccggccactacgaactgccgtgggttgaaaaa 
tataggccagtaaagctgaatgaaattgtcgggaatgaagacaccgtgagcaggctagag 
gtctttgcaagggaaggaaatgtgcccaacatcatcattgcggggtcagaggtgacggag 
atgcaaacagcagtgaacataactcaagagctacctgaggtccttctggtggtttgaagt 
ctttactggagatctcattcccagggccctccaggaaccggcaagaccacaagcattctg 
tgcttggcccgggccctgctgggcccagcactcaaagatgccatgttggaactcaatgct 
tcaaatgacagcatgaccgacggagcccagcaagccttgaggagaaccatggaaatctac 
tctaaaaccactcgcttcgcccttgcttgtaatgcttcggataagatcatcgagcccatt 
cagtcccgctgtgcagtcctccggtacacaaagctgaccgacgcccagatcctcaccagg 
ctgatgaatgttatcgagaaggagagggtaccctacactgatgacggcctagaagccatc 
atcttcacggcccagggagacatgaggcaggcgctgaacaacctgcagtccaccttctca 
ggatttggcttcattaacagtgagaacgtgttcaaggtctgtgacgagccccacccactg 
ctggtaaaggagatgatccagcactgtgtgaatgccaacattgacgaagcctacaagatc 
cttgsytcmctttgkgggcatytggggctactctcccccgaaaaaattcctttgggcaaa 
cttcttttttcagggggggtaaaa 

SEQ ID NO: 364. >T62520_T29 # TY Transcript # LN 437 # Source Gene: T62520 

# Encoded protein: T62520_P12 

ctgaatttgtgtcatgaaggactggagagaaccaattcccacaagtgtgatggcagccct 
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tgtagatcgtggttttccatgaccgacggagcccagcaagccttgaggagaaccatggaa 
atctactctaaaaccactcgcttcgcccttgcttgtaatgcttcggataagatcatcgag 
cccattcagtcccgctgtgcagtcctccggtacacaaagctgaccgacgcccagatcctc 
accaggctgatgaatgttatcgagaaggagagggtaccctacactgatgacggcctagaa 
gccatcatcttcacggcccagggagacatgaggcagggcattgacgttgtgaggaataaa 
attaaaatgtttgctcaacaaaaagtcactcttcccaaaggccgacataagatcatcatt 
ctggatgaagcagacag 

SEQ ID NO: 365. >T62520_T30 # TY Transcript # LN 173 # Source Gene: T62520 

# Encoded protein: 

gacagagacctggcaccctcctgggacagtgggctgtctagctcggactccaagaagaga 
ggcagggaagggcagagctttaaaagtcaccccggaggctgaggcaggaggatcacctga 
ggtcaggagttcaagaccagcctggccaacatggtgaaatcccatctctacta 

SEQ ID NO: 366. >T62520_P14 # TY Protein 8 CC #LN 354 # Source Gene: 

T62520 # Encoding Transcript: 1 

MEVEAVCGGAGEVEAQDSDPAPAFSKAPGSAGHYELPWVEKYRPVKLNEIVGNEDTVSRL 
EVFAREGNVPNIIIAGPPGTGKTTSILCLARALLGPALKDAMLELNASNDRGIDWRNKI 
KMFAQQKVTLPKGRHKIIILDEADSMTDGAQQALRRTMEIYSKTTRFALACNASDKIIEP 
IQSRCAVLRYTKLTDAQILTRLMNVIEKERVPYTDDGLEAI I FTAQGDMRQALNNLQSTF 
SGFGFINSENVFKVCDEPHPLLVKEMIQHCVNANIDEAYKILAHLWHLGYSPEDIIGNIF 
RVCKTFQMAEYLKLEFIKEIGYTHMKIAEGVNSLLQMAGLLARLCQKTMAPVAS 

SEQ ID NO: 367. >T62520_P2 # TY Protein # CC #LN 262 # Source Gene: T62520 

# Encoding Transcript: 2 

SEAWVAIRTRRRGGARMEVEAVCGGAGEVEAQDSDPAPAFSKAPGSAGHYELPWVEKYRP 
VKLNEIVGNEDTVSRLEVFAREGNVPNIIIAGPPGTGKTTSILCLARALLGPALKDAMLE 
LNASNDRGI DWRNKI KMFAQQKVTLPKGRHKI I ILDEADSMTDGAQQALRRTMEI YSKT 
TRFALACNASDKIIEPIQSRCAVLRYTKLTDAQILTRLMNVIEKERVPYTDDGLEAIIFT 
AQGDMRQALNNLQSTFLRIWLH 

SEQ ID NO: 368. >T62520_P1 # TY Protein # CC #LN 24 6 * Source Gene: T62520 

# Encoding Transcript: 3 

ME VEAVCGGAGE VEAQDS DPAPAFS KAPGSAGH YELPWVEK YRPVKLNE I VGNE DT VS RL 
EVFAREGNVPNI 1 1 AGPPGTGKTTS ILCLARALLGPALKDAMLELNASNDRGI DWRNKI 
KMFAQQKVTLPKGRHKI I ILDEADSMTDGAQQALRRTMEI YSKTTRFALACNASDKIIEP 
IQSRCAVLRYTKLTDAQILTRLMNVIEKERVPYTDDGLEAI I FTAQGDMRQALNNLQSTF 
LRIWLH 

SEQ ID N0:369. >T62520_P13 # TY Protein # CC #LN 320 # Source Gene: 

T62520 # Encoding Transcript: 9 

MEVEAVCGGAGEVEAQDSDPAPAFSKAPGSAGHYELPWVEKYRPVKLNEIVGNEDTVSRL 
EVFAREGNVPNI I IAGPPGTGKTTSILCLARALLGPALKDAMLELNASNDSMTDGAQQAL 
RRTMEIYSKTTRFALACNASDKIIEPIQSRCAVLRYTKLTDAQILTRLMNVIEKERVPYT 
DDGLEAI I FTAQGDMRQALNNLQSTFSG FG FI NSENVFKVCDE PH PLLVKEM IQHC VNAN 
IDEAYKILAHLWHLGYSPEDIIGNIFRVCKTFQMAEYLKLEFIKEIGYTHMKIAEGVNSL 
LQMAGLLARLCQKTMAPVAS 

SEQ ID NO: 370. >T62520_P4 # TY Protein # CC #LN 145 # Source Gene: T62520 

# Encoding Transcript: 12 

MLELNASNDRGIDWRNKIKMFAQQKVTLPKGRHKIIILDEADSMTDGAQQALRRTMEIY 
SKTTRFALACNASDKIIEPIQSRCAVLRYTKLTDAQILTRLMNVIEKERVPYTDDGLEAI 
IFTAQGDMRQALNNLQSTFLRIWLH 

SEQ ID NO:371. >T62520_P5 # TY Protein # CC #LN 143 # Source Gene: T62520 

# Encoding Transcript: 15 

MC PTSSLRG I DWRNKI KMFAQQKVTLPKGRHKI 1 1 LDEADSMT DGAQQALRRTME I YSK 
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TTRFALACNASDKIIEPIQSRCAVLRYTKLTDAQILTRLMNVIEKERVPYTDDGLEAIIF 
TAQGDMRQALNNLQSTFLRIWLH 

SEQ ID NO: 372. >T62520_P6 # TY Protein # CC ftLN 111 # Source Gene: T62520 

# Encoding Transcript: 17 

MLELNASNDSMTDGAQQALRRTMEIYSKTTRFALACNASDKIIEPIQSRCAVLRYTKLTD 
AQILTRLMNVIEKERVPYTDDGLEAIIFTAQGDMRQALNNLQSTFLRIWLH 

SEQ ID NO: 373. >T62520_P7 # TY Protein < CC ftLN 109 # Source Gene: T62520 

# Encoding Transcript: 18 

MCPT SS LRMT DG AQQALRRTME I Y S KTT RFALACNAS DK 1 1 EP IQS RCAVLR YT KLT DAQ 
ILTRLMNVIEKERVPYTDDGLEAIIFTAQGDMRQALNNLQSTFLRIWLH 

SEQ ID NO: 374. >T62520_P8 # TY Protein # CC #LN 300 # Source Gene: T62520 

# Encoding Transcript: 19 

LWCRWAEVEAQDSDPDPMLFETTAPQACLSQVLSLAEIVARQWCTRHITRRTYYVLNSE 
YLVLVRVISTIADEGLWVIVGLISANGTSQTVLKIIHCGSLQSNLAKNATKANSLLTVAT 
TCAPDLVPRHSERCHWHSMLHMTGATCIVREYAYDVCLWRTWSISSQQVLTTHESTLLA 
VIARLRIMPLDAVMMRQLRAWSGLTYWKFSSLNNHLVVAASLPCNASDKIIEPIQSRCA 
VLRYTKLTDAQI LTRLMNVI EKERVPYTDDGLEAI I FTAQGDMRQALNNLQSTFLRIWLH 

SEQ ID NO: 375. >T62520_P9 # TY Protein # CC #LN 62 # Source Gene: T62520 

# Encoding Transcript: 20 

TPKHMARYEAGWCI LTRLMNVI EKERVPYTDDGLEAI I FTAQGDMRQALNNLQSTFLRIW 
LH 

SEQ ID NO: 376. >T62520_P10 # TY Protein # CC #LN 223 # Source Gene: 

T62520 # Encoding Transcript: 22 

MEVEAVCGGAGEVEAQDSDPAPAFSKAPGSAGHYELPWVEKYRPVKLNEIVGNEDTVSRL 
EVFAREGNVPN 1 1 1 AGPPGTGKTTS I LCLARALLG PALKDAMLELNASNDRG I DWRNKI 
KMFAQQKVTLPKGRHKIIILDEADSMTDGAQQALRRTMEIYSKTTRFALACNASDKIIGA 
EQPAVHLSQDLASLTGENVFKVCDEPHPLLVKGDDPALCECQH 

SEQ ID NO: 377. >T62520_P11 I TY Protein # CC #LN 72 # Source Gene: T62520 

# Encoding Transcript: 27 

SEAWVAIRTRRYHWATSFRVCKTFQMAEYLKLEFIKEIGYTHMKIAEGVNSLLQMAGLLA 
RLCQKTMAPVAS 

SEQ ID NO: 378. >T62520_P12 ft TY Protein ft CC #LN 145 # Source Gene: 

T62520 # Encoding Transcript: 29 

LNLCHEGLERTNSHKCDGSPCRSWFSMTDGAQQALRRTMEIYSKTTRFALACNASDKIIE 
P I QSRCAVLRYTKLTDAQ I LTRLMNVI EKERVPYTDDGLEAI I FT AQGDMRQG I DWRNK 
IKMFAQQKVTLPKGRHKIIILDEAD 



SEQ ID NO: 379. >T83032 # TY Consensus ft Length 6290 ft Number of exons 25 

ctcccagcgtgctttgcggcgggccggcccgctttacccagagtcgccctgccgcaatcg 

cgcgtctttccaccgaggccccggatgtagattccctcccccgttcagtggtcgtggcct 

cacagcgactctaagacttggggctctctcattggctgtaactcttccactggattggta 

gcaaaaaaagaggcggtgcccaaggcgaaaggctctgtgactacagccaatcagaatcga 

ggccgggctttggcgggaggtgggaacgctgtggccattcggatttggcgcgagcgcggc 

tggagtttgctgctgccgctgtgcagtttgttcaggggcttgtggtggtgagtccgagag 

gctgcgtgtgagagacgtgagaaggatcctgcactgaggaggtggaaagaagaggattgc 
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tcgaggaggcctggggtctgtgaggcagcggagctgggtgaaggctgcgggttccggcga 
ggcctgagctgtgctgtcgtcatgcctcaaacccgatcccaggcacaggctacaatcagt 
tttccaaaaaggaagctgtctcgggcattgaacaaagctaaaaactccagtgatgccaaa 
ctagaaccaacaaatgtccaaaccgtaacctgttctcctcgtgtaaaagccctgcctctc 
agccccaggaaacgtctggggtgatccaactgcctcggcctccctaagtgctgggattat 
agggcgatgacaacctatgcaacactccccatttacctccttgttctccaccaaagcaag 
gcaagaaagagaatggtccccctcactcacatacacttaagggacgaagattggtatttg 
acaatcagctgacaattaagtctcctagcaaaagagaactagccaaagttcaccaaaaca 
aaatactttcttcagttagaaaaagtcaagagatcacaacaaattctgagcagagatgtc 
cactgaagaaagaatctgcatgtgtgagactattcaagcaagaaggcacttgctaccagc 
aagcaaagctggtcctgaacacagctgtcccagatcggctgcctgccagggaaagggaga 
tggatgtcatcaggaatttcttgagggaacacatctgtgggaaaaaagctggaagccttt 
acctttctggtgctcctggaactggaaaaactgcctgcttaagccggattctgcaagacc 
tcaagcgtgagggtcaggagttcaagaccagcctggaaggaactgaaaggctttaaaact 
atcatgctgaattgcatgtccttgaggactgcccaggctgtattcccagctattgctcag 
gagatttgtcaggaagaggtatccaggccagctgggaaggacatgatgaggaaattggaa 
aaacatatgactgcagagaagggccccatgattgtgttggtattggacgagatggatcaa 
ctggacagcaaaggccaggatgtattgtacacgctatttgaatggccatggctaagcaat 
tctcacttggtgctgattggtattgctaataccctggatctcacagatagaattctacct 
aggcttcaagctagagaaaaatgtaagccacagctgttgaacttcccaccttataccaga 
aatcagatagtcactattttgcaagatcgacttaatcagttttagatgagttttgctctt 
atcgcccaggctggagtgcaatggcgtgatctcggctctctacagcctctgcctcacggg 
ttcaagcgattctcctgcctcagcctccccagtagctaggactacaggcacctgccacca 
cgcccagctaattgttgtatttttagtatctagagatcaggttctggacaatgctgcagt 
tcaattctgtgcccgcaaagtctctgctgtttcaggagatgttcgcaaagcactggatgt 
ttgcaggtgagttacggctctgttgcattcttttattagagagctattgaaattgtagag 
tcagatgtcaaaagccagactattctcaaaccactgtctgaatgtaaatcaccttctgag 
cctctgattcccaagagggttggtcttattcacatatcccaagtcatctcagaagttgat 
ggtaacaggatgaccttgagccaagaaggagcacaagattccttccctcttcagcagaag 
atcttggtttgctctttgatgctcttgatcaggcagttgaaaatcaaagaggtcactctg 
gggaagactgtataggttttcgggaatatctacagaagcctgttcaaagattttattgaa 
aagaggaagaaatagggtattcagataagtttttgcaaacccagactcaggtttcttaaa 
tgattaaaggctataagcaatgtgacttttaagcagcgtttgttctcccttgtttcctac 
cagttatatgaagcctacagtaaagtctgtcgcaaacagcaggtggcggctgtggaccag 
tcagagtgtttgtcactttcagggctcttggaagccaggggcattttaggattaaagaga 
aacaaggaaacccgtttgacaaaggtgtttttcaagattgaagagaaagaaatagaacat 
gctctgaaagataaagctttaattggaaatatcttagctactggattgccttaaattctt 
ctcttacaccccacccgaaagtattcagctggcatttagagagctacagtcttcatttta 
gtgctttacacattcgggcctgaaaacaaatatgaccttttttacttgaagccaatgaat 
tttaatctatagattctttaatattagcacagaataatatctttgggtcttactattttt 
acccataaaagtgaccaggtagaccctttttaattacattcactacttctaccacttgtg 
tatctctagccaatgtgcttgcaagtgtacagatctgtgtagaggaatgtgtgtatattt 
acctcttcgtttgctcaaacatgagtgggtatttttttgtttgttttttttgttgttgtt 
gtttttgaggcgcgtctcaccctgttgcccaggctggagtgcaatggcgcgttctctgct 
cactacagcacccgcttcccaggttgaagtgattctcttgcctcagcctcccgagtagct 
gggattacaggtgcccaccaccgcgcccagctaattttttaatttttagtagagacaggg 
ttttaccatgttggccaggctggtcttgaactcctgaccctcaagtgatctgcccacctt 
ggcctccctaagtgctgggattataggcgtgagccaccatgctcagccattaaggtattt 
tgttaagaactttaagtttagggtaagaagaatgaaaatgatccagaaaaatgcaagcaa 
gtccacatggagatttggaggacactggttaaagaatttatttctttgtatagtatacta 
tgttcatggtgcagatactacaacattgtggcattttagactcgttgagtttcttgggca 
ctcccaagggcgttggggtcataaggagactataactctacagattgtgaatatatttat 
tttcaagttgcattctttgtctttttaagcaatcagatttcaagagagctcaagctttca 
gaagtcaatgtgaaaattccttcctaggctgtcccacagtctttgctgcccttagatgaa 
gccacttgtttcaagatgactactttggggttgggttttcatctaaacacatttttccag 
tcttattagataaattagtccatatggttggttaatcaagagccttctgggtttggtttg 
gtggcattaaatggcaatttgtgactgagacaccagagggcacccttataacattgacag 
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aatcgcttgaacctgcgaggcagagattgcagtgagccaagatcaagccattgtactcca 
gcctggacaacaagagcgaaactctgtctaaaaaaaaaaaaaaaacaacaacacaacaca 
aggttttcacgcctgtaaacctagcacttggggaagccaaggtgtgggaggatggcttga 
ggccaggagttcaaggctgcagtgacgctatgatggcacactggtactctagccggggag 
acagagtgcagacacttgtctctaaaaaaaaaacactaaaaaaatgttttggaccctaaa 
aatctttggtggaattccaaaacactcttccaaaaagacgacgtaaacaacaggggcaac 
tgcgtccttggtgtccacaggctatttaactataacaatacagacgggtttattctacga 
gcacgcggaaccaccctcggggctggcgcaaaacatctcaccctgggtatgctcccaatt 
actcttaaggagaactgtgttagcctgcctgaacacattctgttctattacaaacttcta 
actctcgcagtcagtccatgcaacattagcccgaggaatgccctatactgtgacccaaga 
cctgtgtcattctgaacaaatacgacacactactttgaaaggggatgcttgaggctgagg 
tagttttgagggaaagggggaagttctagtttgatttgtttgggagattgctgtcctttc 
ataactgaagcaaatgcttgaaatgtaaaccatgtgtgatacgaatgatctgagagccag 
tgaccaagtctgcttttccctgagctctcctgctgccgcctctacctttctgacatggcc 
ttccatttggaacttgatatgtgtacaagtgatctgttggccactgtttcattcatcctg 
accactgtatccgtgtaatgtgatctgagctacaacaggtggtatgggatagaggcaagg 
aaagcagagagagccctctgaaatgaacactacccactggtgtgtgtagcacatgatagt 
ctgtaggcatgagtctgaatgttgttctcacactcctttactagagtattcgttccctct 
gaaggtagtttgaacatggaaacacagttctatacctagaaggttggaaagtgtctggga 
ctataagggttaacatgcataattgcaaagattgtccgaaggctgatggaaagcagagac 
tgaatgggattgagaacagatgcgaagcttgatttaaattacattttattggatgctgca 
gccttaagagacgtgactgctttacagtttgtttccacactgtgggcagctgctgtctgt 
tctgtgtccacagtaggatcctgcccaataaggagcagcctccccacctcattgtgtttg 
aggcttggcgccttcctcttaactgtagggcttgagtcaggaacatggcttgactcgcag 
tgggggctgctatgtatcctccctggcttccagccaaaatcacattgggtagattcaaag 
gggccaaatttcttttcccctctatctttcccttttcccctgggttttgggaaatagagt 
ttttctggcctactggatttggttaagtttccctttttccttctcccccctcactgggca 
atttctaaggtcattgctgctcttaagactttagcagttggaacagggttggttctgtca 
atgatgcatgaagcagacttagtgtccctgcttggcttctgctgcccttgtgggagcaaa 
agctgatatatgtttgtcagtaaagtcttaagtgtaattcagactgctgaggaagaaagc 
cctttccttgctggcttttctccctgaagctgagagcttcaggaaggtggtaggagtttt 
agtgggatgggatgggatgggcttgtggttagcagtttttgccaggatcataggctgctt 
cacttagagtagtggcaaagatgctgacctacattcttccttttgaaggtgatggagtgt 
gtgtctatatcctggtttatctctctcccccaccccaaggaagctgattaagcttccata 
gagtgttgggtgttgccatggaaagagcatagacaaaataaattgtgtgtttcaacccca 
gagctgccacagatgtttctggctcaatggagacttgcctgtggctagctaactctgcaa 
gaatgtgtgaggtccaggctgggggtggtggctcacgcctgtaatccagcactttgggag 
gccgaggcgggcggatcacctgaggtcaggagtttgagaccagtctggccaacatggcga 
aaccctgtctctactaaaataaattagccaggcattgtggtgtgttcctgtaattccagc 
tactgggaggctgaggcaggagaatcgcttcaacctgggaggcggaggttgcagtgagcc 
gagatcgtgccattgcactccagcttgggcagcagaatgagactgtctca 

SEQ ID NO: 380. >T83032_T1 # TY Transcript ft LN 5208 # Source Gene: T83032 
# Encoded protein: T83032_P1 

ctcccagcgtgctttgcggcgggccggcccgctttacccagagtcgccctgccgcaatcg 
cgcgtctttccaccgaggccccggatgtagattccctcccccgttcagtggtcgtggcct 
cacagcgactctaagacttggggctctctcattggctgtaactcttccactggattggta 
gcaaaaaaagaggcggtgcccaaggcgaaaggctctgtgactacagccaatcagaatcga 
ggccgggctttggcgggaggtgggaacgctgtggccattcggatttggcgcgagcgcggc 
tggagtttgctgctgccgctgtgcagtttgttcaggggcttgtggtggtgagtccgagag 
gctgcgtgtgagagacgtgagaaggatcctgcactgaggaggtggaaagaagaggattgc 
tcgaggaggcctggggtctgtgaggcagcggagctgggtgaaggctgcgggttccggcga 
ggcctgagctgtgctgtcgtcatgcctcaaacccgatcccaggcacaggctacaatcagt 
tttccaaaaaggaagctgtctcgggcattgaacaaagctaaaaactccagtgatgccaaa 
ctagaaccaacaaatgtccaaaccgtaacctgttctcctcgtgtaaaagccctgcctctc 
agccccaggaaacgtctgggcgatgacaacctatgcaacactccccatttacctccttgt 
tctccaccaaagcaaggcaagaaagagaatggtccccctcactcacatacacttaaggga 
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cgaagattggtatttgacaatcagctgacaattaagtctcctagcaaaagagaactagcc 
aaagttcaccaaaacaaaatactttcttcagttagaaaaagtcaagagatcacaacaaat 
tctgagcagagatgtccactgaagaaagaatctgcatgtgtgagactattcaagcaagaa 
ggcacttgctaccagcaagcaaagctggtcctgaacacagctgtcccagatcggctgcct 
gccagggaaagggagatggatgtcatcaggaatttcttgagggaacacatctgtgggaaa 
aaagctggaagcctttacctttctggtgctcctggaactggaaaaactgcctgcttaagc 
cggattctgcaagacctcaagaaggaactgaaaggctttaaaactatcatgctgaattgc 
atgtccttgaggactgcccaggctgtattcccagctattgctcaggagatttgtcaggaa 
gaggtatccaggccagctgggaaggacatgatgaggaaattggaaaaacatatgactgca 
gagaagggccccatgattgtgttggtattggacgagatggatcaactggacagcaaaggc 
caggatgtattgtacacgctatttgaatggccatggctaagcaattctcacttggtgctg 
attggtattgctaataccctggatctcacagatagaattctacctaggcttcaagctaga 
gaaaaatgtaagccacagctgttgaacttcccaccttataccagaaatcagatagtcact 
attttgcaagatcgacttaatcaggtatctagagatcaggttctggacaatgctgcagtt 
caattctgtgcccgcaaagtctctgctgtttcaggagatgttcgcaaagcactggatgtt 
tgcaggagagctattgaaattgtagagtcagatgtcaaaagccagactattctcaaacca 
ctgtctgaatgtaaatcaccttctgagcctctgattcccaagagggttggtcttattcac 
atatcccaagtcatctcagaagttgatggtaacaggatgaccttgagccaagaaggagca 
caagattccttccctcttcagcagaagatcttggtttgctctttgatgctcttgatcagg 
cagttgaaaatcaaagaggtcactctggggaagttatatgaagcctacagtaaagtctgt 
cgcaaacagcaggtggcggctgtggaccagtcagagtgtttgtcactttcagggctcttg 
gaagccaggggcattttaggattaaagagaaacaaggaaacccgtttgacaaaggtgttt 
ttcaagattgaagagaaagaaatagaacatgctctgaaagataaagctttaattggaaat 
atcttagctactggattgccttaaattcttctcttacaccccacccgaaagtattcagct 
ggcatttagagagctacagtcttcattttagtgctttacacattcgggcctgaaaacaaa 
tatgaccttttttacttgaagccaatgaattttaatctatagattctttaatattagcac 
agaataatatctttgggtcttactatttttacccataaaagtgaccaggtagaccctttt 
taattacattcactacttctaccacttgtgtatctctagccaatgtgcttgcaagtgtac 
agatctgtgtagaggaatgtgtgtatatttacctcttcgtttgctcaaacatgagtgggt 
atttttttgtttgttttttttgttgttgttgtttttgaggcgcgtctcaccctgttgccc 
aggctggagtgcaatggcgcgttctctgctcactacagcacccgcttcccaggttgaagt 
gattctcttgcctcagcctcccgagtagctgggattacaggtgcccaccaccgcgcccag 
ctaattttttaatttttagtagagacagggttttaccatgttggccaggctggtcttgaa 
ctcctgaccctcaagtgatctgcccaccttggcctccctaagtgctgggattataggcgt 
gagccaccatgctcagccattaaggtattttgttaagaactttaagtttagggtaagaag 
aatgaaaatgatccagaaaaatgcaagcaagtccacatggagatttggaggacactggtt 
aaagaatttatttctttgtatagtatactatgttcatggtgcagatactacaacattgtg 
gcattttagactcgttgagtttcttgggcactcccaagggcgttggggtcataaggagac 
tataactctacagattgtgaatatatttattttcaagttgcattctttgtctttttaagc 
aatcagatttcaagagagctcaagctttcagaagtcaatgtgaaaattccttcctaggct 
gtcccacagtctttgctgcccttagatgaagccacttgtttcaagatgactactttgggg 
ttgggttttcatctaaacacatttttccagtcttattagataaattagtccatatggttg 
gttaatcaagagccttctgggtttggtttggtggcattaaatggcaatttgtgactgaga 
caccagagggcacccttataacattgactactttgaaaggggatgcttgaggctgaggta 
gttttgagggaaagggggaagttctagtttgatttgtttgggagattgctgtcctttcat 
aactgaagcaaatgcttgaaatgtaaaccatgtgtgatacgaatgatctgagagccagtg 
accaagtctgcttttccctgagctctcctgctgccgcctctacctttctgacatggcctt 
ccatttggaacttgatatgtgtacaagtgatctgttggccactgtttcattcatcctgac 
cactgtatccgtgtaatgtgatctgagctacaacaggtggtatgggatagaggcaaggaa 
agcagagagagccctctgaaatgaacactacccactggtgtgtgtagcacatgatagtct 
gtaggcatgagtctgaatgttgttctcacactcctttactagagtattcgttccctctga 
aggtagtttgaacatggaaacacagttctatacctagaaggttggaaagtgtctgggact 
ataagggttaacatgcataattgcaaagattgtccgaaggctgatggaaagcagagactg 
aatgggattgagaacagatgcgaagcttgatttaaattacattttattggatgctgcagc 
cttaagagacgtgactgctttacagtttgtttccacactgtgggcagctgctgtctgttc 
tgtgtccacagtaggatcctgcccaataaggagcagcctccccacctcattgtgtttgag 
gcttggcgccttcctcttaactgtagggcttgagtcaggaacatggcttgactcgcagtg 
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Figur 1 (Cont'd) 



ggggctgctatgtatcctccctggcttccagccaaaatcacattgggtagattcaaaggg 
gccaaatttcttttcccctctatctttcccttttcccctgggttttgggaaatagagttt 
ttctggcctactggatttggttaagtttccctttttccttctcccccctcactgggcaat 
ttctaaggtcattgctgctcttaagactttagcagttggaacagggttggttctgtcaat 
gatgcatgaagcagacttagtgtccctgcttggcttctgctgcccttgtgggagcaaaag 
ctgatatatgtttgtcagtaaagtcttaagtgtaattcagactgctgaggaagaaagccc 
tttccttgctggcttttctccctgaagctgagagcttcaggaaggtggtaggagttttag 
tgggatgggatgggatgggcttgtggttagcagtttttgccaggatcataggctgcttca 
cttagagtagtggcaaagatgctgacctacattcttccttttgaaggtgatggagtgtgt 
gtctatatcctggtttatctctctcccccaccccaaggaagctgattaagcttccataga 
gtgttgggtgttgccatggaaagagcatagacaaaataaattgtgtgtttcaaccccaga 
gctgccacagatgtttctggctcaatggagacttgcctgtggctagctaactctgcaaga 
atgtgtgaggtccaggctgggggtggtggctcacgcctgtaatccagcactttgggaggc 
cgaggcgggcggatcacctgaggtcaggagtttgagaccagtctggccaacatggcgaaa 
ccctgtctctactaaaataaattagccaggcattgtggtgtgttcctgtaattccagcta 
ctgggaggctgaggcaggagaatcgcttcaacctgggaggcggaggttgcagtgagccga 
gatcgtgccattgcactccagcttgggcagcagaatgagactgtctca 

, SEQ ID NO: 381. >T83032_T2 # TY Transcript * LN 3100 # Source Gene: T83032 
# Encoded protein: T83Q32_P1 

ctcccagcgtgctttgcggcgggccggcccgctttacccagagtcgccctgccgcaatcg 
cgcgtctttccaccgaggccccggatgtagattccctcccccgttcagtggtcgtggcct 
cacagcgactctaagacttggggctctctcattggctgtaactcttccactggattggta 
gcaaaaaaagaggcggtgcccaaggcgaaaggctctgtgactacagccaatcagaatcga 
ggccgggctttggcgggaggtgggaacgctgtggccattcggatttggcgcgagcgcggc 
tggagtttgctgctgccgctgtgcagtttgttcaggggcttgtggtggtgagtccgagag 
gctgcgtgtgagagacgtgagaaggatcctgcactgaggaggtggaaagaagaggattgc 
tcgaggaggcctggggtctgtgaggcagcggagctgggtgaaggctgcgggttccggcga 
ggcctgagctgtgctgtcgtcatgcctcaaacccgatcccaggcacaggctacaatcagt 
tttccaaaaaggaagctgtctcgggcattgaacaaagctaaaaactccagtgatgccaaa 
ctagaaccaacaaatgtccaaaccgtaacctgttctcctcgtgtaaaagccctgcctctc 
agccccaggaaacgtctgggcgatgacaacctatgcaacactccccatttacctccttgt 
tctccaccaaagcaaggcaagaaagagaatggtccccctcactcacatacacttaaggga 
cgaagattggtatttgacaatcagctgacaattaagtctcctagcaaaagagaactagcc 
aaagttcaccaaaacaaaatactttcttcagttagaaaaagtcaagagatcacaacaaat 
tctgagcagagatgtccactgaagaaagaatctgcatgtgtgagactattcaagcaagaa 
ggcacttgctaccagcaagcaaagctggtcctgaacacagctgtcccagatcggctgcct 
gccagggaaagggagatggatgtcatcaggaatttcttgagggaacacatctgtgggaaa 
aaagctggaagcctttacctttctggtgctcctggaactggaaaaactgcctgcttaagc 
cggattctgcaagacctcaagaaggaactgaaaggctttaaaactatcatgctgaattgc 
atgtccttgaggactgcccaggctgtattcccagctattgctcaggagatttgtcaggaa 
gaggtatccaggccagctgggaaggacatgatgaggaaattggaaaaacatatgactgca 
gagaagggccccatgattgtgttggtattggacgagatggatcaactggacagcaaaggc 
caggatgtattgtacacgctatttgaatggccatggctaagcaattctcacttggtgctg 
attggtattgctaataccctggatctcacagatagaattctacctaggcttcaagctaga 
gaaaaatgtaagccacagctgttgaacttcccaccttataccagaaatcagatagtcact 
attttgcaagatcgacttaatcaggtatctagagatcaggttctggacaatgctgcagtt 
caattctgtgcccgcaaagtctctgctgtttcaggagatgttcgcaaagcactggatgtt 
tgcaggagagctattgaaattgtagagtcagatgtcaaaagccagactattctcaaacca 
ctgtctgaatgtaaatcaccttctgagcctctgattcccaagagggttggtcttattcac 
atatcccaagtcatctcagaagttgatggtaacaggatgaccttgagccaagaaggagca 
caagattccttccctcttcagcagaagatcttggtttgctctttgatgctcttgatcagg 
cagttgaaaatcaaagaggtcactctggggaagttatatgaagcctacagtaaagtctgt 
cgcaaacagcaggtggcggctgtggaccagtcagagtgtttgtcactttcagggctcttg 
gaagccaggggcattttaggattaaagagaaacaaggaaacccgtttgacaaaggtgttt 
ttcaagattgaagagaaagaaatagaacatgctctgaaagataaagctttaattggaaat 
atcttagctactggattgccttaaattcttctcttacaccccacccgaaagtattcagct 
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ggcatttagagagctacagtcttcattttagtgctttacacattcgggcctgaaaacaaa 
tatgaccttttttacttgaagccaatgaattttaatctatagattctttaatattagcac 
agaataatatctttgggtcttactatttttacccataaaagtgaccaggtagaccctttt 
taattacattcactacttctaccacttgtgtatctctagccaatgtgcttgcaagtgtac 
agatctgtgtagaggaatgtgtgtatatttacctcttcgtttgctcaaacatgagtgggt 
atttttttgtttgttttttttgttgttgttgtttttgaggcgcgtctcaccctgttgccc 
aggctggagtgcaatggcgcgttctctgctcactacagcacccgcttcccaggttgaagt 
gattctcttgcctcagcctcccgagtagctgggattacaggtgcccaccaccgcgcccag 
ctaattttttaatttttagtagagacagggttttaccatgttggccaggctggtcttgaa 
ctcctgaccctcaagtgatctgcccaccttggcctccctaagtgctgggattataggcgt 
gagccaccatgctcagccattaaggtattttgttaagaactttaagtttagggtaagaag 
aatgaaaatgatccagaaaaatgcaagcaagtccacatggagatttggaggacactggtt 
aaagaatttatttctttgtatagtatactatgttcatggtgcagatactacaacattgtg 
gcattttagactcgttgagtttcttgggcactcccaagggcgttggggtcataaggagac 
tataactctacagattgtgaatatatttattttcaagttg 

SEQ ID N0:382. >T83032_T3 # TY Transcript # LN 3265 # Source Gene: T83032 
# Encoded protein: T83032_P1 

ctcccagcgtgctttgcggcgggccggcccgctttacccagagtcgccctgccgcaatcg 
cgcgtctttccaccgaggccccggatgtagattccctcccccgttcagtggtcgtggcct 
cacagcgactctaagacttggggctctctcattggctgtaactcttccactggattggta 
gcaaaaaaagaggcggtgcccaaggcgaaaggctctgtgactacagccaatcagaatcga 
ggccgggctttggcgggaggtgggaacgctgtggccattcggatttggcgcgagcgcggc 
tggagtttgctgctgccgctgtgcagtttgttcaggggcttgtggtggtgagtccgagag 
gctgcgtgtgagagacgtgagaaggatcctgcactgaggaggtggaaagaagaggattgc 
tcgaggaggcctggggtctgtgaggcagcggagctgggtgaaggctgcgggttccggcga 
ggcctgagctgtgctgtcgtcatgcctcaaacccgatcccaggcacaggctacaatcagt 
tttccaaaaaggaagctgtctcgggcattgaacaaagctaaaaactccagtgatgccaaa 
ctagaaccaacaaatgtccaaaccgtaacctgttctcctcgtgtaaaagccctgcctctc 
agccccaggaaacgtctgggcgatgacaacctatgcaacactccccatttacctccttgt 
tctccaccaaagcaaggcaagaaagagaatggtccccctcactcacatacacttaaggga 
cgaagattggtatttgacaatcagctgacaattaagtctcctagcaaaagagaactagcc 
aaagttcaccaaaacaaaatactttcttcagttagaaaaagtcaagagatcacaacaaat 
tctgagcagagatgtccactgaagaaagaatctgcatgtgtgagactattcaagcaagaa 
ggcacttgctaccagcaagcaaagctggtcctgaacacagctgtcccagatcggctgcct 
gccagggaaagggagatggatgtcatcaggaatttcttgagggaacacatctgtgggaaa 
aaagctggaagcctttacctttctggtgctcctggaactggaaaaactgcctgcttaagc 
cggattctgcaagacctcaagaaggaactgaaaggctttaaaactatcatgctgaattgc 
atgtccttgaggactgcccaggctgtattcccagctattgctcaggagatttgtcaggaa 
gaggtatccaggccagctgggaaggacatgatgaggaaattggaaaaacatatgactgca 
gagaagggccccatgattgtgttggtattggacgagatggatcaactggacagcaaaggc 
caggatgtattgtacacgctatttgaatggccatggctaagcaattctcacttggtgctg 
attggtattgctaataccctggatctcacagatagaattctacctaggcttcaagctaga 
gaaaaatgtaagccacagctgttgaacttcccaccttataccagaaatcagatagtcact 
attttgcaagatcgacttaatcaggtatctagagatcaggttctggacaatgctgcagtt 
caattctgtgcccgcaaagtctctgctgtttcaggagatgttcgcaaagcactggatgtt 
tgcaggagagctattgaaattgtagagtcagatgtcaaaagccagactattctcaaacca 
ctgtctgaatgtaaatcaccttctgagcctctgattcccaagagggttggtcttattcac 
atatcccaagtcatctcagaagttgatggtaacaggatgaccttgagccaagaaggagca 
caagattccttccctcttcagcagaagatcttggtttgctctttgatgctcttgatcagg 
cagttgaaaatcaaagaggtcactctggggaagttatatgaagcctacagtaaagtctgt 
cgcaaacagcaggtggcggctgtggaccagtcagagtgtttgtcactttcagggctcttg 
gaagccaggggcattttaggattaaagagaaacaaggaaacccgtttgacaaaggtgttt 
ttcaagattgaagagaaagaaatagaacatgctctgaaagataaagctttaattggaaat 
atcttagctactggattgccttaaattcttctcttacaccccacccgaaagtattcagct 
ggcatttagagagctacagtcttcattttagtgctttacacattcgggcctgaaaacaaa 
tatgaccttttttacttgaagccaatgaattttaatctatagattctttaatattagcac 
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agaataatatctttgggtcttactatttttacccataaaagtgaccaggtagaccctttt 
taattacattcactacttctaccacttgtgtatctctagccaatgtgcttgcaagtgtac 
agatctgtgtagaggaatgtgtgtatatttacctcttcgtttgctcaaacatgagtgggt 
atttttttgtttgttttttttgttgttgttgtttttgaggcgcgtctcaccctgttgccc 
aggctggagtgcaatggcgcgttctctgctcactacagcacccgcttcccaggagaatcg 
cttgaacctgcgaggcagagattgcagtgagccaagatcaagccattgtactccagcctg 
gacaacaagagcgaaactctgtctaaaaaaaaaaaaaaaacaacaacacaacacaaggtt 
ttcacgcctgtaaacctagcacttggggaagccaaggtgtgggaggatggcttgaggcca 
ggagttcaaggctgcagtgacgctatgatggcacactggtactctagccggggagacaga 
gtgcagacacttgtctctaaaaaaaaaacactaaaaaaatgttttggaccctaaaaatct 
ttggtggaattccaaaacactcttccaaaaagacgacgtaaacaacaggggcaactgcgt 
ccttggtgtccacaggctatttaactataacaatacagacgggtttattctacgagcacg 
cggaaccaccctcggggctggcgcaaaacatctcaccctgggtatgctcccaattactct 
taaggagaactgtgttagcctgcctgaacacattctgttctattacaaacttctaactct 
cgcagtcagtccatgcaacattagcccgaggaatgccctatactgtgacccaagacctgt 
gtcattctgaacaaatacgacacac 

SEQ ID NO: 383. >T83032_T4 # TY Transcript # LN 2767 # Source Gene: T83032 
# Encoded protein: T83032_P1 

ctcccagcgtgctttgcggcgggccggcccgctttacccagagtcgccctgccgcaatcg 
cgcgtctttccaccgaggccccggatgtagattccctcccccgttcagtggtcgtggcct 
cacagcgactctaagacttggggctctctcattggctgtaactcttccactggattggta 
gcaaaaaaagaggcggtgcccaaggcgaaaggctctgtgactacagccaatcagaatcga 
ggccgggctttggcgggaggtgggaacgctgtggccattcggatttggcgcgagcgcggc 
tggagtttgctgctgccgctgtgcagtttgttcaggggcttgtggtggtgagtccgagag 
gctgcgtgtgagagacgtgagaaggatcctgcactgaggaggtggaaagaagaggattgc 
tcgaggaggcctggggtctgtgaggcagcggagctgggtgaaggctgcgggttccggcga 
ggcctgagctgtgctgtcgtcatgcctcaaacccgatcccaggcacaggctacaatcagt 
tttccaaaaaggaagctgtctcgggcattgaacaaagctaaaaactccagtgatgccaaa 
ctagaaccaacaaatgtccaaaccgtaacctgttctcctcgtgtaaaagccctgcctctc 
agccccaggaaacgtctgggcgatgacaacctatgcaacactccccatttacctccttgt 
tctccaccaaagcaaggcaagaaagagaatggtccccctcactcacatacacttaaggga 
cgaagattggtatttgacaatcagctgacaattaagtctcctagcaaaagagaactagcc 
aaagttcaccaaaacaaaatactttcttcagttagaaaaagtcaagagatcacaacaaat 
tctgagcagagatgtccactgaagaaagaatctgcatgtgtgagactattcaagcaagaa 
ggcacttgctaccagcaagcaaagctggtcctgaacacagctgtcccagatcggctgcct 
gccagggaaagggagatggatgtcatcaggaatttcttgagggaacacatctgtgggaaa 
aaagctggaagcctttacctttctggtgctcctggaactggaaaaactgcctgcttaagc 
cggattctgcaagacctcaagaaggaactgaaaggctttaaaactatcatgctgaattgc 
atgtccttgaggactgcccaggctgtattcccagctattgctcaggagatttgtcaggaa 
gaggtatccaggccagctgggaaggacatgatgaggaaattggaaaaacatatgactgca 
gagaagggccccatgattgtgttggtattggacgagatggatcaactggacagcaaaggc 
caggatgtattgtacacgctatttgaatggccatggctaagcaattctcacttggtgctg 
attggtattgctaataccctggatctcacagatagaattctacctaggcttcaagctaga 
gaaaaatgtaagccacagctgttgaacttcccaccttataccagaaatcagatagtcact 
attttgcaagatcgacttaatcaggtatctagagatcaggttctggacaatgctgcagtt 
caattctgtgcccgcaaagtctctgctgtttcaggagatgttcgcaaagcactggatgtt 
tgcaggagagctattgaaattgtagagtcagatgtcaaaagccagactattctcaaacca 
ctgtctgaatgtaaatcaccttctgagcctctgattcccaagagggttggtcttattcac 
atatcccaagtcatctcagaagttgatggtaacaggatgaccttgagccaagaaggagca 
caagattccttccctcttcagcagaagatcttggtttgctctttgatgctcttgatcagg 
cagttgaaaatcaaagaggtcactctggggaagttatatgaagcctacagtaaagtctgt 
cgcaaacagcaggtggcggctgtggaccagtcagagtgtttgtcactttcagggctcttg 
gaagccaggggcattttaggattaaagagaaacaaggaaacccgtttgacaaaggtgttt 
ttcaagattgaagagaaagaaatagaacatgctctgaaagataaagctttaattggaaat 
atcttagctactggattgccttaaattcttctcttacaccccacccgaaagtattcagct 
ggcatttagagagctacagtcttcattttagtgctttacacattcgggcctgaaaacaaa 
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tatgaccttttttacttgaagccaatgaattttaatctatagattctttaatattagcac 
agaataatatctttgggtcttactatttttacccataaaagtgaccaggtagaccctttt 
taattacattcactacttctaccacttgtgtatctctagccaatgtgcttgcaagtgtac 
agatctgtgtagaggaatgtgtgtatatttacctcttcgtttgctcaaacatgagtgggt 
atttttttgtttgttttttttgttgttgttgtttttgaggcgcgtctcaccctgttgccc 
aggctggagtgcaatggcgcgttctctgctcactacagcacccgcttcccaggttgaagt 
gattctcttgcctcagctactgggaggctgaggcaggagaatcgcttcaacctgggaggc 
ggaggttgcagtgagccgagatcgtgccattgcactccagcttgggcagcagaatgagac 
tgtctca 

SEQ ID NO: 384. >T83032_T5 # TY Transcript # LN 2166 # Source Gene: T83032 

# Encoded protein: T83032__P2 

ctcccagcgtgctttgcggcgggccggcccgctttacccagagtcgccctgccgcaatcg 
cgcgtctttccaccgaggccccggatgtagattccctcccccgttcagtggtcgtggcct 
cacagcgactctaagacttggggctctctcattggctgtaactcttccactggattggta 
gcaaaaaaagaggcggtgcccaaggcgaaaggctctgtgactacagccaatcagaatcga 
ggccgggctttggcgggaggtgggaacgctgtggccattcggatttggcgcgagcgcggc 
tggagtttgctgctgccgctgtgcagtttgttcaggggcttgtggtggtgagtccgagag 
gctgcgtgtgagagacgtgagaaggatcctgcactgaggaggtggaaagaagaggattgc 
tcgaggaggcctggggtctgtgaggcagcggagctgggtgaaggctgcgggttccggcga 
ggcctgagctgtgctgtcgtcatgcctcaaacccgatcccaggcacaggctacaatcagt 
tttccaaaaaggaagctgtctcgggcattgaacaaagctaaaaactccagtgatgccaaa 
ctagaaccaacaaatgtccaaaccgtaacctgttctcctcgtgtaaaagccctgcctctc 
agccccaggaaacgtctgggcgatgacaacctatgcaacactccccatttacctccttgt 
tctccaccaaagcaaggcaagaaagagaatggtccccctcactcacatacacttaaggga 
cgaagattggtatttgacaatcagctgacaattaagtctcctagcaaaagagaactagcc 
aaagttcaccaaaacaaaatactttcttcagttagaaaaagtcaagagatcacaacaaat 
tctgagcagagatgtccactgaagaaagaatctgcatgtgtgagactattcaagcaagaa 
ggcacttgctaccagcaagcaaagctggtcctgaacacagctgtcccagatcggctgcct 
gccagggaaagggagatggatgtcatcaggaatttcttgagggaacacatctgtgggaaa 
aaagctggaagcctttacctttctggtgctcctggaactggaaaaactgcctgcttaagc 
cggattctgcaagacctcaagaaggaactgaaaggctttaaaactatcatgctgaattgc 
atgtccttgaggactgcccaggctgtattcccagctattgctcaggagatttgtcaggaa 
gaggtatccaggccagctgggaaggacatgatgaggaaattggaaaaacatatgactgca 
gagaagggccccatgattgtgttggtattggacgagatggatcaactggacagcaaaggc 
caggatgtattgtacacgctatttgaatggccatggctaagcaattctcacttggtgctg 
attggtattgctaataccctggatctcacagatagaattctacctaggcttcaagctaga 
gaaaaatgtaagccacagctgttgaacttcccaccttataccagaaatcagatagtcact 
attttgcaagatcgacttaatcaggtatctagagatcaggttctggacaatgctgcagtt 
caattctgtgcccgcaaagtctctgctgtttcaggagatgttcgcaaagcactggatgtt 
tgcaggagagctattgaaattgtagagtcagatgtcaaaagccagactattctcaaacca 
ctgtctgaatgtaaatcaccttctgagcctctgattcccaagagggttggtcttattcac 
atatcccaagtcatctcagaagttgatggtaacaggatgaccttgagccaagaaggagca 
caagattccttccctcttcagcagaagatcttggtttgctctttgatgctcttgatcagg 
cagttgaaaatcaaagaggtcactctggggaagttatatgaagcctacagtaaagtctgt 
cgcaaacagcaggtggcggctgtggaccagtcagagtgtttgtcactttcagggctcttg 
gaagccaggggcattttaggattaaagagaaacaaggaaacccgtttgacaaaggtgttt 
ttcaagattgaagagaaagaaatagaacatgctctgaaagataaagctttaattggaaat 
atctta 

SEQ ID NO: 385. >T83032JT6 # TY Transcript # LN 5067 # Source Gene: T83032 

# Encoded protein: T83032_P1 

ctcccagcgtgctttgcggcgggccggcccgctttacccagagtcgccctgccgcaatcg 
cgcgtctttccaccgaggccccggatgtagattccctcccccgttcagtggtcgtggcct 
cacagcgactctaagacttggggctctctcattggctgtaactcttccactggattggta 
gcaaaaaaagaggcggtgcccaaggcgaaaggctctgtgactacagccaatcagaatcga 
ggccgggctttggcgggaggtgggaacgctgtggccattcggatttggcgcgagcgcggc 
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tggagtttgctgctgccgctgtgcagtttgttcaggggcttgtggtgctgtgctgtcgtc 
atgcctcaaacccgatcccaggcacaggctacaatcagttttccaaaaaggaagctgtct 
cgggcattgaacaaagctaaaaactccagtgatgccaaactagaaccaacaaatgtccaa 
accgtaacctgttctcctcgtgtaaaagccctgcctctcagccccaggaaacgtctgggc 
gatgacaacctatgcaacactccccatttacctccttgttctccaccaaagcaaggcaag 
aaagagaatggtccccctcactcacatacacttaagggacgaagattggtatttgacaat 
cagctgacaattaagtctcctagcaaaagagaactagccaaagttcaccaaaacaaaata 
ctttcttcagttagaaaaagtcaagagatcacaacaaattctgagcagagatgtccactg 
aagaaagaatctgcatgtgtgagactattcaagcaagaaggcacttgctaccagcaagca 
aagctggtcctgaacacagctgtcccagatcggctgcctgccagggaaagggagatggat 
gtcatcaggaatttcttgagggaacacatctgtgggaaaaaagctggaagcctttacctt 
tctggtgctcctggaactggaaaaactgcctgcttaagccggattctgcaagacctcaag 
aaggaactgaaaggctttaaaactatcatgctgaattgcatgtccttgaggactgcccag 
gctgtattcccagctattgctcaggagatttgtcaggaagaggtatccaggccagctggg 
aaggacatgatgaggaaattggaaaaacatatgactgcagagaagggccccatgattgtg 
ttggtattggacgagatggatcaactggacagcaaaggccaggatgtattgtacacgcta 
tttgaatggccatggctaagcaattctcacttggtgctgattggtattgctaataccctg 
gatctcacagatagaattctacctaggcttcaagctagagaaaaatgtaagccacagctg 
ttgaacttcccaccttataccagaaatcagatagtcactattttgcaagatcgacttaat 
caggtatctagagatcaggttctggacaatgctgcagttcaattctgtgcccgcaaagtc 
tctgctgtttcaggagatgttcgcaaagcactggatgtttgcaggagagctattgaaatt 
gtagagtcagatgtcaaaagccagactattctcaaaccactgtctgaatgtaaatcacct 
tctgagcctctgattcccaagagggttggtcttattcacatatcccaagtcatctcagaa 
gttgatggtaacaggatgaccttgagccaagaaggagcacaagattccttccctcttcag 
cagaagatcttggtttgctctttgatgctcttgatcaggcagttgaaaatcaaagaggtc 
actctggggaagttatatgaagcctacagtaaagtctgtcgcaaacagcaggtggcggct 
gtggaccagtcagagtgtttgtcactttcagggctcttggaagccaggggcattttagga 
ttaaagagaaacaaggaaacccgtttgacaaaggtgtttttcaagattgaagagaaagaa 
atagaacatgctctgaaagataaagctttaattggaaatatcttagctactggattgcct 
taaattcttctcttacaccccacccgaaagtattcagctggcatttagagagctacagtc 
ttcattttagtgctttacacattcgggcctgaaaacaaatatgaccttttttacttgaag 
ccaatgaattttaatctatagattctttaatattagcacagaataatatctttgggtctt 
actatttttacccataaaagtgaccaggtagaccctttttaattacattcactacttcta 
ccacttgtgtatctctagccaatgtgcttgcaagtgtacagatctgtgtagaggaatgtg 
tgtatatttacctcttcgtttgctcaaacatgagtgggtatttttttgtttgtttttttt 
gttgttgttgtttttgaggcgcgtctcaccctgttgcccaggctggagtgcaatggcgcg 
ttctctgctcactacagcacccgcttcccaggttgaagtgattctcttgcctcagcctcc 
cgagtagctgggattacaggtgcccaccaccgcgcccagctaattttttaatttttagta 
gagacagggttttaccatgttggccaggctggtcttgaactcctgaccctcaagtgatct 
gcccaccttggcctccctaagtgctgggattataggcgtgagccaccatgctcagccatt 
aaggtattttgttaagaactttaagtttagggtaagaagaatgaaaatgatccagaaaaa 
tgcaagcaagtccacatggagatttggaggacactggttaaagaatttatttctttgtat 
agtatactatgttcatggtgcagatactacaacattgtggcattttagactcgttgagtt 
tcttgggcactcccaagggcgttggggtcataaggagactataactctacagattgtgaa 
tatatttattttcaagttgcattctttgtctttttaagcaatcagatttcaagagagctc 
aagctttcagaagtcaatgtgaaaattccttcctaggctgtcccacagtctttgctgccc 
ttagatgaagccacttgtttcaagatgactactttggggttgggttttcatctaaacaca 
tttttccagtcttattagataaattagtccatatggttggttaatcaagagccttctggg 
tttggtttggtggcattaaatggcaatttgtgactgagacaccagagggcacccttataa 
cattgactactttgaaaggggatgcttgaggctgaggtagttttgagggaaagggggaag 
ttctagtttgatttgtttgggagattgctgtcctttcataactgaagcaaatgcttgaaa 
tgtaaaccatgtgtgatacgaatgatctgagagccagtgaccaagtctgcttttccctga 
gctctcctgctgccgcctctacctttctgacatggccttccatttggaacttgatatgtg 
tacaagtgatctgttggccactgtttcattcatcctgaccactgtatccgtgtaatgtga 
tctgagctacaacaggtggtatgggatagaggcaaggaaagcagagagagccctctgaaa 
tgaacactacccactggtgtgtgtagcacatgatagtctgtaggcatgagtctgaatgtt 
gttctcacactcctttactagagtattcgttccctctgaaggtagtttgaacatggaaac 
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acagttctatacctagaaggttggaaagtgtctgggactataagggttaacatgcataat 
tgcaaagattgtccgaaggctgatggaaagcagagactgaatgggattgagaacagatgc 
gaagcttgatttaaattacattttattggatgctgcagccttaagagacgtgactgcttt 
acagtttgtttccacactgtgggcagctgctgtctgttctgtgtccacagtaggatcctg 
cccaataaggagcagcctccccacctcattgtgtttgaggcttggcgccttcctcttaac 
tgtagggcttgagtcaggaacatggcttgactcgcagtgggggctgctatgtatcctccc 
tggcttccagccaaaatcacattgggtagattcaaaggggccaaatttcttttcccctct 
atctttcccttttcccctgggttttgggaaatagagtttttctggcctactggatttggt 
taagtttccctttttccttctcccccctcactgggcaatttctaaggtcattgctgctct 
taagactttagcagttggaacagggttggttctgtcaatgatgcatgaagcagacttagt 
gtccctgcttggcttctgctgcccttgtgggagcaaaagctgatatatgtttgtcagtaa 
agtcttaagtgtaattcagactgctgaggaagaaagccctttccttgctggcttttctcc 
ctgaagctgagagcttcaggaaggtggtaggagttttagtgggatgggatgggatgggct 
tgtggttagcagtttttgccaggatcataggctgcttcacttagagtagtggcaaagatg 
ctgacctacattcttccttttgaaggtgatggagtgtgtgtctatatcctggtttatctc 
tctcccccaccccaaggaagctgattaagcttccatagagtgttgggtgttgccatggaa 
agagcatagacaaaataaattgtgtgtttcaaccccagagctgccacagatgtttctggc 
tcaatggagacttgcctgtggctagctaactctgcaagaatgtgtgaggtccaggctggg 
ggtggtggctcacgcctgtaatccagcactttgggaggccgaggcgggcggatcacctga 
ggtcaggagtttgagaccagtctggccaacatggcgaaaccctgtctctactaaaataaa 
ttagccaggcattgtggtgtgttcctgtaattccagctactgggaggctgaggcaggaga 
atcgcttcaacctgggaggcggaggttgcagtgagccgagatcgtgccattgcactccag 
cttgggcagcagaatgagactgtctca 

SEQ ID NO: 386. >T83032_T7 # TY Transcript # LN 2516 # Source Gene: T83032 
8 Encoded protein: 

ggtgatccaactgcctcggcctccctaagtgctgggattataggcgtgagggtcaggagt 
tcaagaccagcctggccctcaagtgatctgcccaccttggcctccctaagtgctgggatt 
ataggcgtgagccaccatgctcagccattaaggtattttgttaagaactttaagtttagg 
gtaagaagaatgaaaatgatccagaaaaatgcaagcaagtccacatggagatttggagga 
cactggttaaagaatttatttctttgtatagtatactatgttcatggtgcagatactaca 
acattgtggcattttagactcgttgagtttcttgggcactcccaagggcgttggggtcat 
aaggagactataactctacagattgtgaatatatttattttcaagttgcattctttgtct 
ttttaagcaatcagatttcaagagagctcaagctttcagaagtcaatgtgaaaattcctt 
cctaggctgtcccacagtctttgctgcccttagatgaagccacttgtttcaagatgacta 
ctttggggttgggttttcatctaaacacatttttccagtcttattagataaattagtcca 
tatggttggttaatcaagagccttctgggtttggtttggtggcattaaatggcaatttgt 
gactgagacaccagagggcacccttataacattgactactttgaaaggggatgcttgagg 
ctgaggtagttttgagggaaagggggaagttctagtttgatttgtttgggagattgctgt 
cctttcataactgaagcaaatgcttgaaatgtaaaccatgtgtgatacgaatgatctgag 
agccagtgaccaagtctgcttttccctgagctctcctgctgccgcctctacctttctgac 
atggccttccatttggaacttgatatgtgtacaagtgatctgttggccactgtttcattc 
atcctgaccactgtatccgtgtaatgtgatctgagctacaacaggtggtatgggatagag 
gcaaggaaagcagagagagccctctgaaatgaacactacccactggtgtgtgtagcacat 
gatagtctgtaggcatgagtctgaatgttgttctcacactcctttactagagtattcgtt 
ccctctgaaggtagtttgaacatggaaacacagttctatacctagaaggttggaaagtgt 
ctgggactataagggttaacatgcataattgcaaagattgtccgaaggctgatggaaagc 
agagactgaatgggattgagaacagatgcgaagcttgatttaaattacattttattggat 
gctgcagccttaagagacgtgactgctttacagtttgtttccacactgtgggcagctgct 
gtctgttctgtgtccacagtaggatcctgcccaataaggagcagcctccccacctcattg 
tgtttgaggcttggcgccttcctcttaactgtagggcttgagtcaggaacatggcttgac 
tcgcagtgggggctgctatgtatcctccctggcttccagccaaaatcacattgggtagat 
tcaaaggggccaaatttcttttcccctctatctttcccttttcccctgggttttgggaaa 
tagagtttttctggcctactggatttggttaagtttccctttttccttctcccccctcac 
tgggcaatttctaaggtcattgctgctcttaagactttagcagttggaacagggttggtt 
ctgtcaatgatgcatgaagcagacttagtgtccctgcttggcttctgctgcccttgtggg 
agcaaaagctgatatatgtttgtcagtaaagtcttaagtgtaattcagactgctgaggaa 
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gaaagccctttccttgctggcttttctccctgaagctgagagcttcaggaaggtggtagg 
agttttagtgggatgggatgggatgggcttgtggttagcagtttttgccaggatcatagg 
ctgcttcacttagagtagtggcaaagatgctgacctacattcttccttttgaaggtgatg 
gagtgtgtgtctatatcctggtttatctctctcccccaccccaaggaagctgattaagct 
tccatagagtgttgggtgttgccatggaaagagcatagacaaaataaattgtgtgtttca 
accccagagctgccacagatgtttctggctcaatggagacttgcctgtggctagctaact 
ctgcaagaatgtgtgaggtccaggctgggggtggtggctcacgcctgtaatccagcactt 
tgggaggccgaggcgggcggatcacctgaggtcaggagtttgagaccagtctggccaaca 
tggcgaaaccctgtctctactaaaataaattagccaggcattgtggtgtgttcctgtaat 
tccagctactgggaggctgaggcaggagaatcgcttcaacctgggaggcggaggttgcag 
tgagccgagatcgtgccattgcactccagcttgggcagcagaatgagactgtctca 

SEQ ID NO: 387. >T83032_T8 # TY Transcript # LN 2655 # Source Gene: T83032 
# Encoded protein: 

ttttagatgagttttgctcttatcgcccaggctggagtgcaatggcgtgatctcggctct 
ctacagcctctgcctcacgggttcaagcgattctcctgcctcagcctccccagtagctag 
gactacaggcacctgccaccacgcccagctaattgttgtatttttatagagacagggttt 
taccatgttggccaggctggtcttgaactcctgaccctcaagtgatctgcccaccttggc 
ctccctaagtgctgggattataggcgtgagccaccatgctcagccattaaggtattttgt 
taagaactttaagtttagggtaagaagaatgaaaatgatccagaaaaatgcaagcaagtc 
cacatggagatttggaggacactggttaaagaatttatttctttgtatagtatactatgt 
tcatggtgcagatactacaacattgtggcattttagactcgttgagtttcttgggcactc 
ccaagggcgttggggtcataaggagactataactctacagattgtgaatatatttatttt 
caagttgcattctttgtctttttaagcaatcagatttcaagagagctcaagctttcagaa 
gtcaatgtgaaaattccttcctaggctgtcccacagtctttgctgcccttagatgaagcc 
acttgtttcaagatgactactttggggttgggttttcatctaaacacatttttccagtct 
tattagataaattagtccatatggttggttaatcaagagccttctgggtttggtttggtg 
gcattaaatggcaatttgtgactgagacaccagagggcacccttataacattgactactt 
tgaaaggggatgcttgaggctgaggtagttttgagggaaagggggaagttctagtttgat 
ttgtttgggagattgctgtcctttcataactgaagcaaatgcttgaaatgtaaaccatgt 
gtgatacgaatgatctgagagccagtgaccaagtctgcttttccctgagctctcctgctg 
ccgcctctacctttctgacatggccttccatttggaacttgatatgtgtacaagtgatct 
gttggccactgtttcattcatcctgaccactgtatccgtgtaatgtgatctgagctacaa 
caggtggtatgggatagaggcaaggaaagcagagagagccctctgaaatgaacactaccc 
actggtgtgtgtagcacatgatagtctgtaggcatgagtctgaatgttgttctcacactc 
ctttactagagtattcgttccctctgaaggtagtttgaacatggaaacacagttctatac 
ctagaaggttggaaagtgtctgggactataagggttaacatgcataattgcaaagattgt 
ccgaaggctgatggaaagcagagactgaatgggattgagaacagatgcgaagcttgattt 
aaattacattttattggatgctgcagccttaagagacgtgactgctttacagtttgtttc 
cacactgtgggcagctgctgtctgttctgtgtccacagtaggatcctgcccaataaggag 
cagcctccccacctcattgtgtttgaggcttggcgccttcctcttaactgtagggcttga 
gtcaggaacatggcttgactcgcagtgggggctgctatgtatcctccctggcttccagcc 
aaaatcacattgggtagattcaaaggggccaaatttcttttcccctctatctttcccttt 
tcccctgggttttgggaaatagagtttttctggcctactggatttggttaagtttccctt 
tttccttctcccccctcactgggcaatttctaaggtcattgctgctcttaagactttagc 
agttggaacagggttggttctgtcaatgatgcatgaagcagacttagtgtccctgcttgg 
cttctgctgcccttgtgggagcaaaagctgatatatgtttgtcagtaaagtcttaagtgt 
aattcagactgctgaggaagaaagccctttccttgctggcttttctccctgaagctgaga 
gcttcaggaaggtggtaggagttttagtgggatgggatgggatgggcttgtggttagcag 
tttttgccaggatcataggctgcttcacttagagtagtggcaaagatgctgacctacatt 
cttccttttgaaggtgatggagtgtgtgtctatatcctggtttatctctctcccccaccc 
caaggaagctgattaagcttccatagagtgttgggtgttgccatggaaagagcatagaca 
aaataaattgtgtgtttcaaccccagagctgccacagatgtttctgg.ctcaatggagact 
tgcctgtggctagctaactctgcaagaatgtgtgaggtccaggctgggggtggtggctca 
cgcctgtaatccagcactttgggaggccgaggcgggcggatcacctgaggtcaggagttt 
gagaccagtctggccaacatggcgaaaccctgtctctactaaaataaattagccaggcat 
tgtggtgtgttcctgtaattccagctactgggaggctgaggcaggagaatcgcttcaacc 
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tgggaggcggaggttgcagtgagccgagatcgtgccattgcactccagcttgggcagcag 
aatgagactgtctca 

SEQ ID NO: 388. >T83032JT9 # TY Transcript # LN 1294 « Source Gene: T83032 

# Encoded protein: T83032_P3 

ctcccagcgtgctttgcggcgggccggcccgctttacccagagtcgccctgccgcaatcg 
cgcgtctttccaccgaggccccggatgtagattccctcccccgttcagtggtcgtggcct 
cacagcgactctaagacttggggctctctcattggctgtaactcttccactggattggta 
gcaaaaaaagaggcggtgcccaaggcgaaaggctctgtgactacagccaatcagaatcga 
ggccgggctttggcgggaggtgggaacgctgtggccattcggatttggcgcgagcgcggc 
tggagtttgctgctgccgctgtgcagtttgttcaggggcttgtggtggtgagtccgagag 
gctgcgtgtgagagacgtgagaaggatcctgcactgaggaggtggaaagaagaggattgc 
tcgaggaggcctggggtctgtgaggcagcggagctgggtgaaggctgcgggttccggcga 
ggcctgagctgtgctgtcgtcatgcctcaaacccgatcccaggcacaggctacaatcagt 
tttccaaaaaggaagctgtctcgggcattgaacaaagctaaaaactccagtgatgccaaa 
ctagaaccaacaaatgtccaaaccgtaacctgttctcctcgtgtaaaagccctgcctctc 
agccccaggaaacgtctgggcgatgacaacctatgcaacactccccatttacctccttgt 
tctccaccaaagcaaggcaagaaagagaatggtccccctcactcacatacacttaaggga 
cgaagattggtatttgacaatcagctgacaattaagtctcctagcaaaagagaactagcc 
aaagttcaccaaaacaaaatactttcttcagttagaaaaagtcaagagatcacaacaaat 
tctgagcagagatgtccactgaagaaagaatctgcatgtgtgagactattcaagcaagaa 
ggcacttgctaccagcaagcaaagctggtcctgaacacagctgtcccagatcggctgcct 
gccagggaaagggagatggatgtcatcaggaatttcttgagggaacacatctgtgggaaa 
aaagctggaagcctttacctttctggtgctcctggaactggaaaaactgcctgcttaagc 
cggattctgcaagacctcaaggtatctagagatcaggttctggacaatgctgcagttcaa 
ttctgtgcccgcaaagtctctgctgtttcaggagatgttcgcaaagcactggatgtttgc 
aggtgagttacggctctgttgcattcttttatta 

SEQ ID NO: 389, >T83032_T10 t TY Transcript # LN 390 # Source Gene: T83032 

# Encoded protein: 

actgtataggttttcgggaatatctacagaagcctgttcaaagattttattgaaaagagg 
aagaaatagggtattcagataagtttttgcaaacccagactcaggtttcttaaatgatta 
aaggctataagcaatgtgacttttaagcagcgtttgttctcccttgtttcctaccagtta 
tatgaagcctacagtaaagtctgtcgcaaacagcaggtggcggctgtggaccagtcagag 
tgtttgtcactttcagggctcttggaagccaggggcattttaggattaaagagaaacaag 
gaaacccgtttgacaaaggtgtttttcaagattgaagagaaagaaatagaacatgctctg 
aaagataaagctttaattggaaatatctta 

SEQ ID NO: 390. >T83032_P1 # TY Protein # CC #LN 560 # Source Gene: T83032 

# Encoding Transcript: 1 

MPQTRSQAQAT I S FPKRKLS RALNKAKNS S DAKLE PTN VQT VTC S PRVKAL PLS PRKRLG 
DDNLCNTPHLPPCSPPKQGKKENGPPHSHTLKGRRLVFDNQLTIKSPSKRELAKVHQNKI 
LSSVRKSQEITTN 

SEQRCPLKKESACVRLFKQEGTCYQQAKLVLNTAVPDRLPAREREMD 
VIRNFLREHICGKKAGSLYLSGAPGTGKTACLSRILQDLKKELKGFKTIMLNCMSLRTAQ 
AVFPAIAQEICQEEVSRPAGKDMMRKLEKHMTAEKGPMIVLVLDEMDQLDSKGQDVLYTL 
FEWPWLSNSHLVLIGIANTLDLTDRILPRLQAREKCKPQLLNFPPYTRNQIVTILQDRLN 
QVSRDQVLDNAAVQFCARKVS AVSGDVRKALDVCRRAI EI VES DVKSQT I LKPLS ECKS P 
SEPLI PKRVGLIHI SQVISEVDGNRMTLSQEGAQDSFPLQQKI LVCSLMLLIRQLKIKEV 
TLGKLYEAYSKVCRKQQVAAVDQSECLSLSGLLEARGILGLKRNKETRLTKVFFKIEEKE 
IEHALKDKALIGNILATGLP 

SEQ ID NO: 391 . >T83032_P2 # TY Protein # CC #LN 89 # Source Gene: T83032 

# Encoding Transcript: 5 

MSTEERICMCETIQARRHLLPASKAGPEHSCPRSAACQGKGDGCHQEFLEGTHLWEKSWK 
PLPFWCSWNWKNCLLKPDSARPQEGTERL 
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SEQ ID NO: 392. >T83032_P3 # TY Protein 8 CC #LN 254 # Source Gene: 
# Encoding Transcript: 9 

MPQTRSQAQATISFPKRKLSRALNKAKNSSDAKLEPTNVQTVTCSPRVKALPLSPRKRLG 
DDNLCNTPHLPPCSPPKQGKKENGPPHSHTLKGRRLVFDNQLTIKSPSKRELAKVHQNKI 
LSSVRKSQEITTN 

S EQRC PL KKES AC VRLFKQEGTC YQQAKL VLNT AV PDRL PAREREMD 

VIRNFLREHICGKKAGSLYLSGAPGTGKTACLSRILQDLKVSRDQVLDNAAVQFCARKVS 

AVSGDVRKALDVCR 



SEQ ID NO: 393 . >Z26993 # TY Consensus # Length 9773 # Number of exons 

ttcccgccccgcttccccttccctcccctcccctccccgcaccgcgcgctagcccggggc 

ggctccgcagcccgccgggagctctgaccgaggcgcctcgctggggcggggaccttgcct 

tgcccgggggtctcttagaatccattgattagaccagtagcttttggtagtaccatgatg 

tttccatttgacttttcaaacatttttcagctgaccattaccagctttgttgggggtaaa 

ctcaagagagttacatcttttacacattgatttttccagattttgcaaatgtcactataa 

atggataatattttctaattgggagttgtggcaggggtagttattactgttttttgcttg 

ttcgtgctattttttactttgttactaattgtattttttccaattccaggccatttcata 

attctgaatcatgtctgataacggagaactggaagataagcctccagcacctcctgtgcg 

aatgagcagcaccatctttagcactggaggcaaagaccctttgtcagccaatcacagttt 

gaaacctttgccctctgttccagaagagaaaaagcccaggcataaaatcatctccatatt 

ctcaggcacagagaaaggaagtaaaaagaaagaaaaggaacggccagaaatttctcctcc 

atctgattttgagcacaccatccatgttggctttgatgctgttactggagaattcactgg 

catgccagaacagtgggctcgattactacagacctccaatatcaccaaactagagcaaaa 

gaagaatcctcaggctgtgctggatgtcctaaagttctacgactccaacacagtgaagca 

gaaatatctgagctttactcctcctgagaaagatggctttccttctggaacaccagcatt 

tccttcaggaacaccagcacagggcatgtcagtgaattctccaggtaacagcatcgaagc 

caacatggatggtgtgctcaaaatcagatggaggagaaatttcgggccgttccttttctt 

tctttttacttcatttctctgtgcctgagaatatggagatgattttatgcctgggctttt 

tctcttctggaacagagggcaaaggtttcaaactgtgattggcctgaatgccaagggaac 

agaagcacccgcagtagtgacagaggaggaggatgatgatgaagagactgctcctcccgt 

tattgccccgcgaccggatcatacgaaatcaatttacacacggtctgtaattgaccctgt 

tcctgcaccagttggtgattcacatgttgatggtgctgccaagtctttagacaaacagaa 

aaagaagactaagatgacagatgaagagattatggagaaattaagtatgttatctacatt 

ttacatcattgttaaattgttcacggctcttaaaacatttggcccagatggattttaact 

tacactttacattgtgacttaaagtataaagaataaattgtttccttatagcaattcttg 

agattacctgattctaaagctgaacatctgagaacctaattaatctctgacaaactggta 

cagtcttttcaaaatgtgaccattattgagcaaatgattagttaggaggtctaacctaat 

ggaaataatgtgtcttcttgtttctgttacttttaaacttttaatgtgtcttgtttctga 

tacttttaaacttttattatgtatttaagtttctttagtgttttgtgtttgaaacctgtt 

atttttggaacttagtattatgtttgaaaatcacttataaaaacagtggagttcattaaa 

aggaataataatatcacaatcaatttacacttaaaatgcttgactgagcttccagctttt 

tgtttattctaggaactatcgtgagcataggtgaccctaagaaaaaatatacaagatatg 

aaaaaattggacaagggtaagtatttgtgactgtattacgataatattcagtattcagta 

tttcctggctttatagcatgatgtttattttgcctgccgacatcaagtacttatatagta 

ttctgtgagtggtacccttgggggatttgtgtaaataaataatgcagttatttgtagatg 

ctgttaaaaaactgtactagttgttagcaatgactgtgtatcattattttttcttaagtg 

tcatttcccatctgtgataattagagggaagaattgcccataaaatatatttaaaagaca 

ataagaatatttactgcatgactataagcaataaaaaatgcatttaaatcatggcttctg 

gtacagttttcactgctactgacgttgcactgggacaggaggttgctatcaaacaaatta 

atttacagaaacagccaaagaaggaactgatcattaacgagattctggtgatgaaagaat 

tgaaaaatcccaacatcgttaactttttggacaggtaagtatgactattccttaaacacc 

gggagaaaatgtagaaattttaagtctcatggtttcgggggtggggctggccactttgga 



246/365 



Figur 1 (Cont'd) 



ataaactgtcagtgccgagaaataagatggaagaaagaagaggagaggtaaaaatagcac 
agtgcagttttagaaggtccttctctgcaaggacatcacttggcaggtgagagaaggcag 
ttaattgaaggccatagctggtggtgtttatactcatcgcaaacgacctgaagcgaacac 
atgaatggttggctgtgattcgtgtcttgactgacagaggggactggttacatggttttt 
aaacttgtggatcaatttttatttcaaatgctatccttcatggacaaatttatatgtcta 
tcaaaacctcatttagttgaggaattaatgttaaataacagaccaaaaagcagaatagaa 
aatgatgcatattggctgggtgcagtggctcacacctgtaatcccagctactcaggaggc 
tgaggcaggagaattgcttgaacccaggaggtggaggttgcagtgagccgagatcgcacc 
actgcactccagcctgggcaatggaatgagactccatctcaaaaaaaagtgggcaaactg 
taactttaaaatgattcaggatcataacttcagttatttttctgtgctgttacgtagcag 
ctttccagttttcttcctaactctcaagctaccagctctagttttattgttagggtgttt 
aatatttgaattcttcaacatagaataccatgtgtttaggaaaaataaaagtgtgactgt 
atttggtatgttgtatttttagttacctggtaggagatgaattgtttgtggtcatggaat 
accttgctggggggtcactcactgatgtggtaacagaaacgtgcatggatgaagcacaga 
ttgctgctgtatgcagagaggtaggtttgcctccttcttgatatgttatggtgatatgtg 
gttagcaagaagtgaacaaaaggaaatttttctgcacaggtaattgttattgtgggaggt 
gctttctctgtatacattctctgcaaattactccatctccatgccatttcatcctacaca 
ggagtttcagccctttccttcaggtataagataatttgggaatgaattgtggacctttga 
agatccagattcacaaggaaattctggtagagtgatgatgatcatttaccatccaaacag 
gacacttttgagagtaaaagggtatatatcattagaaatacaccactcaggctgggcgcg 
gtggctcacgcctgtaatcccagcactttgggaggccgaggcgggtggatcacaaggtca 
ggagatcgagaccatgtttacaggcattggagtttttacatgctaatcaagtgatccaca 
gagacatcaaaagtgacaatgtacttttgggaatggaaggatctgttaagctcagtgagt 
aacaaatggacaattcacaatcatttattataattttctgccttttgtttaattaaaaac 
ttttcttgaccaatgcaaggattaataatgatttaatataggaagaggtgagacctttga 
aaactgagtgatcctctgagggttttaactaatcagtgccagagaaaggactcgctggat 
aagacattacctaataattaccactagataagacattattaggtaatgtcttgtcttgtc 
tttttgagatggagctgactttggtttctgtgcccagatcacccctgagcagagcaaacg 
cagtaccatggtcggaacgccatactggatggcaccagaggtggttacacggaaagctta 
tggccctaaagtcgacatatggtctctgggtatcatggctattgagatggtagaaggaga 
gcctccatacctcaatgaaaatcccttgagggatcacttgaacccaggaggcggaggttg 
cagtgagccgagatcatgccatcgcactccagcctggtcgacaaagcgaaactccgtctc 
cataaataaataaataaatgttggtgagaagataaggaaactagaaatctcgttcattgc 
aaaatataaaatggtacatctgctttggaaaacaacttgacaattctgtgtgaagttaat 
acggaattaccagatgacctgccttgtacctaatagcaactaatggaaccccagaacttc 
agaatccagagaaactttccccaatatttcgggatttcttaaatcgatgtttggaaatgg 
atgtggaaaaaaggggttcagccaaagaattattacagcatcctttcctgaaactggcca 
aaccgttatctagcttgacaccactgatcatggcagctaaagaagcaatgaagagtaacc 
gttaacatcactgctgtggcctcatactcttttttccattttctacaagaagccttttag 
tatatgaaaattattactctttttggggtttaaagaaatggtctgcataacctgaatgaa 
agaagcaaatgactattctctgaagacaaccaagagaaaattgcaaaaagacaagtatga 
cttttatatgaaccccttctttagggtccagaaggaattgtggactgaatcactagcctt 
aggtctttcagcaaacagcctatcagggccatttatcatgtgtgagatttgcattttact 
ttgctgactttgttgtaatagatcccattcattgtcccctttggggtatttccaatactt 
gaatggcagattggagtttttcagagtatgtgtttcatctgctagtctttctctccttca 
tagcttttcttttcctggacttgctccttttgagttgcttttgcgtttctcatgcctagg 
caagtgtaatagaaattatgtagctccttatgttggcaaaggagctctatatagtttcac 
tttgtataaaagttaggaccagctgttgttacatgtaatattttagttcagaacttgacc 
tgaaggaagggaagaaaagtatgtgatttttaccttttttaacaaatgtgaaaaagtcag 
ttttagaaatttcgtggtagtaagttcggcatttgttacatgtatagagagaagactaat 
aatctctatttataactaaatcattgagatagaaaaagattcccattgactgtagacttc 
ttcccattttgtcttcccttctgcctgtttccccttcaggcttggctctaggaaccaaag 
tgatttgttgttgttccaacctgggctttgtgactttggttagtgccactaccttcttcc 
ctcctttcccccttcaatttggaaataaatttctgtatatgttgcaattttaggtttagg 
tttgttctttttctttttcattaatcctctctcacctcacagataccccctcccatggca 
aataatataataaccagtgaattttcaggaatttaaaaattagcttttttccacttaaag 
gagaaaaatatttgggactagcagcagaggcagtaagagatgtgaaccttggtgagctct 
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gatacagtgagaagagattatactcatgaaagagaatgttagtgttacagagaagcagcc 
gatagcaaatcgactgtagagacttggcggcggtggcattgccccaggtcgtcagcagtg 
tggtattatctatgagaacttgagcgacagagtatttcttgatgaatttatagatcattt 
gagatgttgagttactttagtttagttttgttttgttttttcaaataagtagagactatt 
gtaaaaaacgagaaaggaaaatgaaatgtgcgtgttgatagcaataatttgtttctttta 
aagattctaaaaggtctgagacctgtagcattaattatttgagtgccctcccttctcccc 
tcccctcccttttctcttctcttttttcctctcctctccttctcctttattcattgtttt 
gcttttggagtaggtgttgttcaagtatctgtggtttggttctggcattttgttcccacc 
atccccttcccccattaacttcccccctgcttgccatcctgcagtagtataaatcatgaa 
taaaaaataattttgctgttgtagtatacattggagaaactggcaggttttatttccatt 
attttatttccactatatctatgataagatgcaattataaggagagaagtgactgttttt 
tattgataaggcaagattttcagaaaaatgagtaaaataattaatgaaacatatttagag 
cacttaatggtctctgttttcaatataattcttgatttcatttttctctggaatatattg 
gccttctacagctattactgaattatagaaactggtttatttctggcagaaagctgcagt 
gccaccgctcaagttctcatagataataccacactgctgacgacctggggcaatgccacc 
gccgccaagtctctacagtcgatttgctatcggctgcttctctgtaacactaacattctc 
tttcttgagtataatctcttctcactgtatcaggtcgagcctgagttccaaattttacca 
ttctttgtaaacagttggatggattatgataaagaagatgctaccaatgaaatagaaaac 
caacgagatgagaagactgtgatcctcatgtactcagaggcacttccctcctaagtcaaa 
gaccatcctcactgactatgtgccaacgcctcgtttcaggcttgtgactcaacaaagggc 
ttttccattgatagaagcagtttgggatttgtagttgcgacttcttcgatagttacctgc 
acgtccattgctggcaactgacttgtcattaaaacctggctctttggttaagggagctac 
gctgtggtttattcttaagttacgtggataaactaacctctaacagaaatatactttggt 
taattttgaaatgtgtcatttttaaacaatcttaaaagtaatacagaattgtgatttatt 
aattttaaaacattcagaacttgttgaaagaaaaattatatctgaatcaagattcatgtt 
ttttatttttattttttttgatacagagtctcactctgtcactcaggctggagtgcagtg 
acatgatctcagctcactgcaacctccgcttcctgggttcaagcaattctcatgcctcag 
cctcctgagtagctgagaccacaggcacccgccaccacacccagctatttttttgtattt 
ttagtagagacagagtctcaccacattgcccaggctggtctccaactcctgagctcaggc 
agtctgcccaccttggcctcccaaagtgctgcagttacaggcgtgagccactgcacctgg 
cctcatgttttttaaataattgccttttatatttaccctttttgtcatcactttagaatg 
aaaattcccatttaaatctgaaagttaccttaatagtcctcttgtgttattaggacagta 
ttattatagtacttatttattttattttagatttaaagttatcttctctttttctttttt 
cttctgctgcttttagggacaattaaaactgggaaactatgaaacatggaacattttatc 
ctacctgaaagtaaacgagtaattgtgaagcataagacactgaggctaatacaactctgt 
cttcatgtgttgactgcctggcacatagtattcattctcttccctttaacatagaagtgt 
ccagctgcgtacagtctagtaaccagcaactgtaaacgaacctgtgcctctaacaagcga 
ttctaaaccacctatgagtatttcttttagggctcacttaaatacatgtttgtatatact 
gtattctagccagaataattttagatctgatcaggtagtagctaaaattagaaaaaaaca 
aaatagatgcttaaagaatttgcatccatttttgagtctaaatcttttaaaatatactga 
gatccacatctagtgaaatgtcagtgtcaaaatattatagattatagctaaaatccagat 
taatactcatttggggttttttatagtggaacttcatagtaatacaaaaagcagattgtc 
ttcctgtctccgctgctcccacagtaggtattgaaactggtaaaatcagttttttgatag 
tgtgtgtatataagaaaaaatagatacacacattcttttttctcagtcaacacattgatt 
gaacactctggcaaagatgctgtggtggatgaggttggagttcgaaagaagaagcaagcg 
ctggcctggccttgaaagaaccgaagtctttcccattcacttctctagaaagctgccaag 
acagaggcagaaagaaatggatgatagttctgtcaagcacacttctgttctcttagaact 
tagaagtgtttctaagagaacagaagtaataagagaaacagttacgtgtggaattcaaca 
tctttggttggaacgcattggctttttttttcttgttttgatagaaatggaattaagcaa 
aagtagtttttgtcttttctgttgtcttcaaattttatgccttagtttagggacccacgc 
tggttgggtaccatgggaaataatttttaaggttcaagaaaacttggaccctttaaggga 
agagggggaaaaaaaaaagttnttgttatttttaatttaatcccgttcaattatttaatt 
gttatacattgacattaactgctgtattttgactttgttcaataattttgttctttcagg 
gctagaaataaactttttaaaaaaagtgtgcatttttccctttcctaaacttttattctt 
tcttttgatcagcgtaaaagaatattttaatgtcttttgatagcataaaagaatatttaa 
atgtcttaataggttttcaaagaacatttagtatttttagtgataaatgttttaaacctt 
ttagaatggcccctgggggttttcaatcttggattggaggtgggggtttcagagtgttgc 
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Figur 1 (Cont'd) 



cacgtctagctcctctcccagattgaaccgtggcactccagcctggtgtgacagagcgag 
actttatctcaataaaaaacaaaaagaccaaaaacagtaaatcaccgtgataggcaaaag 
gcattcctttattatttttggcatcttttaaactattgtggactctgggtccaatttgaa 
agtgattaagctaaacatatttgacctttgaactagtcccatttaagaccattacggaac 
tggttcctctttgatctccccatgtctgaatggctaataatacacattgagagggcacca 
acggaactggagctgttgtaactggagctgttgtagctggagctgttgtamctggagctg 
ttgtaactggagctgttgtaactgragctgttgtaactggagctgttgtaact 

SEQ ID NO: 394. >Z26993_T1 # TY Transcript # LN 5965 # Source Gene: Z26993 
# Encoded protein: Z26993_P1 

ttcccgccccgcttccccttccctcccctcccctccccgcaccgcgcgctagcccggggc 
ggctccgcagcccgccgggagctctgaccgaggcgcctcgctggggcggggaccttgcct 
tgcccggggccatttcataattctgaatcatgtctgataacggagaactggaagataagc 
ctccagcacctcctgtgcgaatgagcagcaccatctttagcactggaggcaaagaccctt 
tgtcagccaatcacagtttgaaacctttgccctctgttccagaagagaaaaagcccaggc 
ataaaatcatctccatattctcaggcacagagaaaggaagtaaaaagaaagaaaaggaac 
ggccagaaatttctcctccatctgattttgagcacaccatccatgttggctttgatgctg 
ttactggagaattcactggcatgccagaacagtgggctcgattactacagacctccaata 
tcaccaaactagagcaaaagaagaatcctcaggctgtgctggatgtcctaaagttctacg 
actccaacacagtgaagcagaaatatctgagctttactcctcctgagaaagatggctttc 
cttctggaacaccagcactgaatgccaagggaacagaagcacccgcagtagtgacagagg 
aggaggatgatgatgaagagactgctcctcccgttattgccccgcgaccggatcatacga 
aatcaatttacacacggtctgtaattgaccctgttcctgcaccagttggtgattcacatg 
ttgatggtgctgccaagtctttagacaaacagaaaaagaagactaagatgacagatgaag 
agattatggagaaattaagaactatcgtgagcataggtgaccctaagaaaaaatatacaa 
gatatgaaaaaattggacaaggggcttctggtacagttttcactgctactgacgttgcac 
tgggacaggaggttgctatcaaacaaattaatttacagaaacagccaaagaaggaactga 
tcattaacgagattctggtgatgaaagaattgaaaaatcccaacatcgttaactttttgg 
acagttacctggtaggagatgaattgtttgtggtcatggaataccttgctggggggtcac 
tcactgatgtggtaacagaaacgtgcatggatgaagcacagattgctgctgtatgcagag 
agtgtttacaggcattggagtttttacatgctaatcaagtgatccacagagacatcaaaa 
gtgacaatgtacttttgggaatggaaggatctgttaagctcactgactttggtttctgtg 
cccagatcacccctgagcagagcaaacgcagtaccatggtcggaacgccatactggatgg 
caccagaggtggttacacggaaagcttatggccctaaagtcgacatatggtctctgggta 
tcatggctattgagatggtagaaggagagcctccatacctcaatgaaaatcccttgaggg 
ccttgtacctaatagcaactaatggaaccccagaacttcagaatccagagaaactttccc 
caatatttcgggatttcttaaatcgatgtttggaaatggatgtggaaaaaaggggttcag 
ccaaagaattattacagcatcctttcctgaaactggccaaaccgttatctagcttgacac 
cactgatcatggcagctaaagaagcaatgaagagtaaccgttaacatcactgctgtggcc 
tcatactcttttttccattttctacaagaagccttttagtatatgaaaattattactctt 
tttggggtttaaagaaatggtctgcataacctgaatgaaagaagcaaatgactattctct 
gaagacaaccaagagaaaattgcaaaaagacaagtatgacttttatatgaaccccttctt 
tagggtccagaaggaattgtggactgaatcactagccttaggtctttcagcaaacagcct 
atcagggccatttatcatgtgtgagatttgcattttactttgctgactttgttgtaatag 
atcccattcattgtcccctttggggtatttccaatacttgaatggcagattggagttttt 
cagagtatgtgtttcatctgctagtctttctctccttcatagcttttcttttcctggact 
tgctccttttgagttgcttttgcgtttctcatgcctaggcaagtgtaatagaaattatgt 
agctccttatgttggcaaaggagctctatatagtttcactttgtataaaagttaggacca 
gctgttgttacatgtaatattttagttcagaacttgacctgaaggaagggaagaaaagta 
tgtgatttttacctt.ttttaacaaatgtgaaaaagtcagttttagaaatttcgtggtagt 
aagttcggcatttgttacatgtatagagagaagactaataatctctatttataactaaat 
cattgagatagaaaaagattcccattgactgtagacttcttcccattttgtcttcccttc 
tgcctgtttccccttcaggcttggctctaggaaccaaagtgatttgttgttgttccaacc 
tgggctttgtgactttggttagtgccactaccttcttccctcctttcccccttcaatttg 
gaaataaatttctgtatatgttgcaattttaggtttaggtttgttctttttctttttcat 
taatcctctctcacctcacagataccccctcccatggcaaataatataataaccagtgaa 
ttttcaggaatttaaaaattagcttttttccacttaaaggagaaaaatatttgggactag 
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Figur 1 (Cont'd) 



cagcagaggcagtaagagatgtgaaccttggtgagctctgatacagtgagaagagattat 
actcatgaaagagaatgttagtgttacagagaagcagccgatagcaaatcgactgtagag 
acttggcggcggtggcattgccccaggtcgtcagcagtgtggtattatctatgagaactt 
gagcgacagagtatttcttgatgaatttatagatcatttgagatgttgagttactttagt 
ttagttttgttttgttttttcaaataagtagagactattgtaaaaaacgagaaaggaaaa 
tgaaatgtgcgtgttgatagcaataatttgtttcttttaaagattctaaaaggtctgaga 
cctgtagcattaattatttgagtgccctcccttctcccctcccctcccttttctcttctc 
ttttttcctctcctctccttctcctttattcattgttttgcttttggagtaggtgttgtt 
caagtatctgtggtttggttctggcattttgttcccaccatccccttcccccattaactt 
cccccctgcttgccatcctgcagtagtataaatcatgaataaaaaataattttgctgttg 
tagtatacattggagaaactggcaggttttatttccattattttatttccactatatcta 
tgataagatgcaattataaggagagaagtgactgttttttattgataaggcaagattttc 
agaaaaatgagtaaaataattaatgaaacatatttagagcacttaatggtctctgttttc 
aatataattcttgatttcatttttctctggaatatattggccttctacagctattactga 
attatagaaactggtttatttctggcagaaagctgcagtgccacctgagttccaaatttt 
accattctttgtaaacagttggatggattatgataaagaagatgctaccaatgaaataga 
aaaccaacgagatgagaagactgtgatcctcatgtactcagaggcacttccctcctaagt 
caaagaccatcctcactgactatgtgccaacgcctcgtttcaggcttgtgactcaacaaa 
gggcttttccattgatagaagcagtttgggatttgtagttgcgacttcttcgatagttac 
ctgcacgtccattgctggcaactgacttgtcattaaaacctggctctttggttaagggag 
ctacgctgtggtttattcttaagttacgtggataaactaacctctaacagaaatatactt 
tggttaattttgaaatgtgtcatttttaaacaatcttaaaagtaatacagaattgtgatt 
tattaattttaaaacattcagaacttgttgaaagaaaaattatatctgaatcaagattca 
tgttttttatttttattttttttgatacagagtctcactctgtcactcaggctggagtgc 
agtgacatgatctcagctcactgcaacctccgcttcctgggttcaagcaattctcatgcc 
tcagcctcctgagtagctgagaccacaggcacccgccaccacacccagctatttttttgt 
atttttagtagagacagagtctcaccacattgcccaggctggtctccaactcctgagctc 
aggcagtctgcccaccttggcctcccaaagtgctgcagttacaggcgtgagccactgcac 
ctggcctcatgttttttaaataattgccttttatatttaccctttttgtcatcactttag 
aatgaaaattcccatttaaatctgaaagttaccttaatagtcctcttgtgttattaggac 
agtattattatagtacttatttattttattttagatttaaagttatcttctctttttctt 
ttttcttctgctgcttttagggacaattaaaactgggaaactatgaaacatggaacattt 
tatcctacctgaaagtaaacgagtaattgtgaagcataagacactgaggctaatacaact 
ctgtcttcatgtgttgactgcctggcacatagtattcattctcttccctttaacatagaa 
gtgtccagctgcgtacagtctagtaaccagcaactgtaaacgaacctgtgcctctaacaa 
gcgattctaaaccacctatgagtatttcttttagggctcacttaaatacatgtttgtata 
tactgtattctagccagaataattttagatctgatcaggtagtagctaaaattagaaaaa 
aacaaaatagatgcttaaagaatttgcatccatttttgagtctaaatcttttaaaatata 
ctgagatccacatctagtgaaatgtcagtgtcaaaatattatagattatagctaaaatcc 
agattaatactcatttggggttttttatagtggaacttcatagtaatacaaaaagcagat 
tgtcttcctgtctccgctgctcccacagtaggtattgaaactggtaaaatcagttttttg 
atagtgtgtgtatataagaaaaaatagatacacacattcttttttctcagtcaacacatt 
gattgaacactctggcaaagatgctgtggtggatgaggttggagttcgaaagaagaagca 
agcgctggcctggccttgaaagaaccgaagtctttcccattcacttctctagaaagctgc 
caagacagaggcagaaagaaatggatgatagttctgtcaagcacacttctgttctcttag 
aacttagaagtgtttctaagagaacagaagtaataagagaaacagttacgtgtggaattc 
aacatctttggttggaacgcattggctttttttttcttgttttgatagaaatggaattaa 
gcaaaagtagtttttgtcttttctgttgtcttcaaattttatgccttttatttttaattt 
aatcccgttcaattatttaattgttatacattgacattaactgctgtattttgactttgt 
tcaataattttgttctttcagggctagaaataaactttttaaaaaaagtgtgcatttttc 
cctttcctaaacttttattctttcttttgatcagcgtaaaagaatattttaatgtctttt 
gatagcataaaagaatatttaaatgtcttaataggttttcaaagaacatttagtattttt 
agtgataaatgttttaaacctttta 

SEQ ID NO: 395. >Z26993JT2 # TY Transcript # LN 6118 # Source Gene: Z26993 
# Encoded protein: Z26993_P1 

ggtctcttagaatccattgattagaccagtagcttttggtagtaccatgatgtttccatt 
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tgacttttcaaacatttttcagctgaccattaccagctttgttgggggtaaactcaagag 
agttacatcttttacacattgatttttccagattttgcaaatgtcactataaatggataa 
tat tttctaattgggagttgtggcaggggtagttat tact gttttttgcttgttcgtgct 
attttttactttgttactaattgtattttttccaattccaggccatttcataattctgaa 
tcatgtctgataacggagaactggaagataagcctccagcacctcctgtgcgaatgagca 
gcaccatctttagcactggaggcaaagaccctttgtcagccaatcacagtttgaaacctt 
tgccctctgttccagaagagaaaaagcccaggcataaaatcatctccatattctcaggca 
cagagaaaggaagtaaaaagaaagaaaaggaacggccagaaatttctcctccatctgatt 
ttgagcacaccatccatgttggctttgatgctgttactggagaattcactggcatgccag 
aacagtgggctcgattactacagacctccaatatcaccaaactagagcaaaagaagaatc 
ctcaggctgtgctggatgtcctaaagttctacgactccaacacagtgaagcagaaatatc 
tgagctttactcctcctgagaaagatggctttccttctggaacaccagcactgaatgcca 
agggaacagaagcacccgcagtagtgacagaggaggaggatgatgatgaagagactgctc 
ctcccgttattgccccgcgaccggatcatacgaaatcaatttacacacggtctgtaattg 
accctgttcctgcaccagttggtgattcacatgttgatggtgctgccaagtctttagaca 
aacagaaaaagaagactaagatgacagatgaagagattatggagaaattaagaactatcg 
tgagcataggtgaccctaagaaaaaatatacaagatatgaaaaaattggacaaggggctt 
ctggtacagttttcactgctactgacgttgcactgggacaggaggttgctatcaaacaaa 
ttaatttacagaaacagccaaagaaggaactgatcattaacgagattctggtgatgaaag 
aattgaaaaatcccaacatcgttaactttttggacagttacctggtaggagatgaattgt 
ttgtggtcatggaataccttgctggggggtcactcactgatgtggtaacagaaacgtgca 
tggatgaagcacagattgctgctgtatgcagagagtgtttacaggcattggagtttttac 
atgctaatcaagtgatccacagagacatcaaaagtgacaatgtacttttgggaatggaag 
gatctgttaagctcactgactttggtttctgtgcccagatcacccctgagcagagcaaac 
gcagtaccatggtcggaacgccatactggatggcaccagaggtggttacacggaaagctt 
atggccctaaagtcgacatatggtctctgggtatcatggctattgagatggtagaaggag 
agcctccatacctcaatgaaaatcccttgagggccttgtacctaatagcaactaatggaa 
ccccagaacttcagaatccagagaaactttccccaatatttcgggatttcttaaatcgat 
gtttggaaatggatgtggaaaaaaggggttcagccaaagaattattacagcatcctttcc 
tgaaactggccaaaccgttatctagcttgacaccactgatcatggcagctaaagaagcaa 
tgaagagtaaccgttaacatcactgctgtggcctcatactcttttttccattttctacaa 
gaagccttttagtatatgaaaattattactctttttggggtttaaagaaatggtctgcat 
aacctgaatgaaagaagcaaatgactattctctgaagacaaccaagagaaaattgcaaaa 
agacaagtatgacttttatatgaaccccttctttagggtccagaaggaattgtggactga 
atcactagccttaggtctttcagcaaacagcctatcagggccatttatcatgtgtgagat 
ttgcattttactttgctgactttgttgtaatagatcccattcattgtcccctttggggta 
tttccaatacttgaatggcagattggagtttttcagagtatgtgtttcatctgctagtct 
ttctctccttcatagcttttcttttcctggacttgctccttttgagttgcttttgcgttt 
ctcatgcctaggcaagtgtaatagaaattatgtagctccttatgttggcaaaggagctct 
atatagtttcactttgtataaaagttaggaccagctgttgttacatgtaatattttagtt 
cagaacttgacctgaaggaagggaagaaaagtatgtgatttttaccttttttaacaaatg 
tgaaaaagtcagttttagaaatttcgtggtagtaagttcggcatttgttacatgtataga 
gagaagactaataatctctatttataactaaatcattgagatagaaaaagattcccattg 
actgtagacttcttcccattttgtcttcccttctgcctgtttccccttcaggcttggctc 
taggaaccaaagtgatttgttgttgttccaacctgggctttgtgactttggttagtgcca 
ctaccttcttccctcctttcccccttcaatttggaaataaatttctgtatatgttgcaat 
tttaggtttaggtttgttctttttctttttcattaatcctctctcacctcacagataccc 
cctcccatggcaaataatataataaccagtgaattttcaggaatttaaaaattagctttt 
ttccacttaaaggagaaaaatatttgggactagcagcagaggcagtaagagatgtgaacc 
ttggtgagctctgatacagtgagaagagattatactcatgaaagagaatgttagtgttac 
agagaagcagccgatagcaaatcgactgtagagacttggcggcggtggcattgccccagg 
tcgtcagcagtgtggtattatctatgagaacttgagcgacagagtatttcttgatgaatt 
tatagatcatttgagatgttgagttactttagtttagttttgttttgttttttcaaataa 
gtagagactattgtaaaaaacgagaaaggaaaatgaaatgtgcgtgttgatagcaataat 
ttgtttcttttaaagattctaaaaggtctgagacctgtagcattaattatttgagtgccc 
tcccttctcccctcccctcccttttctcttctcttttttcctctcctctccttctccttt 
attcattgttttgcttttggagtaggtgttgttcaagtatctgtggtttggttctggcat 
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tttgttcccaccatccccttcccccattaacttcccccctgcttgccatcctgcagtagt 
ataaatcatgaataaaaaataattttgctgttgtagtatacattggagaaactggcaggt 
tttatttccattattttatttccactatatctatgataagatgcaattataaggagagaa 
gtgactgttttttattgataaggcaagattttcagaaaaatgagtaaaataattaatgaa 
acatatttagagcacttaatggtctctgttttcaatataattcttgatttcatttttctc 
tggaatatattggccttctacagctattactgaattatagaaactggtttatttctggca 
gaaagctgcagtgccacctgagttccaaattttaccattctttgtaaacagttggatgga 
ttatgataaagaagatgctaccaatgaaatagaaaaccaacgagatgagaagactgtgat 
cctcatgtactcagaggcacttccctcctaagtcaaagaccatcctcactgactatgtgc 
caacgcctcgtttcaggcttgtgactcaacaaagggcttttccattgatagaagcagttt 
gggatttgtagttgcgacttcttcgatagttacctgcacgtccattgctggcaactgact 
tgtcattaaaacctggctctttggttaagggagctacgctgtggtttattcttaagttac 
gtggataaactaacctctaacagaaatatactttggttaattttgaaatgtgtcattttt 
aaacaatcttaaaagtaatacagaattgtgatttattaattttaaaacattcagaacttg 
ttgaaagaaaaattatatctgaatcaagattcatgttttttatttttattttttttgata 
cagagtctcactctgtcactcaggctggagtgcagtgacatgatctcagctcactgcaac 
ctccgcttcctgggttcaagcaattctcatgcctcagcctcctgagtagctgagaccaca 
ggcacccgccaccacacccagctatttttttgtatttttagtagagacagagtctcacca 
cattgcccaggctggtctccaactcctgagctcaggcagtctgcccaccttggcctccca 
aagtgctgcagttacaggcgtgagccactgcacctggcctcatgttttttaaataattgc 
cttttatatttaccctttttgtcatcactttagaatgaaaattcccatttaaatctgaaa 
gttaccttaatagtcctcttgtgttattaggacagtattattatagtacttatttatttt 
attttagatttaaagttatcttctctttttcttttttcttctgctgcttttagggacaat 
taaaactgggaaactatgaaacatggaacattttatcctacctgaaagtaaacgagtaat 
tgtgaagcataagacactgaggctaatacaactctgtcttcatgtgttgactgcctggca 
catagtattcattctcttccctttaacatagaagtgtccagctgcgtacagtctagtaac 
cagcaactgtaaacgaacctgtgcctctaacaagcgattctaaaccacctatgagtattt 
cttttagggctcacttaaatacatgtttgtatatactgtattctagccagaataatttta 
gatctgatcaggtagtagctaaaattagaaaaaaacaaaatagatgcttaaagaatttgc 
atccatttttgagtctaaatcttttaaaatatactgagatccacatctagtgaaatgtca 
gtgtcaaaatattatagattatagctaaaatccagattaatactcatttggggtttttta 
tagtggaacttcatagtaatacaaaaagcagattgtcttcctgtctccgctgctcccaca 
gtaggtattgaaactggtaaaatcagttttttgatagtgtgtgtatataagaaaaaatag 
atacacacattcttttttctcagtcaacacattgattgaacactctggcaaagatgctgt 
ggtggatgaggttggagttcgaaagaagaagcaagcgctggcctggccttgaaagaaccg 
aagtctttcccattcacttctctagaaagctgccaagacagaggcagaaagaaatggatg 
atagttctgtcaagcacacttctgttctcttagaacttagaagtgtttctaagagaacag 
aagtaataagagaaacagttacgtgtggaattcaacatctttggttggaacgcattggct 
ttttttttcttgttttgatagaaatggaattaagcaaaagtagtttttgtcttttctgtt 
gtcttcaaattttatgccttttatttttaatttaatcccgttcaattatttaattgttat 
acattgacattaactgctgtattttgactttgttcaataattttgttctttcagggctag 
aaataaactttttaaaaaaagtgtgcatttttccctttcctaaacttttattctttcttt 
tgatcagcgtaaaagaatattttaatgtcttttgatagcataaaagaatatttaaatgtc 
ttaataggttttcaaagaacatttagtatttttagtgataaatgttttaaacctttta 

SEQ ID NO: 396. >Z26993_T3 # TY Transcript # LN 5518 I Source Gene: Z26993 
# Encoded protein: Z26993_P2 

cagggcatgtcagtgaattctccaggtaacagcatcgaagccaacatggatggtgtgctc 
aaaatcagatggaggagaaatttcgggccgttccttttctttctttt tact teat tt etc 
tgtgcctgagaatatggagatgattttatgcctgggctttttctcttctggaacagaggg 
caaaggtttcaaactgtgattggctgacagaggaggaggatgatgatgaagagactgctc 
ctcccgttattgccccgcgaccggatcatacgaaatcaatttacacacggtctgtaattg 
accctgttcctgcaccagttggtgattcacatgttgatggtgctgccaagtctttagaca 
aacagaaaaagaagactaagatgacagatgaagagattatggagaaattaagaactatcg 
tgagcataggtgaccctaagaaaaaatatacaagatatgaaaaaattggacaaggggctt 
ctggtacagttttcactgctactgacgttgcactgggacaggaggttgctatcaaacaaa 
ttaatttacagaaacagccaaagaaggaactgatcattaacgagattctggtgatgaaag 
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aattgaaaaatcccaacatcgttaactttttggacagttacctggtaggagatgaattgt 
ttgtggtcatggaataccttgctggggggtcactcactgatgtggtaacagaaacgtgca 
tggatgaagcacagattgctgctgtatgcagagagtgtttacaggcattggagtttttac 
atgctaatcaagtgatccacagagacatcaaaagtgacaatgtacttttgggaatggaag 
gatctgttaagctcactgactttggtttctgtgcccagatcacccctgagcagagcaaac 
gcagtaccatggtcggaacgccatactggatggcaccagaggtggttacacggaaagctt 
atggccctaaagtcgacatatggtctctgggtatcatggctattgagatggtagaaggag 
agcctccatacctcaatgaaaatcccttgagggccttgtacctaatagcaactaatggaa 
ccccagaacttcagaatccagagaaactttccccaatatttcgggatttcttaaatcgat 
gtttggaaatggatgtggaaaaaaggggttcagccaaagaattattacagcatcctttcc 
tgaaactggccaaaccgttatctagcttgacaccactgatcatggcagctaaagaagcaa 
tgaagagtaaccgttaacatcactgctgtggcctcatactcttttttccattttctacaa 
gaagccttttagtatatgaaaattattactctttttggggtttaaagaaatggtctgcat 
aacctgaatgaaagaagcaaatgactattctctgaagacaaccaagagaaaattgcaaaa 
agacaagtatgacttttatatgaaccccttctttagggtccagaaggaattgtggactga 
atcactagccttaggtctttcagcaaacagcctatcagggccatttatcatgtgtgagat 
ttgcattttactttgctgactttgttgtaatagatcccattcattgtcccctttggggta 
tttccaatacttgaatggcagattggagtttttcagagtatgtgtttcatctgctagtct 
ttctctccttcatagcttttcttttcctggacttgctccttttgagttgcttttgcgttt 
ctcatgcctaggcaagtgtaatagaaattatgtagctccttatgttggcaaaggagctct 
atatagtttcactttgtataaaagttaggaccagctgttgttacatgtaatattttagtt 
cagaacttgacctgaaggaagggaagaaaagtatgtgatttttaccttttttaacaaatg 
tgaaaaagtcagttttagaaatttcgtggtagtaagttcggcatttgttacatgtataga 
gagaagactaataatctctatttataactaaatcattgagatagaaaaagattcccattg 
actgtagacttcttcccattttgtcttcccttctgcctgtttccccttcaggcttggctc 
taggaaccaaagtgatttgttgttgttccaacctgggctttgtgactttggttagtgcca 
ctaccttcttccctcctttcccccttcaatttggaaataaatttctgtatatgttgcaat 
tttaggtttaggtttgttctttttctttttcattaatcctctctcacctcacagataccc 
cctcccatggcaaataatataataaccagtgaattttcaggaatttaaaaattagctttt 
ttccacttaaaggagaaaaatatttgggactagcagcagaggcagtaagagatgtgaacc 
ttggtgagctctgatacagtgagaagagattatactcatgaaagagaatgttagtgttac 
agagaagcagccgatagcaaatcgactgtagagacttggcggcggtggcattgccccagg 
tcgtcagcagtgtggtattatctatgagaacttgagcgacagagtatttcttgatgaatt 
tatagatcatttgagatgttgagttactttagtttagttttgttttgttttttcaaataa 
gtagagactattgtaaaaaacgagaaaggaaaatgaaatgtgcgtgttgatagcaataat 
ttgtttcttttaaagattctaaaaggtctgagacctgtagcattaattatttgagtgccc 
tcccttctcccctcccctcccttttctcttctcttttttcctctcctctccttctccttt 
attcattgttttgcttttggagtaggtgttgttcaagtatctgtggtttggttctggcat 
tttgttcccaccatccccttcccccattaacttcccccctgcttgccatcctgcagtagt 
ataaatcatgaataaaaaataattttgctgttgtagtatacattggagaaactggcaggt 
tttatttccattattttatttccactatatctatgataagatgcaattataaggagagaa 
gtgactgttttttattgataaggcaagattttcagaaaaatgagtaaaataattaatgaa 
acatatttagagcacttaatggtctctgttttcaatataattcttgatttcatttttctc 
tggaatatattggccttctacagctattactgaattatagaaactggtttatttctggca 
gaaagctgcagtgccacctgagttccaaattttaccattctttgtaaacagttggatgga 
ttatgataaagaagatgctaccaatgaaatagaaaaccaacgagatgagaagactgtgat 
cctcatgtactcagaggcacttccctcctaagtcaaagaccatcctcactgactatgtgc 
caacgcctcgtttcaggcttgtgactcaacaaagggcttttccattgatagaagcagttt 
gggatttgtagttgcgacttcttcgatagttacctgcacgtccattgctggcaactgact 
tgtcattaaaacctggctctttggttaagggagctacgctgtggtttattcttaagttac 
gtggataaactaacctctaacagaaatatactttggttaattttgaaatgtgtcattttt 
aaacaatcttaaaagtaatacagaattgtgatttattaattttaaaacattcagaacttg 
ttgaaagaaaaattatatctgaatcaagattcatgttttttatttttattttttttgata 
cagagtctcactctgtcactcaggctggagtgcagtgacatgatctcagctcactgcaac 
ctccgcttcctgggttcaagcaattctcatgcctcagcctcctgagtagctgagaccaca 
ggcacccgccaccacacccagctatttttttgtatttttagtagagacagagtctcacca 
cattgcccaggctggtctccaactcctgagctcaggcagtctgcccaccttggcctccca 
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aagtgctgcagttacaggcgtgagccactgcacctggcctcatgttttttaaataattgc 
cttttatatttaccctttttgtcatcactttagaatgaaaattcccatttaaatctgaaa 
gttaccttaatagtcctcttgtgttattaggacagtattattatagtacttatttatttt 
attttagatttaaagttatcttctctttttcttttttcttctgctgcttttagggacaat 
taaaactgggaaactatgaaacatggaacattttatcctacctgaaagtaaacgagtaat 
tgtgaagcataagacactgaggctaatacaactctgtcttcatgtgttgactgcctggca 
catagtattcattctcttccctttaacatagaagtgtccagctgcgtacagtctagtaac 
cagcaactgtaaacgaacctgtgcctctaacaagcgattctaaaccacctatgagtattt 
cttttagggctcacttaaatacatgtttgtatatactgtattctagccagaataatttta 
gatctgatcaggtagtagctaaaattagaaaaaaacaaaatagatgcttaaagaatttgc 
atccatttttgagtctaaatcttttaaaatatactgagatccacatctagtgaaatgtca 
gtgtcaaaatattatagattatagctaaaatccagattaatactcatttggggtttttta 
tagtggaacttcatagtaatacaaaaagcagattgtcttcctgtctccgctgctcccaca 
gtaggtattgaaactggtaaaatcagttttttgatagtgtgtgtatataagaaaaaatag 
atacacacattcttttttctcagtcaacacattgattgaacactctggcaaagatgctgt 
ggtggatgaggttggagttcgaaagaagaagcaagcgctggcctggccttgaaagaaccg 
aagtctttcccattcacttctctagaaagctgccaagacagaggcagaaagaaatggatg 
atagttctgtcaagcacacttctgttctcttagaacttagaagtgtttctaagagaacag 
aagtaataagagaaacagttacgtgtggaattcaacatctttggttggaacgcattggct 
ttttttttcttgttttgatagaaatggaattaagcaaaagtagtttttgtcttttctgtt 
gtcttcaaattttatgccttttatttttaatttaatcccgttcaattatttaattgttat 
acattgacattaactgctgtattttgactttgttcaataattttgttctttcagggctag 
aaataaactttttaaaaaaagtgtgcatttttccctttcctaaacttttattctttcttt 
tgatcagcgtaaaagaatattttaatgtcttttgatagcataaaagaatatttaaatgtc 
ttaataggttttcaaagaacatttagtatttttagtgataaatgttttaaacctttta 

SEQ ID NO: 397. >Z26993JT4 # TY Transcript # LN 4120 # Source Gene: Z26993 
# Encoded protein: Z26993_P1 

ttcccgccccgcttccccttccctcccctcccctccccgcaccgcgcgctagcccggggc 
ggctccgcagcccgccgggagctctgaccgaggcgcctcgctggggcggggaccttgcct 
tgcccggggccatttcataattctgaatcatgtctgataacggagaactggaagataagc 
ctccagcacctcctgtgcgaatgagcagcaccatctttagcactggaggcaaagaccctt 
tgtcagccaatcacagtttgaaacctttgccctctgttccagaagagaaaaagcccaggc 
ataaaatcatctccatattctcaggcacagagaaaggaagtaaaaagaaagaaaaggaac 
ggccagaaatttctcctccatctgattttgagcacaccatccatgttggctttgatgctg 
ttactggagaattcactggcatgccagaacagtgggctcgattactacagacctccaata 
tcaccaaactagagcaaaagaagaatcctcaggctgtgctggatgtcctaaagttctacg 
actccaacacagtgaagcagaaatatctgagctttactcctcctgagaaagatggctttc 
cttctggaacaccagcactgaatgccaagggaacagaagcacccgcagtagtgacagagg 
aggaggatgatgatgaagagactgctcctcccgttattgccccgcgaccggatcatacga 
aatcaatttacacacggtctgtaattgaccctgttcctgcaccagttggtgattcacatg 
ttgatggtgctgccaagtctttagacaaacagaaaaagaagactaagatgacagatgaag 
agattatggagaaattaagaactatcgtgagcataggtgaccctaagaaaaaatatacaa 
gatatgaaaaaattggacaaggggcttctggtacagttttcactgctactgacgttgcac 
tgggacaggaggttgctatcaaacaaattaatttacagaaacagccaaagaaggaactga 
tcattaacgagattctggtgatgaaagaattgaaaaatcccaacatcgttaactttttgg 
acagttacctggtaggagatgaattgtttgtggtcatggaataccttgctggggggtcac 
tcactgatgtggtaacagaaacgtgcatggatgaagcacagattgctgctgtatgcagag 
agtgtttacaggcattggagtttttacatgctaatcaagtgatccacagagacatcaaaa 
gtgacaatgtacttttgggaatggaaggatctgttaagctcactgactttggtttctgtg 
cccagatcacccctgagcagagcaaacgcagtaccatggtcggaacgccatactggatgg 
caccagaggtggttacacggaaagcttatggccctaaagtcgacatatggtctctgggta 
tcatggctattgagatggtagaaggagagcctccatacctcaatgaaaatcccttgaggg 
ccttgtacctaatagcaactaatggaaccccagaacttcagaatccagagaaactttccc 
caatatttcgggatttcttaaatcgatgtttggaaatggatgtggaaaaaaggggttcag 
ccaaagaattattacagcatcctttcctgaaactggccaaaccgttatctagcttgacac 
cactgatcatggcagctaaagaagcaatgaagagtaaccgttaacatcactgctgtggcc 
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tcatactcttttttccattttctacaagaagccttttagtatatgaaaattattactctt 
tttggggtttaaagaaatggtctgcataacctgaatgaaagaagcaaatgactattctct 
gaagacaaccaagagaaaattgcaaaaagacaattggatggattatgataaagaagatgc 
taccaatgaaatagaaaaccaacgagatgagaagactgtgatcctcatgtactcagaggc 
acttccctcctaagtcaaagaccatcctcactgactatgtgccaacgcctcgtttcaggc 
ttgtgactcaacaaagggcttttccattgatagaagcagtttgggatttgtagttgcgac 
ttcttcgatagttacctgcacgtccattgctggcaactgacttgtcattaaaacctggct 
ctttggttaagggagctacgctgtggtttattcttaagttacgtggataaactaacctct 
aacagaaatatactttggttaattttgaaatgtgtcatttttaaacaatcttaaaagtaa 
tacagaattgtgatttattaattttaaaacattcagaacttgttgaaagaaaaattatat 
ctgaatcaagattcatgttttttatttttattttttttgatacagagtctcactctgtca 
ctcaggctggagtgcagtgacatgatctcagctcactgcaacctccgcttcctgggttca 
agcaattctcatgcctcagcctcctgagtagctgagaccacaggcacccgccaccacacc 
cagctatttttttgtatttttagtagagacagagtctcaccacattgcccaggctggtct 
ccaactcctgagctcaggcagtctgcccaccttggcctcccaaagtgctgcagttacagg 
cgtgagccactgcacctggcctcatgttttttaaataattgccttttatatttacccttt 
ttgtcatcactttagaatgaaaattcccatttaaatctgaaagttaccttaatagtcctc 
ttgtgttattaggacagtattattatagtacttatttattttattttagatttaaagtta 
tcttctctttttcttttttcttctgctgcttttagggacaattaaaactgggaaactatg 
aaacatggaacattttatcctacctgaaagtaaacgagtaattgtgaagcataagacact 
gaggctaatacaactctgtcttcatgtgttgactgcctggcacatagtattcattctctt 
ccctttaacatagaagtgtccagctgcgtacagtctagtaaccagcaactgtaaacgaac 
ctgtgcctctaacaagcgattctaaaccacctatgagtatttcttttagggctcacttaa 
atacatgtttgtatatactgtattctagccagaataattttagatctgatcaggtagtag 
ctaaaattagaaaaaaacaaaatagatgcttaaagaatttgcatccatttttgagtctaa 
atcttttaaaatatactgagatccacatctagtgaaatgtcagtgtcaaaatattataga 
ttatagctaaaatccagattaatactcatttggggttttttatagtggaacttcatagta 
atacaaaaagcagattgtcttcctgtctccgctgctcccacagtaggtattgaaactggt 
aaaatcagttttttgatagtgtgtgtatataagaaaaaatagatacacacattctttttt 
ctcagtcaacacattgattgaacactctggcaaagatgctgtggtggatgaggttggagt 
tcgaaagaagaagcaagcgctggcctggccttgaaagaaccgaagtctttcccattcact 
tctctagaaagctgccaagacagaggcagaaagaaatggatgatagttctgtcaagcaca 
cttctgttctcttagaacttagaagtgtttctaagagaacagaagtaataagagaaacag 
ttacgtgtggaattcaacatctttggttggaacgcattggctttttttttcttgttttga 
tagaaatggaattaagcaaaagtagtttttgtcttttctgttgtcttcaaattttatgcc 
ttttatttttaatttaatcccgttcaattatttaattgttatacattgacattaactgct 
gtattttgactttgttcaataattttgttctttcagggctagaaataaactttttaaaaa 
aagtgtgcatttttccctttcctaaacttttattctttcttttgatcagcgtaaaagaat 
attttaatgtcttttgatagcataaaagaatatttaaatgtcttaataggttttcaaaga 
acatttagtatttttagtgataaatgttttaaacctttta 

SEQ ID NO: 398. >Z26993_T5 # TY Transcript # LN 4093 # Source Gene: Z26993 
# Encoded protein: Z26993_P1 

ttcccgccccgcttccccttccctcccctcccctccccgcaccgcgcgctagcccggggc 
ggctccgcagcccgccgggagctctgaccgaggcgcctcgctggggcggggaccttgcct 
tgcccggggccatttcataattctgaatcatgtctgataacggagaactggaagataagc 
ctccagcacctcctgtgcgaatgagcagcaccatctttagcactggaggcaaagaccctt 
tgtcagccaatcacagtttgaaacctttgccctctgttccagaagagaaaaagcccaggc 
ataaaatcatctccatattctcaggcacagagaaaggaagtaaaaagaaagaaaaggaac 
ggccagaaatttctcctccatctgattttgagcacaccatccatgttggctttgatgctg 
ttactggagaattcactggcatgccagaacagtgggctcgattactacagacctccaata 
tcaccaaactagagcaaaagaagaatcctcaggctgtgctggatgtcctaaagttctacg 
actccaacacagtgaagcagaaatatctgagctttactcctcctgagaaagatggctttc 
cttctggaacaccagcactgaatgccaagggaacagaagcacccgcagtagtgacagagg 
aggaggatgatgatgaagagactgctcctcccgttattgccccgcgaccggatcatacga 
aatcaatttacacacggtctgtaattgaccctgttcctgcaccagttggtgattcacatg 
ttgatggtgctgccaagtctttagacaaacagaaaaagaagactaagatgacagatgaag 
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agattatggagaaattaagaactatcgtgagcataggtgaccctaagaaaaaatatacaa 
gatatgaaaaaattggacaaggggcttctggtacagttttcactgctactgacgttgcac 
tgggacaggaggttgctatcaaacaaattaatttacagaaacagccaaagaaggaactga 
tcattaacgagattctggtgatgaaagaattgaaaaatcccaacatcgttaactttttgg 
acagttacctggtaggagatgaattgtttgtggtcatggaataccttgctggggggtcac 
tcactgatgtggtaacagaaacgtgcatggatgaagcacagattgctgctgtatgcagag 
agtgtttacaggcattggagtttttacatgctaatcaagtgatccacagagacatcaaaa 
gtgacaatgtacttttgggaatggaaggatctgttaagctcactgactttggtttctgtg 
cccagatcacccctgagcagagcaaacgcagtaccatggtcggaacgccatactggatgg 
caccagaggtggttacacggaaagcttatggccctaaagtcgacatatggtctctgggta 
tcatggctattgagatggtagaaggagagcctccatacctcaatgaaaatcccttgaggg 
ccttgtacctaatagcaactaatggaaccccagaacttcagaatccagagaaactttccc 
caatatttcgggatttcttaaatcgatgtttggaaatggatgtggaaaaaaggggttcag 
ccaaagaattattacagcatcctttcctgaaactggccaaaccgttatctagcttgacac 
cactgatcatggcagctaaagaagcaatgaagagtaaccgttaacatcactgctgtggcc 
tcatactcttttttccattttctacaagaagccttttagtatatgaaaattattactctt 
tttggggtttaaagaaatggtctgcataacctgaatgaaagaagcaaatgactattctct 
gaagacaaccaagagaaaattgcaaaaagacaagtatgacttttatatgaaccccttctt 
tagggtccagaaggaattgtggactgaatcactagccttaggtctttcagcaaacagcct 
atcagggccatttatcatgtgtgagatttgcattttactttgctgactttgttgtaatag 
atcccattcattgtcccctttggggtatttccaatacttgaatggcagattggagttttt 
cagagtatgtgtttcatctgctagtctttctctccttcatagcttttcttttcctggact 
tgctccttttgagttgcttttgcgtttctcatgcctaggcaagtgtaatagaaattatgt 
agctccttatgttggcaaaggagctctatatagtttcactttgtataaaagttaggacca 
gctgttgttacatgtaatattttagttcagaacttgacctgaaggaagggaagaaaagta 
tgtgatttttaccttttttaacaaatgtgaaaaagtcagttttagaaatttcgtggtagt 
aagttcggcatttgttacatgtatagagagaagactaataatctctatttataactaaat 
cattgagatagaaaaagattcccattgactgtagacttcttcccattttgtcttcccttc 
tgcctgtttccccttcaggcttggctctaggaaccaaagtgatttgttgttgttccaacc 
tgggctttgtgactttggttagtgccactaccttcttccctcctttcccccttcaatttg 
gaaataaatttctgtatatgttgcaattttaggtttaggtttgttctttttctttttcat 
taatcctctctcacctcacagataccccctcccatggcaaataatataataaccagtgaa 
ttttcaggaatttaaaaattagcttttttccacttaaaggagaaaaatatttgggactag 
cagcagaggcagtaagagatgtgaaccttggtgagctctgatacagtgagaagagattat 
actcatgaaagagaatgttagtgttacagagaagcagccgatagcaaatcgactgtagag 
acttggcggcggtggcattgccccaggtcgtcagcagtgtggtattatctatgagaactt 
gagcgacagagtatttcttgatgaatttatagatcatttgagatgttgagttactttagt 
ttagttttgttttgttttttcaaataagtagagactattgtaaaaaacgagaaaggaaaa 
tgaaatgtgcgtgttgatagcaataatttgtttcttttaaagattctaaaaggtctgaga 
cctgtagcattaattatttgagtgccctcccttctcccctcccctcccttttctcttctc 
ttttttcctctcctctccttctcctttattcattgttttgcttttggagtaggtgttgtt 
caagtatctgtggtttggttctggcattttgttcccaccatccccttcccccattaactt 
cccccctgcttgccatcctgcagtagtataaatcatgaataaaaaataattttgctgttg 
tagtatacattggagaaactggcaggttttatttccattattttatttccactatatcta 
tgataagatgcaattataaggagagaagtgactgttttttattgataaggcaagattttc 
agaaaaatgagtaaaataattaatgaaacatatttagagcacttaatggtctctgttttc 
aatataattcttgatttcatttttctctggaatatattggccttctacagctattactga 
attatagaaactggtttatttctggcagaaagctgcagtgccacctgagttccaaatttt 
accattctttgtaaacagttggatggattatgataaagaagatgctaccaatgaaataga 
aaaccaacgagatgagaagactgtgatcctcatgtactcagaggcacttccctcctaagt 
caaagaccatcctcactgactatgtgccaacgcctcgtttcaggcttgtgactcaacaaa 
gggcttttccattgatagaagcagtttgggatttgtagttgcgacttcttcgatagttac 
ctgcacgtccattgctggcaactgacttgtcattaaaacctggctctttggttaagggag 
ctacgctgtggtttattcttaagttacgtggataaactaacctctaacagaaatatactt 
tggttaattttga 



256/365 



Figur 1 (Cont'd) 



SEQ ID NO: 399. >Z26993_T6 # TY Transcript # LN 4539 # Source Gene: Z26993 
# Encoded protein: Z26993_P1 

ttcccgccccgcttccccttccctcccctcccctccccgcaccgcgcgctagcccggggc 
ggctccgcagcccgccgggagctctgaccgaggcgcctcgctggggcggggaccttgcct 
tgcccggggccatttcataattctgaatcatgtctgataacggagaactggaagataagc 
ctccagcacctcctgtgcgaatgagcagcaccatctttagcactggaggcaaagaccctt 
tgtcagccaatcacagtttgaaacctttgccctctgttccagaagagaaaaagcccaggc 
ataaaatcatctccatattctcaggcacagagaaaggaagtaaaaagaaagaaaaggaac 
ggccagaaatttctcctccatctgattttgagcacaccatccatgttggctttgatgctg 
ttactggagaattcactggcatgccagaacagtgggctcgattactacagacctccaata 
tcaccaaactagagcaaaagaagaatcctcaggctgtgctggatgtcctaaagttctacg 
actccaacacagtgaagcagaaatatctgagctttactcctcctgagaaagatggctttc 
cttctggaacaccagcactgaatgccaagggaacagaagcacccgcagtagtgacagagg 
aggaggatgatgatgaagagactgctcctcccgttattgccccgcgaccggatcatacga 
aatcaatttacacacggtctgtaattgaccctgttcctgcaccagttggtgattcacatg 
ttgatggtgctgccaagtctttagacaaacagaaaaagaagactaagatgacagatgaag 
agattatggagaaattaagaactatcgtgagcataggtgaccctaagaaaaaatatacaa 
gatatgaaaaaattggacaaggggcttctggtacagttttcactgctactgacgttgcac 
tgggacaggaggttgctatcaaacaaattaatttacagaaacagccaaagaaggaactga 
tcattaacgagattctggtgatgaaagaattgaaaaatcccaacatcgttaactttttgg 
acagttacctggtaggagatgaattgtttgtggtcatggaataccttgctggggggtcac 
tcactgatgtggtaacagaaacgtgcatggatgaagcacagattgctgctgtatgcagag 
agtgtttacaggcattggagtttttacatgctaatcaagtgatccacagagacatcaaaa 
gtgacaatgtacttttgggaatggaaggatctgttaagctcactgactttggtttctgtg 
cccagatcacccctgagcagagcaaacgcagtaccatggtcggaacgccatactggatgg 
caccagaggtggttacacggaaagcttatggccctaaagtcgacatatggtctctgggta 
tcatggctattgagatggtagaaggagagcctccatacctcaatgaaaatcccttgaggg 
ccttgtacctaatagcaactaatggaaccccagaacttcagaatccagagaaactttccc 
caatatttcgggatttcttaaatcgatgtttggaaatggatgtggaaaaaaggggttcag 
ccaaagaattattacagcatcctttcctgaaactggccaaaccgttatctagcttgacac 
cactgatcatggcagctaaagaagcaatgaagagtaaccgttaacatcactgctgtggcc 
tcatactcttttttccattttctacaagaagccttttagtatatgaaaattattactctt 
tttggggtttaaagaaatggtctgcataacctgaatgaaagaagcaaatgactattctct 
gaagacaaccaagagaaaattgcaaaaagacaagtatgacttttatatgaaccccttctt 
tagggtccagaaggaattgtggactgaatcactagccttaggtctttcagcaaacagcct 
atcagggccatttatcatgtgtgagatttgcattttactttgctgactttgttgtaatag 
atcccattcattgtcccctttggggtatttccaatacttgaatggcagattggagttttt 
cagagtatgtgtttcatctgctagtctttctctccttcatagcttttcttttcctggact 
tgctccttttgagttgcttttgcgtttctcatgcctaggcaagtgtaatagaaattatgt 
agctccttatgttggcaaaggagctctatatagtttcactttgtataaaagttaggacca 
gctgttgttacatgtaatattttagttcagaacttgacctgaaggaagggaagaaaagta 
tgtgatttttaccttttttaacaaatgtgaaaaagtcagttttagaaatttcgtggtagt 
aagttcggcatttgttacatgtatagagagaagactaataatctctatttataactaaat 
cattgagatagaaaaagattcccattgactgtagacttcttcccattttgtcttcccttc 
tgcctgtttccccttcaggcttggctctaggaaccaaagtgatttgttgttgttccaacc 
tgggctttgtgactttggttagtgccactaccttcttccctcctttcccccttcaatttg 
gaaataaatttctgtatatgttgcaattttaggtttaggtttgttctttttctttttcat 
taatcctctctcacctcacagataccccctcccatggcaaataatataataaccagtgaa 
ttttcaggaatttaaaaattagcttttttccacttaaaggagaaaaatatttgggactag 
cagcagaggcagtaagagatgtgaaccttggtgagctctgatacagtgagaagagattat 
actcatgaaagagaatgttagtgttacagagaagcagccgatagcaaatcgactgtagag 
acttggcggcggtggcattgccccaggtcgtcagcagtgtggtattatctatgagaactt 
gagcgacagagtatttcttgatgaatttatagatcatttgagatgttgagttactttagt 
ttagttttgttttgttttttcaaataagtagagactattgtaaaaaacgagaaaggaaaa 
tgaaatgtgcgtgttgatagcaataatttgtttcttttaaagattctaaaaggtctgaga 
cctgtagcattaattatttgagtgccctcccttctcccctcccctcccttttctcttctc 
ttttttcctctcctctccttctcctttattcattgttttgcttttggagtaggtgttgtt 
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caagtatctgtggtttggttctggcattttgttcccaccatccccttcccccattaactt 
cccccctgcttgccatcctgcagtagtataaatcatgaataaaaaataattttgctgttg 
tagtatacattggagaaactggcaggttttatttccattattttatttccactatatcta 
tgataagatgcaattataaggagagaagtgactgttttttattgataaggcaagattttc 
agaaaaatgagtaaaataattaatgaaacatatttagagcacttaatggtctctgttttc 
aatataattcttgatttcatttttctctggaatatattggccttctacagctattactga 
attatagaaactggtttatttctggcagaaagctgcagtgccacctgagttccaaatttt 
accattctttgtaaacagttggatggattatgataaagaagatgctaccaatgaaataga 
aaaccaacgagatgagaagactgtgatcctcatgtactcagaggcacttccctcctaagt 
caaagaccatcctcactgactatgtgccaacgcctcgtttcaggcttgtgactcaacaaa 
gggcttttccattgatagaagcagtttgggatttgtagttgcgacttcttcgatagttac 
ctgcacgtccattgctggcaactgacttgtcattaaaacctggctctttggttaagggag 
ctacgctgtggtttattcttaagttacgtggataaactaacctctaacagaaatatactt 
tggttaattttgaaatgtgtcatttttaaacaatcttaaaagtaatacagaattgtgatt 
tattaattttaaaacattcagaacttgttgaaagaaaaattatatctgaatcaagattca 
tgttttttatttttattttttttgatacagagtctcactctgtcactcaggctgagattg 
aaccgtggcactccagcctggtgtgacagagcgagactttatctcaataaaaaacaaaaa 
gaccaaaaacagtaaatcaccgtgataggcaaaaggcattcctttattatttttggcatc 
ttttaaactattgtggactctgggtccaatttgaaagtgattaagctaaacatatttgac 
ctttgaactagtcccatttaagaccattacggaactggttcctctttgatctccccatgt 
ctgaatggctaataatacacattgagagggcaccaacgg 

SEQ ID NO: 400 . >Z26993_T7 ft TY Transcript # LN 4014 # Source Gene: Z26993 
# Encoded protein: Z26993_P1 

ttcccgccccgcttccccttccctcccctcccctccccgcaccgcgcgctagcccggggc 
ggctccgcagcccgccgggagctctgaccgaggcgcctcgctggggcggggaccttgcct 
tgcccggggccatttcataattctgaatcatgtctgataacggagaactggaagataagc 
ctccagcacctcctgtgcgaatgagcagcaccatctttagcactggaggcaaagaccctt 
tgtcagccaatcacagtttgaaacctttgccctctgttccagaagagaaaaagcccaggc 
ataaaatcatctccatattctcaggcacagagaaaggaagtaaaaagaaagaaaaggaac 
ggccagaaatttctcctccatctgattttgagcacaccatccatgttggctttgatgctg 
ttactggagaattcactggcatgccagaacagtgggctcgattactacagacctccaata 
tcaccaaactagagcaaaagaagaatcctcaggctgtgctggatgtcctaaagttctacg 
actccaacacagtgaagcagaaatatctgagctttactcctcctgagaaagatggctttc 
cttctggaacaccagcactgaatgccaagggaacagaagcacccgcagtagtgacagagg 
aggaggatgatgatgaagagactgctcctcccgttattgccccgcgaccggatcatacga 
aatcaatttacacacggtctgtaattgaccctgttcctgcaccagttggtgattcacatg 
ttgatggtgctgccaagtctttagacaaacagaaaaagaagactaagatgacagatgaag 
agattatggagaaattaagaactatcgtgagcataggtgaccctaagaaaaaatatacaa 
gatatgaaaaaattggacaaggggcttctggtacagttttcactgctactgacgttgcac 
tgggacaggaggttgctatcaaacaaattaatttacagaaacagccaaagaaggaactga 
tcattaacgagattctggtgatgaaagaattgaaaaatcccaacatcgttaactttttgg 
acagttacctggtaggagatgaattgtttgtggtcatggaataccttgctggggggtcac 
tcactgatgtggtaacagaaacgtgcatggatgaagcacagattgctgctgtatgcagag 
agtgtttacaggcattggagtttttacatgctaatcaagtgatccacagagacatcaaaa 
gtgacaatgtacttttgggaatggaaggatctgttaagctcactgactttggtttctgtg 
cccagatcacccctgagcagagcaaacgcagtaccatggtcggaacgccatactggatgg 
caccagaggtggttacacggaaagcttatggccctaaagtcgacatatggtctctgggta 
tcatggctattgagatggtagaaggagagcctccatacctcaatgaaaatcccttgaggg 
ccttgtacctaatagcaactaatggaaccccagaacttcagaatccagagaaactttccc 
caatatttcgggatttcttaaatcgatgtttggaaatggatgtggaaaaaaggggttcag 
ccaaagaattattacagcatcctttcctgaaactggccaaaccgttatctagcttgacac 
cactgatcatggcagctaaagaagcaatgaagagtaaccgttaacatcactgctgtggcc 
tcatactcttttttccattttctacaagaagccttttagtatatgaaaattattactctt 
tttggggtttaaagaaatggtctgcataacctgaatgaaagaagcaaatgactattctct 
gaagacaaccaagagaaaattgcaaaaagacaagtatgacttttatatgaaccccttctt 
tagggtccagaaggaattgtggactgaatcactagccttaggtctttcagcaaacagcct 
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atcagggccatttatcatgtgtgagatttgcattttactttgctgactttgttgtaatag 
atcccattcattgtcccctttggggtatttccaatacttgaatggcagattggagttttt 
cagagtatgtgtttcatctgctagtctttctctccttcatagcttttcttttcctggact 
tgctccttttgagttgcttttgcgtttctcatgcctaggcaagtgtaatagaaattatgt 
agctccttatgttggcaaaggagctctatatagtttcactttgtataaaagttaggacca 
gctgttgttacatgtaatattttagttcagaacttgacctgaaggaagggaagaaaagta 
tgtgatttttaccttttttaacaaatgtgaaaaagtcagttttagaaatttcgtggtagt 
aagttcggcatttgttacatgtatagagagaagactaataatctctatttataactaaat 
cattgagatagaaaaagattcccattgactgtagacttcttcccattttgtcttcccttc 
tgcctgtttccccttcaggcttggctctaggaaccaaagtgatttgttgttgttccaacc 
tgggctttgtgactttggttagtgccactaccttcttccctcctttcccccttcaatttg 
gaaataaatttctgtatatgttgcaattttaggtttaggtttgttctttttctttttcat 
taatcctctctcacctcacagataccccctcccatggcaaataatataataaccagtgaa 
ttttcaggaatttaaaaattagcttttttccacttaaaggagaaaaatatttgggactag 
cagcagaggcagtaagagatgtgaaccttggtgagctctgatacagtgagaagagattat 
actcatgaaagagaatgttagtgttacagagaagcagccgatagcaaatcgactgtagag 
acttggcggcggtggcattgccccaggtcgtcagcagtgtggtattatctatgagaactt 
gagcgacagagtatttcttgatgaatttatagatcatttgagatgttgagttactttagt 
ttagttttgttttgttttttcaaataagtagagactattgtaaaaaacgagaaaggaaaa 
tgaaatgtgcgtgttgatagcaataatttgtttcttttaaagattctaaaaggtctgaga 
cctgtagcattaattatttgagtgccctcccttctcccctcccctcccttttctcttctc 
ttttttcctctcctctccttctcctttattcattgttttgcttttggagtaggtgttgtt 
caagtatctgtggtttggttctggcattttgttcccaccatccccttcccccattaactt 
cccccctgcttgccatcctgcagtagtataaatcatgaataaaaaataattttgctgttg 
tagtatacattggagaaactggcaggttttatttccattattttatttccactatatcta 
tgataagatgcaattataaggagagaagtgactgttttttattgataaggcaagattttc 
agaaaaatgagtaaaataattaatgaaacatatttagagcacttaatggtctctgttttc 
aatataattcttgatttcatttttctctggaatatattggccttctacagctattactga 
attatagaaactggtttatttctggcagaaagctgcagtgccacctgagttccaaatttt 
accattctttgtaaacagttggatggattatgataaagaagatgctaccaatgaaataga 
aaaccaacgagatgagaagactgtgatcctcatgtactcagaggcacttccctcctaagt 
caaagaccatcctcactgactatgtgccaacgcctcgtttcaggcttgtgactcaacaaa 
gggcttttccattgatagaagcagtttgggatttgtagttgcgacttcttcgatagttac 
ctgcacgtccattgctggcaactgacttgtcattaaaacctggctctttggtta 

SEQ ID NO: 401 . >Z26993_T8 # TY Transcript # LN 3572 # Source Gene: Z26993 
# Encoded protein: 226993_P1 

ttcccgccccgcttccccttccctcccctcccctccccgcaccgcgcgctagcccggggc 
ggctccgcagcccgccgggagctctgaccgaggcgcctcgctggggcggggaccttgcct 
tgcccggggccatttcataattctgaatcatgtctgataacggagaactggaagataagc 
ctccagcacctcctgtgcgaatgagcagcaccatctttagcactggaggcaaagaccctt 
tgtcagccaatcacagtttgaaacctttgccctctgttccagaagagaaaaagcccaggc 
ataaaatcatctccatattctcaggcacagagaaaggaagtaaaaagaaagaaaaggaac 
ggccagaaatttctcctccatctgattttgagcacaccatccatgttggctttgatgctg 
ttactggagaattcactggcatgccagaacagtgggctcgattactacagacctccaata 
tcaccaaactagagcaaaagaagaatcctcaggctgtgctggatgtcctaaagttctacg 
actccaacacagtgaagcagaaatatctgagctttactcctcctgagaaagatggctttc 
cttctggaacaccagcactgaatgccaagggaacagaagcacccgcagtagtgacagagg 
aggaggatgatgatgaagagactgctcctcccgttattgccccgcgaccggatcatacga 
aatcaatttacacacggtctgtaattgaccctgttcctgcaccagttggtgattcacatg 
ttgatggtgctgccaagtctttagacaaacagaaaaagaagactaagatgacagatgaag 
agattatggagaaattaagaactatcgtgagcataggtgaccctaagaaaaaatatacaa 
gatatgaaaaaattggacaaggggcttctggtacagttttcactgctactgacgttgcac 
tgggacaggaggttgctatcaaacaaattaatttacagaaacagccaaagaaggaactga 
tcattaacgagattctggtgatgaaagaattgaaaaatcccaacatcgttaactttttgg 
acagttacctggtaggagatgaattgtttgtggtcatggaataccttgctggggggtcac 
tcactgatgtggtaacagaaacgtgcatggatgaagcacagattgctgctgtatgcagag 
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agtgtttacaggcattggagtttttacatgctaatcaagtgatccacagagacatcaaaa 
gtgacaatgtacttttgggaatggaaggatctgttaagctcactgactttggtttctgtg 
cccagatcacccctgagcagagcaaacgcagtaccatggtcggaacgccatactggatgg 
caccagaggtggttacacggaaagcttatggccctaaagtcgacatatggtctctgggta 
tcatggctattgagatggtagaaggagagcctccatacctcaatgaaaatcccttgaggg 
ccttgtacctaatagcaactaatggaaccccagaacttcagaatccagagaaactttccc 
caatatttcgggatttcttaaatcgatgtttggaaatggatgtggaaaaaaggggttcag 
ccaaagaattattacagcatcctttcctgaaactggccaaaccgttatctagcttgacac 
cactgatcatggcagctaaagaagcaatgaagagtaaccgttaacatcactgctgtggcc 
tcatactcttttttccattttctacaagaagccttttagtatatgaaaattattactctt 
tttggggtttaaagaaatggtctgcataacctgaatgaaagaagcaaatgactattctct 
gaagacaaccaagagaaaattgcaaaaagacaagtatgacttttatatgaaccccttctt 
tagggtccagaaggaattgtggactgaatcactagccttaggtctttcagcaaacagcct 
atcagggccatttatcatgtgtgagatttgcattttactttgctgactttgttgtaatag 
atcccattcattgtcccctttggggtatttccaatacttgaatggcagattggagttttt 
cagagtatgtgtttcatctgctagtctttctctccttcatagcttttcttttcctggact 
tgctccttttgagttgcttttgcgtttctcatgcctaggcaagtgtaatagaaattatgt 
agctccttatgttggcaaaggagctctatatagtttcactttgtataaaagttaggacca 
gctgttgttacatgtaatattttagttcagaacttgacctgaaggaagggaagaaaagta 
tgtgatttttaccttttttaacaaatgtgaaaaagtcagttttagaaatttcgtggtagt 
aagttcggcatttgttacatgtatagagagaagactaataatctctatttataactaaat 
cattgagatagaaaaagattcccattgactgtagacttcttcccattttgtcttcccttc 
tgcctgtttccccttcaggcttggctctaggaaccaaagtgatttgttgttgttccaacc 
tgggctttgtgactttggttagtgccactaccttcttccctcctttcccccttcaatttg 
gaaataaatttctgtatatgttgcaattttaggtttaggtttgttctttttctttttcat 
taatcctctctcacctcacagataccccctcccatggcaaataatataataaccagtgaa 
ttttcaggaatttaaaaattagcttttttccacttaaaggagaaaaatatttgggactag 
cagcagaggcagtaagagatgtgaaccttggtgagctctgatacagtgagaagagattat 
actcatgaaagagaatgttagtgttacagagaagcagccgatagcaaatcgactgtagag 
acttggcggcggtggcattgccccaggtcgtcagcagtgtggtattatctatgagaactt 
gagcgacagagtatttcttgatgaatttatagatcatttgagatgttgagttactttagt 
ttagttttgttttgttttttcaaataagtagagactattgtaaaaaacgagaaaggaaaa 
tgaaatgtgcgtgttgatagcaataatttgtttcttttaaagattctaaaaggtctgaga 
cctgtagcattaattatttgagtgccctcccttctcccctcccctcccttttctcttctc 
ttttttcctctcctctccttctcctttattcattgttttgcttttggagtaggtgttgtt 
caagtatctgtggtttggttctggcattttgttcccaccatccccttcccccattaactt 
cccccctgcttgccatcctgcagtagtataaatcatgaataaaaaataattttgctgttg 
tagtatacattggagaaactggcaggttttatttccattattttatttccactatatcta 
tgataagatgcaattataaggagagaagtgactgttttttattgataaggcaagattttc 
agaaaaatgagtaaaataattaatgaaacata 

SEQ ID NO:402. >Z26993_T9 # TY Transcript # LN 3425 # Source Gene: Z26993 
# Encoded protein: Z26993_P1 

ttcccgccccgcttccccttccctcccctcccctccccgcaccgcgcgctagcccggggc 
ggctccgcagcccgccgggagctctgaccgaggcgcctcgctggggcggggaccttgcct 
tgcccggggccatttcataattctgaatcatgtctgataacggagaactggaagataagc 
ctccagcacctcctgtgcgaatgagcagcaccatctttagcactggaggcaaagaccctt 
tgtcagccaatcacagtttgaaacctttgccctctgttccagaagagaaaaagcccaggc 
ataaaatcatctccatattctcaggcacagagaaaggaagtaaaaagaaagaaaaggaac 
ggccagaaatttctcctccatctgattttgagcacaccatccatgttggctttgatgctg 
ttactggagaattcactggcatgccagaacagtgggctcgattactacagacctccaata 
tcaccaaactagagcaaaagaagaatcctcaggctgtgctggatgtcctaaagttctacg 
actccaacacagtgaagcagaaatatctgagctttactcctcctgagaaagatggctttc 
cttctggaacaccagcactgaatgccaagggaacagaagcacccgcagtagtgacagagg 
aggaggatgatgatgaagagactgctcctcccgttattgccccgcgaccggatcatacga 
aatcaatttacacacggtctgtaattgaccctgttcctgcaccagttggtgattcacatg 
ttgatggtgctgccaagtctttagacaaacagaaaaagaagactaagatgacagatgaag 
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agattatggagaaattaagaactatcgtgagcataggtgaccctaagaaaaaatatacaa 
gatatgaaaaaattggacaaggggcttctggtacagttttcactgctactgacgttgcac 
tgggacaggaggttgctatcaaacaaattaatttacagaaacagccaaagaaggaactga 
tcattaacgagattctggtgatgaaagaattgaaaaatcccaacatcgttaactttttgg 
acagttacctggtaggagatgaattgtttgtggtcatggaataccttgctggggggtcac 
tcactgatgtggtaacagaaacgtgcatggatgaagcacagattgctgctgtatgcagag 
agtgtttacaggcattggagtttttacatgctaatcaagtgatccacagagacatcaaaa 
gtgacaatgtacttttgggaatggaaggatctgttaagctcactgactttggtttctgtg 
cccagatcacccctgagcagagcaaacgcagtaccatggtcggaacgccatactggatgg 
caccagaggtggttacacggaaagcttatggccctaaagtcgacatatggtctctgggta 
tcatggctattgagatggtagaaggagagcctccatacctcaatgaaaatcccttgaggg 
ccttgtacctaatagcaactaatggaaccccagaacttcagaatccagagaaactttccc 
caatatttcgggatttcttaaatcgatgtttggaaatggatgtggaaaaaaggggttcag 
ccaaagaattattacagcatcctttcctgaaactggccaaaccgttatctagcttgacac 
cactgatcatggcagctaaagaagcaatgaagagtaaccgttaacatcactgctgtggcc 
tcatactcttttttccattttctacaagaagccttttagtatatgaaaattattactctt 
tttggggtttaaagaaatggtctgcataacctgaatgaaagaagcaaatgactattctct 
gaagacaaccaagagaaaattgcaaaaagacaagtatgacttttatatgaaccccttctt 
tagggtccagaaggaattgtggactgaatcactagccttaggtctttcagcaaacagcct 
atcagggccatttatcatgtgtgagatttgcattttactttgctgactttgttgtaatag 
atcccattcattgtcccctttggggtatttccaatacttgaatggcagattggagttttt 
cagagtatgtgtttcatctgctagtctttctctccttcatagcttttcttttcctggact 
tgctccttttgagttgcttttgcgtttctcatgcctaggcaagtgtaatagaaattatgt 
agctccttatgttggcaaaggagctctatatagtttcactttgtataaaagttaggacca 
gctgttgttacatgtaatattttagttcagaacttgacctgaaggaagggaagaaaagta 
tgtgatttttaccttttttaacaaatgtgaaaaagtcagttttagaaatttcgtggtagt 
aagttcggcatttgttacatgtatagagagaagactaataatctctatttataactaaat 
cattgagatagaaaaagattcccattgactgtagacttcttcccattttgtcttcccttc 
tgcctgtttccccttcaggcttggctctaggaaccaaagtgatttgttgttgttccaacc 
tgggctttgtgactttggttagtgccactaccttcttccctcctttcccccttcaatttg 
gaaataaatttctgtatatgttgcaattttaggtttaggtttgttctttttctttttcat 
taatcctctctcacctcacagataccccctcccatggcaaataatataataaccagtgaa 
ttttcaggaatttaaaaattagcttttttccacttaaaggagaaaaatatttgggactag 
cagcagaggcagtaagagatgtgaaccttggtgagctctgatacagtgagaagagattat 
actcatgaaagagaatgttagtgttacagagaagcagccgatagcaaatcgactgtagag 
acttggcggcggtggcattgccccaggtcgtcagcagtgtggtattatctatgagaactt 
gagcgacagagtatttcttgatgaatttatagatcatttgagatgttgagttactttagt 
ttagttttgttttgttttttcaaataagtagagactattgtaaaaaacgagaaaggaaaa 
tgaaatgtgcgtgttgatagcaataatttgtttcttttaaagattctaaaaggtctgaga 
cctgtagcattaattatttgagtgccctcccttctcccctcccctcccttttctcttctc 
ttttttcctctcctctccttctcctttattcattgttttgcttttggagtaggtgttgtt 
caagtatctgtggtttggttctggcattttgttcccaccatccccttcccccattaactt 
cccccctgcttgccatcctgcagtagtataaatcatgaataaaaaataattttgctgttg 
tagta 

SEQ ID NO: 403. >Z26993_T10 # TY Transcript # LN 3340 # Source Gene: Z26993 
# Encoded protein: Z26993_P1 

ttcccgccccgcttccccttccctcccctcccctccccgcaccgcgcgctagcccggggc 
ggctccgcagcccgccgggagctctgaccgaggcgcctcgctggggcggggaccttgcct 
tgcccggggccatttcataattctgaatcatgtctgataacggagaactggaagataagc 
ctccagcacctcctgtgcgaatgagcagcaccatctttagcactggaggcaaagaccctt 
tgtcagccaatcacagtttgaaacctttgccctctgttccagaagagaaaaagcccaggc 
ataaaatcatctccatattctcaggcacagagaaaggaagtaaaaagaaagaaaaggaac 
ggccagaaatttctcctccatctgattttgagcacaccatccatgttggctttgatgctg 
ttactggagaattcactggcatgccagaacagtgggctcgattactacagacctccaata 
tcaccaaactagagcaaaagaagaatcctcaggctgtgctggatgtcctaaagttctacg 
actccaacacagtgaagcagaaatatctgagctttactcctcctgagaaagatggctttc 
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cttctggaacaccagcactgaatgccaagggaacagaagcacccgcagtagtgacagagg 
aggaggatgatgatgaagagactgctcctcccgttattgccccgcgaccggatcatacga 
aatcaatttacacacggtctgtaattgaccctgttcctgcaccagttggtgattcacatg 
ttgatggtgctgccaagtctttagacaaacagaaaaagaagactaagatgacagatgaag 
agattatggagaaattaagaactatcgtgagcataggtgaccctaagaaaaaatatacaa 
gatatgaaaaaattggacaaggggcttctggtacagttttcactgctactgacgttgcac 
tgggacaggaggttgctatcaaacaaattaatttacagaaacagccaaagaaggaactga 
tcattaacgagattctggtgatgaaagaattgaaaaatcccaacatcgttaactttttgg 
acagttacctggtaggagatgaattgtttgtggtcatggaataccttgctggggggtcac 
tcactgatgtggtaacagaaacgtgcatggatgaagcacagattgctgctgtatgcagag 
agtgtttacaggcattggagtttttacatgctaatcaagtgatccacagagacatcaaaa 
gtgacaatgtacttttgggaatggaaggatctgttaagctcactgactttggtttctgtg 
cccagatcacccctgagcagagcaaacgcagtaccatggtcggaacgccatactggatgg 
caccagaggtggttacacggaaagcttatggccctaaagtcgacatatggtctctgggta 
tcatggctattgagatggtagaaggagagcctccatacctcaatgaaaatcccttgaggg 
ccttgtacctaatagcaactaatggaaccccagaacttcagaatccagagaaactttccc 
caatatttcgggatttcttaaatcgatgtttggaaatggatgtggaaaaaaggggttcag 
ccaaagaattattacagcatcctttcctgaaactggccaaaccgttatctagcttgacac 
cactgatcatggcagctaaagaagcaatgaagagtaaccgttaacatcactgctgtggcc 
tcatactcttttttccattttctacaagaagccttttagtatatgaaaattattactctt 
tttggggtttaaagaaatggtctgcataacctgaatgaaagaagcaaatgactattctct 
gaagacaaccaagagaaaattgcaaaaagacaagtatgacttttatatgaaccccttctt 
tagggtccagaaggaattgtggactgaatcactagccttaggtctttcagcaaacagcct 
atcagggccatttatcatgtgtgagatttgcattttactttgctgactttgttgtaatag 
atcccattcattgtcccctttggggtatttccaatacttgaatggcagattggagttttt 
cagagtatgtgtttcatctgctagtctttctctccttcatagcttttcttttcctggact 
tgctccttttgagttgcttttgcgtttctcatgcctaggcaagtgtaatagaaattatgt 
agctccttatgttggcaaaggagctctatatagtttcactttgtataaaagttaggacca 
gctgttgttacatgtaatattttagttcagaacttgacctgaaggaagggaagaaaagta 
tgtgatttttaccttttttaacaaatgtgaaaaagtcagttttagaaatttcgtggtagt 
aagttcggcatttgttacatgtatagagagaagactaataatctctatttataactaaat 
cattgagatagaaaaagattcccattgactgtagacttcttcccattttgtcttcccttc 
tgcctgtttccccttcaggcttggctctaggaaccaaagtgatttgttgttgttccaacc 
tgggctttgtgactttggttagtgccactaccttcttccctcctttcccccttcaatttg 
gaaataaatttctgtatatgttgcaattttaggtttaggtttgttctttttctttttcat 
taatcctctctcacctcacagataccccctcccatggcaaataatataataaccagtgaa 
ttttcaggaatttaaaaattagcttttttccacttaaaggagaaaaatatttgggactag 
cagcagaggcagtaagagatgtgaaccttggtgagctctgatacagtgagaagagattat 
actcatgaaagagaatgttagtgttacagagaagcagccgatagcaaatcgactgtagag 
acttggcggcggtggcattgccccaggtcgtcagcagtgtggtattatctatgagaactt 
gagcgacagagtatttcttgatgaatttatagatcatttgagatgttgagttactttagt 
ttagttttgttttgttttttcaaataagtagagactattgtaaaaaacgagaaaggaaaa 
tgaaatgtgcgtgttgatagcaataatttgtttcttttaaagattctaaaaggtctgaga 
cctgtcgctcaagttctcatagataataccacactgctgacgacctggggcaatgccacc 
gccgccaagtctctacagtcgatttgctatcggctgcttctctgtaacactaacattctc 
tttcttgagtataatctcttctcactgtatcaggtcgagc 

SEQ ID NO: 404. >Z26993_T11 # TY Transcript # LN 3607 # Source Gene: Z26993 
# Encoded protein: Z26993_P1 

ttcccgccccgcttccccttccctcccctcccctccccgcaccgcgcgctagcccggggc 
ggctccgcagcccgccgggagctctgaccgaggcgcctcgctggggcggggaccttgcct 
tgcccggggccatttcataattctgaatcatgtctgataacggagaactggaagataagc 
ctccagcacctcctgtgcgaatgagcagcaccatctttagcactggaggcaaagaccctt 
tgtcagccaatcacagtttgaaacctttgccctctgttccagaagagaaaaagcccaggc 
ataaaatcatctccatattctcaggcacagagaaaggaagtaaaaagaaagaaaaggaac 
ggccagaaatttctcctccatctgattttgagcacaccatccatgttggctttgatgctg 
ttactggagaattcactggcatgccagaacagtgggctcgattactacagacctccaata 
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Figur 1 (Cont'd) 



tcaccaaactagagcaaaagaagaatcctcaggctgtgctggatgtcctaaagttctacg 
actccaacacagtgaagcagaaatatctgagctttactcctcctgagaaagatggctttc 
cttctggaacaccagcactgaatgccaagggaacagaagcacccgcagtagtgacagagg 
aggaggatgatgatgaagagactgctcctcccgttattgccccgcgaccggatcatacga 
aatcaatttacacacggtctgtaattgaccctgttcctgcaccagttggtgattcacatg 
ttgatggtgctgccaagtctttagacaaacagaaaaagaagactaagatgacagatgaag 
agattatggagaaattaagaactatcgtgagcataggtgaccctaagaaaaaatatacaa 
gatatgaaaaaattggacaaggggcttctggtacagttttcactgctactgacgttgcac 
tgggacaggaggttgctatcaaacaaattaatttacagaaacagccaaagaaggaactga 
tcattaacgagattctggtgatgaaagaattgaaaaatcccaacatcgttaactttttgg 
acagttacctggtaggagatgaattgtttgtggtcatggaataccttgctggggggtcac 
tcactgatgtggtaacagaaacgtgcatggatgaagcacagattgctgctgtatgcagag 
agtgtttacaggcattggagtttttacatgctaatcaagtgatccacagagacatcaaaa 
gtgacaatgtacttttgggaatggaaggatctgttaagctcactgactttggtttctgtg 
cccagatcacccctgagcagagcaaacgcagtaccatggtcggaacgccatactggatgg 
caccagaggtggttacacggaaagcttatggccctaaagtcgacatatggtctctgggta 
tcatggctattgagatggtagaaggagagcctccatacctcaatgaaaatcccttgaggg 
ccttgtacctaatagcaactaatggaaccccagaacttcagaatccagagaaactttccc 
caatatttcgggatttcttaaatcgatgtttggaaatggatgtggaaaaaaggggttcag 
ccaaagaattattacagcatcctttcctgaaactggccaaaccgttatctagcttgacac 
cactgatcatggcagctaaagaagcaatgaagagtaaccgttaacatcactgctgtggcc 
tcatactcttttttccattttctacaagaagccttttagtatatgaaaattattactctt 
tttggggtttaaagaaatggtctgcataacctgaatgaaagaagcaaatgactattctct 
gaagacaaccaagagaaaattgcaaaaagacaagtatgacttttatatgaaccccttctt 
tagggtccagaaggaattgtggactgaatcactagccttaggtctttcagcaaacagcct 
atcagggccatttatcatgtgtgagatttgcattttactttgctgactttgttgtaatag 
atcccattcattgtcccctttggggtatttccaatacttgaatggcagattggagttttt 
cagagtatgtgtttcatctgctagtctttctctccttcatagcttttcttttcctggact 
tgctccttttgagttgcttttgcgtttctcatgcctaggcaagtgtaatagaaattatgt 
agctccttatgttggcaaaggagctctatatagtttcactttgtataaaagttaggacca 
gctgttgttacatgtaatattttagttcagaacttgacctgaaggaagggaagaaaagta 
tgtgatttttaccttttttaacaaatgtgaaaaagtcagttttagaaatttcgtggtagt 
aagttcggcatttgttacatgtatagagagaagactaataatctctatttataactaaat 
cattgagatagaaaaagattcccattgactgtagacttcttcccattttgtcttcccttc 
tgcctgtttccccttcaggcttggctctaggaaccaaagtgatttgttgttgttccaacc 
tgggctttgtgactttggttagtgccactaccttcttccctcctttcccccttcaatttg 
gaaataaatttctgtatatgttgcaattttaggtttaggtttgttctttttctttttcat 
taatcctctctcacctcacagataccccctcccatggcaaataatataataaccagtgaa 
ttttcaggaatttaaaaattagcttttttccacttaaaggagaaaaatatttgggactag 
cagcagaggcagtaagagatgtgaaccttggtgagctctgatacagtgagaagagattat 
actcatgaaagagaatgttagtgttacagagaagcagccgatagcaaatcgactgtagag 
acttggcggcggtggcattgccccaggtcgtcagcatagtataaatcatgaataaaaaat 
aattttgctgttgtagtatacattggagaaactggcaggttttatttccattattttatt 
tccactatatctatgataagatgcaattataaggagagaagtgactgttttttattgata 
aggcaagattttcagaaaaatgagtaaaataattaatgaaacatatttagagcacttaat 
ggtctctgttttcaatataattcttgatttcatttttctctggaatatattggccttcta 
cagctattactgaattatagaaactggtttatttctggcagaaagctgcagtgccacctg 
agttccaaattttaccattctttgtaaacagttggatggattatgataaagaagatgcta 
ccaatgaaatagaaaaccaacgagatgagaagactgtgatcctcatgtactcagaggcac 
ttccctcctaagtcaaagaccatcctcactgactatgtgccaacgcctcgtttcaggctt 
gtgactcaacaaagggcttttccattgatagaagcagtttgggatttgtagttgcgactt 
cttcgatagttacctgcacgtccattgctggcaactgacttgtcattaaaacctggctct 
ttggtta 

SEQ ID NO: 405. >Z26993_T12 # TY Transcript # LN 3165 # Source Gene: Z26993 
# Encoded protein: Z26993_P1 

ttcccgccccgcttccccttccctcccctcccctccccgcaccgcgcgctagcccggggc 
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Figure 1 (Cont'd) 



ggctccgcagcccgccgggagctctgaccgaggcgcctcgctggggcggggaccttgcct 
tgcccggggccatttcataattctgaatcatgtctgataacggagaactggaagataagc 
ctccagcacctcctgtgcgaatgagcagcaccatctttagcactggaggcaaagaccctt 
tgtcagccaatcacagtttgaaacctttgccctctgttccagaagagaaaaagcccaggc 
ataaaatcatctccatattctcaggcacagagaaaggaagtaaaaagaaagaaaaggaac 
ggccagaaatttctcctccatctgattttgagcacaccatccatgttggctttgatgctg 
ttactggagaattcactggcatgccagaacagtgggctcgattactacagacctccaata 
tcaccaaactagagcaaaagaagaatcctcaggctgtgctggatgtcctaaagttctacg 
actccaacacagtgaagcagaaatatctgagctttactcctcctgagaaagatggctttc 
cttctggaacaccagcactgaatgccaagggaacagaagcacccgcagtagtgacagagg 
aggaggatgatgatgaagagactgctcctcccgttattgccccgcgaccggatcatacga 
aatcaatttacacacggtctgtaattgaccctgttcctgcaccagttggtgattcacatg 
ttgatggtgctgccaagtctttagacaaacagaaaaagaagactaagatgacagatgaag 
agattatggagaaattaagaactatcgtgagcataggtgaccctaagaaaaaatatacaa 
gatatgaaaaaattggacaaggggcttctggtacagttttcactgctactgacgttgcac 
tgggacaggaggttgctatcaaacaaattaatttacagaaacagccaaagaaggaactga 
tcattaacgagattctggtgatgaaagaattgaaaaatcccaacatcgttaactttttgg 
acagttacctggtaggagatgaattgtttgtggtcatggaataccttgctggggggtcac 
tcactgatgtggtaacagaaacgtgcatggatgaagcacagattgctgctgtatgcagag 
agtgtttacaggcattggagtttttacatgctaatcaagtgatccacagagacatcaaaa 
gtgacaatgtacttttgggaatggaaggatctgttaagctcactgactttggtttctgtg 
cccagatcacccctgagcagagcaaacgcagtaccatggtcggaacgccatactggatgg 
caccagaggtggttacacggaaagcttatggccctaaagtcgacatatggtctctgggta 
tcatggctattgagatggtagaaggagagcctccatacctcaatgaaaatcccttgaggg 
ccttgtacctaatagcaactaatggaaccccagaacttcagaatccagagaaactttccc 
caatatttcgggatttcttaaatcgatgtttggaaatggatgtggaaaaaaggggttcag 
ccaaagaattattacagcatcctttcctgaaactggccaaaccgttatctagcttgacac 
cactgatcatggcagctaaagaagcaatgaagagtaaccgttaacatcactgctgtggcc 
tcatactcttttttccattttctacaagaagccttttagtatatgaaaattattactctt 
tttggggtttaaagaaatggtctgcataacctgaatgaaagaagcaaatgactattctct 
gaagacaaccaagagaaaattgcaaaaagacaagtatgacttttatatgaaccccttctt 
tagggtccagaaggaattgtggactgaatcactagccttaggtctttcagcaaacagcct 
atcagggccatttatcatgtgtgagatttgcattttactttgctgactttgttgtaatag 
atcccattcattgtcccctttggggtatttccaatacttgaatggcagattggagttttt 
cagagtatgtgtttcatctgctagtctttctctccttcatagcttttcttttcctggact 
tgctccttttgagttgcttttgcgtttctcatgcctaggcaagtgtaatagaaattatgt 
agctccttatgttggcaaaggagctctatatagtttcactttgtataaaagttaggacca 
gctgttgttacatgtaatattttagttcagaacttgacctgaaggaagggaagaaaagta 
tgtgatttttaccttttttaacaaatgtgaaaaagtcagttttagaaatttcgtggtagt 
aagttcggcatttgttacatgtatagagagaagactaataatctctatttataactaaat 
cattgagatagaaaaagattcccattgactgtagacttcttcccattttgtcttcccttc 
tgcctgtttccccttcaggcttggctctaggaaccaaagtgatttgttgttgttccaacc 
tgggctttgtgactttggttagtgccactaccttcttccctcctttcccccttcaatttg 
gaaataaatttctgtatatgttgcaattttaggtttaggtttgttctttttctttttcat 
taatcctctctcacctcacagataccccctcccatggcaaataatataataaccagtgaa 
ttttcaggaatttaaaaattagcttttttccacttaaaggagaaaaatatttgggactag 
cagcagaggcagtaagagatgtgaaccttggtgagctctgatacagtgagaagagattat 
actcatgaaagagaatgttagtgttacagagaagcagccgatagcaaatcgactgtagag 
acttggcggcggtggcattgccccaggtcgtcagcatagtataaatcatgaataaaaaat 
aattttgctgttgtagtatacattggagaaactggcaggttttatttccattattttatt 
tccactatatctatgataagatgcaattataaggagagaagtgactgttttttattgata 
aggcaagattttcagaaaaatgagtaaaataattaatgaaacata 

SEQ ID NO: 406. >Z26993_T13 § TY Transcript # LN 2665 # Source Gene: Z26993 
# Encoded protein: Z26993_P1 

ttcccgccccgcttccccttccctcccctcccctccccgcaccgcgcgctagcccggggc 
ggctccgcagcccgccgggagctctgaccgaggcgcctcgctggggcggggaccttgcct 
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Figure 1 (Cont'd) 



tgcccggggccatttcataattctgaatcatgtctgataacggagaactggaagataagc 
ctccagcacctcctgtgcgaatgagcagcaccatctttagcactggaggcaaagaccctt 
tgtcagccaatcacagtttgaaacctttgccctctgttccagaagagaaaaagcccaggc 
ataaaatcatctccatattctcaggcacagagaaaggaagtaaaaagaaagaaaaggaac 
ggccagaaatttctcctccatctgattttgagcacaccatccatgttggctttgatgctg 
ttactggagaattcactggcatgccagaacagtgggctcgattactacagacctccaata 
tcaccaaactagagcaaaagaagaatcctcaggctgtgctggatgtcctaaagttctacg 
actccaacacagtgaagcagaaatatctgagctttactcctcctgagaaagatggctttc 
cttctggaacaccagcactgaatgccaagggaacagaagcacccgcagtagtgacagagg 
aggaggatgatgatgaagagactgctcctcccgttattgccccgcgaccggatcatacga 
aatcaatttacacacggtctgtaattgaccctgttcctgcaccagttggtgattcacatg 
ttgatggtgctgccaagtctttagacaaacagaaaaagaagactaagatgacagatgaag 
agattatggagaaattaagaactatcgtgagcataggtgaccctaagaaaaaatatacaa 
gatatgaaaaaattggacaaggggcttctggtacagttttcactgctactgacgttgcac 
tgggacaggaggttgctatcaaacaaattaatttacagaaacagccaaagaaggaactga 
tcattaacgagattctggtgatgaaagaattgaaaaatcccaacatcgttaactttttgg 
acagttacctggtaggagatgaattgtttgtggtcatggaataccttgctggggggtcac 
tcactgatgtggtaacagaaacgtgcatggatgaagcacagattgctgctgtatgcagag 
agtgtttacaggcattggagtttttacatgctaatcaagtgatccacagagacatcaaaa 
gtgacaatgtacttttgggaatggaaggatctgttaagctcactgactttggtttctgtg 
cccagatcacccctgagcagagcaaacgcagtaccatggtcggaacgccatactggatgg 
caccagaggtggttacacggaaagcttatggccctaaagtcgacatatggtctctgggta 
tcatggctattgagatggtagaaggagagcctccatacctcaatgaaaatcccttgaggg 
ccttgtacctaatagcaactaatggaaccccagaacttcagaatccagagaaactttccc 
caatatttcgggatttcttaaatcgatgtttggaaatggatgtggaaaaaaggggttcag 
ccaaagaattattacagcatcctttcctgaaactggccaaaccgttatctagcttgacac 
cactgatcatggcagctaaagaagcaatgaagagtaaccgttaacatcactgctgtggcc 
tcatactcttttttccattttctacaagaagccttttagtatatgaaaattattactctt 
tttggggtttaaagaaatggtctgcataacctgaatgaaagaagcaaatgactattctct 
gaagacaaccaagagaaaattgcaaaaagacaagtatgacttttatatgaaccccttctt 
tagggtccagaaggaattgtggactgaatcactagccttaggtctttcagcaaacagcct 
atcagggccatttatcatgtgtgagatttgcattttactttgctgactttgttgtaatag 
atcccattcattgtcccctttggggtatttccaatacttgaatggcagattggagttttt 
cagagtatgtgtttcatctgctagtctttctctccttcatagcttttcttttcctggact 
tgctccttttgagttgcttttgcgtttctcatgcctaggcaagtgtaatagaaattatgt 
agctccttatgttggcaaaggagctctatatagtttcactttgtataaaagttaggacca 
gctgttgttacatgtaatattttagttcagaacttgacctgaaggaagggaagaaaagta 
tgtgatttttaccttttttaacaaatgtgaaaaagtcagttttagaaatttcgtggtagt 
aagttcggcatttgttacatgtatagagagaagactaataatctctatttataactaaat 
cattgagatagaaaaagattcccattgactgtagacttcttcccattttgtcttcccttc 
tgcctgtttccccttcaggcttggctctaggaaccaaagtgatttgttgttgttccaacc 
tgggctttgtgactttggttagtgccactaccttcttccctcctttcccccttcaatttg 
gaaataaatttctgtatatgttgca 

SEQ ID NO: 407. >Z26993_T14 # TY Transcript » LN 2123 # Source Gene: Z26993 
# Encoded protein: Z26993_P6 

ttcccgccccgcttccccttccctcccctcccctccccgcaccgcgcgctagcccggggc 
ggctccgcagcccgccgggagctctgaccgaggcgcctcgctggggcggggaccttgcct 
tgcccggggccatttcataattctgaatcatgtctgataacggagaactggaagataagc 
ctccagcacctcctgtgcgaatgagcagcaccatctttagcactggaggcaaagaccctt 
tgtcagccaatcacagtttgaaacctttgccctctgttccagaagagaaaaagcccaggc 
ataaaatcatctccatattctcaggcacagagaaaggaagtaaaaagaaagaaaaggaac 
ggccagaaatttctcctccatctgattttgagcacaccatccatgttggctttgatgctg 
ttactggagaattcactggcatgccagaacagtgggctcgattactacagacctccaata 
tcaccaaactagagcaaaagaagaatcctcaggctgtgctggatgtcctaaagttctacg 
actccaacacagtgaagcagaaatatctgagctttactcctcctgagaaagatggctttc 
cttctggaacaccagcactgaatgccaagggaacagaagcacccgcagtagtgacagagg 
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Figur 1 (Cont'd) 



aggaggatgatgatgaagagactgctcctcccgttattgccccgcgaccggatcatacga 
aatcaatttacacacggtctgtaattgaccctgttcctgcaccagttggtgattcacatg 
ttgatggtgctgccaagtctttagacaaacagaaaaagaagactaagatgacagatgaag 
agattatggagaaattaagaactatcgtgagcataggtgaccctaagaaaaaatatacaa 
gatatgaaaaaattggacaaggggcttctggtacagttttcactgctactgacgttgcac 
tgggacaggaggttgctatcaaacaaattaatttacagaaacagccaaagaaggaactga 
tcattaacgagattctggtgatgaaagaattgaaaaatcccaacatcgttaactttttgg 
acagttacctggtaggagatgaattgtttgtggtcatggaataccttgctggggggtcac 
tcactgatgtggtaacagaaacgtgcatggatgaagcacagattgctgctgtatgcagag 
agtgtttacaggcattggagtttttacatgctaatcaagtgatccacagagacatcaaaa 
gtgacaatgtacttttgggaatggaaggatctgttaagctcactgactttggtttctgtg 
cccagatcacccctgagcagagcaaacgcagtaccatggtcggaacgccatactggatgg 
caccagaggtggttacacggaaagcttatggccctaaagtcgacatatggtctctgggta 
tcatggctattgagatggtagaaggagagcctccatacctcaatgaaaatcccttgaggg 
ccttgtacctaatagcaactaatggaaccccagaacttcagaatccagagaaactttccc 
caatatttcgggatttcttaaatcgatgtttggaaatggatgtggaaaaaaggggttcag 
ccaaagaattattacagcatcctttcctgaaactggccaaaccgttatctagcttgacac 
cactgatcatggcagctaaagaagcaatgaagagtaaccgttaacatcactgctgtggcc 
tcatactcttttttccattttctacaagaagccttttagtatatgaaaattattactctt 
tttggggtttaaagaaatggtctgcataacctgaatgaaagaagcaaatgactattctct 
gaagacaaccaagagaaaattgcaaaaagacaagtatgacttttatatgaaccccttctt 
tagggtccagaaggaattgtggactgaatcactagccttaggtctttcagcaaacagcct 
atcagggccatttatcatgtgtgagatttgcattttactttgctgactttgttgtaatag 
atcccatgaatggcccctgggggttttcaatcttggattggaggtgggggtttcagagtg 
ttgccacgtctagctcctctccc 

SEQ ID NO: 408. >Z26993_T15 # TY Transcript # LN 2397 # Source Gene: Z26993 
# Encoded protein: Z26993_P1 

ttcccgccccgcttccccttccctcccctcccctccccgcaccgcgcgctagcccggggc 
ggctccgcagcccgccgggagctctgaccgaggcgcctcgctggggcggggaccttgcct 
tgcccggggccatttcataattctgaatcatgtctgataacggagaactggaagataagc 
ctccagcacctcctgtgcgaatgagcagcaccatctttagcactggaggcaaagaccctt 
tgtcagccaatcacagtttgaaacctttgccctctgttccagaagagaaaaagcccaggc 
ataaaatcatctccatattctcaggcacagagaaaggaagtaaaaagaaagaaaaggaac 
ggccagaaatttctcctccatctgattttgagcacaccatccatgttggctttgatgctg 
ttactggagaattcactggcatgccagaacagtgggctcgattactacagacctccaata 
tcaccaaactagagcaaaagaagaatcctcaggctgtgctggatgtcctaaagttctacg 
actccaacacagtgaagcagaaatatctgagctttactcctcctgagaaagatggctttc 
cttctggaacaccagcactgaatgccaagggaacagaagcacccgcagtagtgacagagg 
aggaggatgatgatgaagagactgctcctcccgttattgccccgcgaccggatcatacga 
aatcaatttacacacggtctgtaattgaccctgttcctgcaccagttggtgattcacatg 
ttgatggtgctgccaagtctttagacaaacagaaaaagaagactaagatgacagatgaag 
agattatggagaaattaagaactatcgtgagcataggtgaccctaagaaaaaatatacaa 
gatatgaaaaaattggacaaggggcttctggtacagttttcactgctactgacgttgcac 
tgggacaggaggttgctatcaaacaaattaatttacagaaacagccaaagaaggaactga 
tcattaacgagattctggtgatgaaagaattgaaaaatcccaacatcgttaactttttgg 
acagttacctggtaggagatgaattgtttgtggtcatggaataccttgctggggggtcac 
tcactgatgtggtaacagaaacgtgcatggatgaagcacagattgctgctgtatgcagag 
agtgtttacaggcattggagtttttacatgctaatcaagtgatccacagagacatcaaaa 
gtgacaatgtacttttgggaatggaaggatctgttaagctcactgactttggtttctgtg 
cccagatcacccctgagcagagcaaacgcagtaccatggtcggaacgccatactggatgg 
caccagaggtggttacacggaaagcttatggccctaaagtcgacatatggtctctgggta 
tcatggctattgagatggtagaaggagagcctccatacctcaatgaaaatcccttgaggg 
ccttgtacctaatagcaactaatggaaccccagaacttcagaatccagagaaactttccc 
caatatttcgggatttcttaaatcgatgtttggaaatggatgtggaaaaaaggggttcag 
ccaaagaattattacagcatcctttcctgaaactggccaaaccgttatctagcttgacac 
cactgatcatggcagctaaagaagcaatgaagagtaaccgttaacatcactgctgtggcc 
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tcatactcttttttccattttctacaagaagccttttagtatatgaaaattattactctt 
tttggggtttaaagaaatggtctgcataacctgaatgaaagaagcaaatgactattctct 
gaagacaaccaagagaaaattgcaaaaagacaagtatgacttttatatgaaccccttctt 
tagggtccagaaggaattgtggactgaatcactagccttaggtctttcagcaaacagcct 
atcagggccatttatcatgtgtgagatttgcattttactttgctgactttgttgtaatag 
atcccattcattgtcccctttggggtatttccaatacttgaatggcagattggagttttt 
cagagtatgtgtttcatctgctagtctttctctccttcatagcttttcttttcctggact 
tgctccttttgagttgcttttgcgtttctcatgcctaggcaagtgtaatagaaattatgt 
agctccttatgttggcaaaggagctctatatagtttcactttgtataaaagttaggacca 
gctgttgtaactggagctgttgtaactggagctgttgtagctggagctgttgtamctgga 
gctgttgtaactggagctgttgtaactgragctgttgtaactggagctgttgtaact 

SEQ ID NO: 409 . >Z26993_T16 # TY Transcript # LN 2372 # Source Gene: Z26993 

# Encoded protein: Z26993_P1 

ttcccgccccgcttccccttccctcccctcccctccccgcaccgcgcgctagcccggggc 
ggctccgcagcccgccgggagctctgaccgaggcgcctcgctggggcggggaccttgcct 
tgcccggggccatttcataattctgaatcatgtctgataacggagaactggaagataagc 
ctccagcacctcctgtgcgaatgagcagcaccatctttagcactggaggcaaagaccctt 
tgtcagccaatcacagtttgaaacctttgccctctgttccagaagagaaaaagcccaggc 
ataaaatcatctccatattctcaggcacagagaaaggaagtaaaaagaaagaaaaggaac 
ggccagaaatttctcctccatctgattttgagcacaccatccatgttggctttgatgctg 
ttactggagaattcactggcatgccagaacagtgggctcgattactacagacctccaata 
tcaccaaactagagcaaaagaagaatcctcaggctgtgctggatgtcctaaagttctacg 
actccaacacagtgaagcagaaatatctgagctttactcctcctgagaaagatggctttc 
cttctggaacaccagcactgaatgccaagggaacagaagcacccgcagtagtgacagagg 
aggaggatgatgatgaagagactgctcctcccgttattgccccgcgaccggatcatacga 
aatcaatttacacacggtctgtaattgaccctgttcctgcaccagttggtgattcacatg 
ttgatggtgctgccaagtctttagacaaacagaaaaagaagactaagatgacagatgaag 
agattatggagaaattaagaactatcgtgagcataggtgaccctaagaaaaaatatacaa 
gatatgaaaaaattggacaaggggcttctggtacagttttcactgctactgacgttgcac 
tgggacaggaggttgctatcaaacaaattaatttacagaaacagccaaagaaggaactga 
tcattaacgagattctggtgatgaaagaattgaaaaatcccaacatcgttaactttttgg 
acagttacctggtaggagatgaattgtttgtggtcatggaataccttgctggggggtcac 
tcactgatgtggtaacagaaacgtgcatggatgaagcacagattgctgctgtatgcagag 
agtgtttacaggcattggagtttttacatgctaatcaagtgatccacagagacatcaaaa 
gtgacaatgtacttttgggaatggaaggatctgttaagctcactgactttggtttctgtg 
cccagatcacccctgagcagagcaaacgcagtaccatggtcggaacgccatactggatgg 
caccagaggtggttacacggaaagcttatggccctaaagtcgacatatggtctctgggta 
tcatggctattgagatggtagaaggagagcctccatacctcaatgaaaatcccttgaggg 
ccttgtacctaatagcaactaatggaaccccagaacttcagaatccagagaaactttccc 
caatatttcgggatttcttaaatcgatgtttggaaatggatgtggaaaaaaggggttcag 
ccaaagaattattacagcatcctttcctgaaactggccaaaccgttatctagcttgacac 
cactgatcatggcagctaaagaagcaatgaagagtaaccgttaacatcactgctgtggcc 
tcatactcttttttccattttctacaagaagccttttagtatatgaaaattattactctt 
tttggggtttaaagaaatggtctgcataacctgaatgaaagaagcaaatgactattctct 
gaagacaaccaagagaaaattgcaaaaagacaagtatgacttttatatgaaccccttctt 
tagggtccagaaggaattgtggactgaatcactagccttaggtctttcagcaaacagcct 
atcagggccatttatcatgtgtgagatttgcattttactttgctgactttgttgtaatag 
atcccattcattgtcccctttggggtatttccaatacttgaatggcagattggagttttt 
cagagtatgtgtttcatctgctagtctttctctccttcatagcttttcttttcctggact 
tgctccttttgagttgcttttgcgtttctcatgcctaggcaagtgtaatagaaattatgt 
agctccttatgttggcaaaggagctctatatagtttcactttgtataaaaagtttaggga 
cccacgctggttgggtaccatgggaaataatttttaaggttcaagaaaacttggaccctt 
taagggaagagggggaaaaaaaaaagttnttg 

SEQ ID NO: 410. >Z26993JT17 # TY Transcript # LN 2248 # Source Gene: Z26993 

# Encoded protein: Z26993_P1 
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ttcccgccccgcttccccttccctcccctcccctccccgcaccgcgcgctagcccggggc 
ggctccgcagcccgccgggagctctgaccgaggcgcctcgctggggcggggaccttgcct 
tgcccggggccatttcataattctgaatcatgtctgataacggagaactggaagataagc 
ctccagcacctcctgtgcgaatgagcagcaccatctttagcactggaggcaaagaccctt 
tgtcagccaatcacagtttgaaacctttgccctctgttccagaagagaaaaagcccaggc 
ataaaatcatctccatattctcaggcacagagaaaggaagtaaaaagaaagaaaaggaac 
ggccagaaatttctcctccatctgattttgagcacaccatccatgttggctttgatgctg 
ttactggagaattcactggcatgccagaacagtgggctcgattactacagacctccaata 
tcaccaaactagagcaaaagaagaatcctcaggctgtgctggatgtcctaaagttctacg 
actccaacacagtgaagcagaaatatctgagctttactcctcctgagaaagatggctttc 
cttctggaacaccagcactgaatgccaagggaacagaagcacccgcagtagtgacagagg 
aggaggatgatgatgaagagactgctcctcccgttattgccccgcgaccggatcatacga 
aatcaatttacacacggtctgtaattgaccctgttcctgcaccagttggtgattcacatg 
ttgatggtgctgccaagtctttagacaaacagaaaaagaagactaagatgacagatgaag 
agattatggagaaattaagaactatcgtgagcataggtgaccctaagaaaaaatatacaa 
gatatgaaaaaattggacaaggggcttctggtacagttttcactgctactgacgttgcac 
tgggacaggaggttgctatcaaacaaattaatttacagaaacagccaaagaaggaactga 
tcattaacgagattctggtgatgaaagaattgaaaaatcccaacatcgttaactttttgg 
acagttacctggtaggagatgaattgtttgtggtcatggaataccttgctggggggtcac 
tcactgatgtggtaacagaaacgtgcatggatgaagcacagattgctgctgtatgcagag 
agtgtttacaggcat.tggagtttttacatgctaatcaagtgatccacagagacatcaaaa 
gtgacaatgtacttttgggaatggaaggatctgttaagctcactgactttggtttctgtg 
cccagatcacccctgagcagagcaaacgcagtaccatggtcggaacgccatactggatgg 
caccagaggtggttacacggaaagcttatggccctaaagtcgacatatggtctctgggta 
tcatggctattgagatggtagaaggagagcctccatacctcaatgaaaatcccttgaggg 
ccttgtacctaatagcaactaatggaaccccagaacttcagaatccagagaaactttccc 
caatatttcgggatttcttaaatcgatgtttggaaatggatgtggaaaaaaggggttcag 
ccaaagaattattacagcatcctttcctgaaactggccaaaccgttatctagcttgacac 
cactgatcatggcagctaaagaagcaatgaagagtaaccgttaacatcactgctgtggcc 
tcatactcttttttccattttctacaagaagccttttagtatatgaaaattattactctt 
tttggggtttaaagaaatggtctgcataacctgaatgaaagaagcaaatgactattctct 
gaagacaaccaagagaaaattgcaaaaagacaattggatggattatgataaagaagatgc 
taccaatgaaatagaaaaccaacgagatgagaagactgtgatcctcatgtactcagaggc 
acttccctcctaagtcaaagaccatcctcactgactatgtgccaacgcctcgtttcaggc 
ttgtgactcaacaaagggcttttccattgatagaagcagtttgggatttgtagttgcgac 
ttcttcgatagttacctgcacgtccattgctggcaactgacttgtcattaaaacctggct 
ctttggttaagggagctacgctgtggtttattcttaagttacgtggataaactaacctct 
aacagaaatatactttggttaattttga 

SEQ ID N0:411. >Z26993_T18 # TY Transcript # LN 1791 # Source Gene: Z26993 
tf Encoded protein: Z26993_P3 

ttcccgccccgcttccccttccctcccctcccctccccgcaccgcgcgctagcccggggc 
ggctccgcagcccgccgggagctctgaccgaggcgcctcgctggggcggggaccttgcct 
tgcccggggccatttcataattctgaatcatgtctgataacggagaactggaagataagc 
ctccagcacctcctgtgcgaatgagcagcaccatctttagcactggaggcaaagaccctt 
tgtcagccaatcacagtttgaaacctttgccctctgttccagaagagaaaaagcccaggc 
ataaaatcatctccatattctcaggcacagagaaaggaagtaaaaagaaagaaaaggaac 
ggccagaaatttctcctccatctgattttgagcacaccatccatgttggctttgatgctg 
ttactggagaattcactggcatgccagaacagtgggctcgattactacagacctccaata 
tcaccaaactagagcaaaagaagaatcctcaggctgtgctggatgtcctaaagttctacg 
actccaacacagtgaagcagaaatatctgagctttactcctcctgagaaagatggctttc 
cttctggaacaccagcactgaatgccaagggaacagaagcacccgcagtagtgacagagg 
aggaggatgatgatgaagagactgctcctcccgttattgccccgcgaccggatcatacga 
aatcaatttacacacggtctgtaattgaccctgttcctgcaccagttggtgattcacatg 
ttgatggtgctgccaagtctttagacaaacagaaaaagaagactaagatgacagatgaag 
agattatggagaaattaagaactatcgtgagcataggtgaccctaagaaaaaatatacaa 
gatatgaaaaaattggacaaggggcttctggtacagttttcactgctactgacgttgcac 
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tgggacaggaggttgctatcaaacaaattaatttacagaaacagccaaagaaggaactga 
tcattaacgagattctggtgatgaaagaattgaaaaatcccaacatcgttaactttttgg 
acagttacctggtaggagatgaattgtttgtggtcatggaataccttgctggggggtcac 
tcactgatgtggtaacagaaacgtgcatggatgaagcacagattgctgctgtatgcagag 
agtgtttacaggcattggagtttttacatgctaatcaagtgatccacagagacatcaaaa 
gtgacaatgtacttttgggaatggaaggatctgttaagctcagtgagtaacaaatggaca 
attcacaatcatttattataattttctgccttttgtttaattaaaaacttttcttgacca 
atgcaaggattaataatgatttaatataggaagaggtgagacctttgaaaactgagtgat 
cctctgagggttttaactaatcagtgccagagaaaggactcgctggataagacattacct 
aataattaccactagataagacattattaggtaatgtcttgtcttgtctttttgagatgg 
aggatcacttgaacccaggaggcggaggttgcagtgagccgagatcatgccatcgcactc 
cagcctggtcgacaaagcgaaactccgtctccataaataaataaataaatgttggtgaga 
agataaggaaactagaaatctcgttcattgcaaaatataaaatggtacatctgctttgga 
aaacaacttgacaattctgtgtgaagttaatacggaattaccagatgacct 

SEQ ID N0:412. >Z26993_T19 # TY Transcript # LN 1698 # Source Gene: Z26993 

# Encoded protein: Z26993_P4 

ttcccgccccgcttccccttccctcccctcccctccccgcaccgcgcgctagcccggggc 
ggctccgcagcccgccgggagctctgaccgaggcgcctcgctggggcggggaccttgcct 
tgcccggggccatttcataattctgaatcatgtctgataacggagaactggaagataagc 
ctccagcacctcctgtgcgaatgagcagcaccatctttagcactggaggcaaagaccctt 
tgtcagccaatcacagtttgaaacctttgccctctgttccagaagagaaaaagcccaggc 
ataaaatcatctccatattctcaggcacagagaaaggaagtaaaaagaaagaaaaggaac 
ggccagaaatttctcctccatctgattttgagcacaccatccatgttggctttgatgctg 
ttactggagaattcactggcatgccagaacagtgggctcgattactacagacctccaata 
tcaccaaactagagcaaaagaagaatcctcaggctgtgctggatgtcctaaagttctacg 
actccaacacagtgaagcagaaatatctgagctttactcctcctgagaaagatggctttc 
cttctggaacaccagcactgaatgccaagggaacagaagcacccgcagtagtgacagagg 
aggaggatgatgatgaagagactgctcctcccgttattgccccgcgaccggatcatacga 
aatcaatttacacacggtctgtaattgaccctgttcctgcaccagttggtgattcacatg 
ttgatggtgctgccaagtctttagacaaacagaaaaagaagactaagatgacagatgaag 
agattatggagaaattaagaactatcgtgagcataggtgaccctaagaaaaaatatacaa 
gatatgaaaaaattggacaaggggcttctggtacagttttcactgctactgacgttgcac 
tgggacaggaggttgctatcaaacaaattaatttacagaaacagccaaagaaggaactga 
tcattaacgagattctggtgatgaaagaattgaaaaatcccaacatcgttaactttttgg 
acaggtaagtatgactattccttaaacaccgggagaaaatgtagaaattttaagtctcat 
ggtttcgggggtggggctggccactttggaataaactgtcagtgccgagaaataagatgg 
aagaaagaagaggagaggtaaaaatagcacagtgcagttttagaaggtccttctctgcaa 
ggacatcacttggcaggtgagagaaggcagttaattgaaggccatagctggtggtgttta 
tactcatcgcaaacgacctgaagcgaacacatgaatggttggctgtgattcgtgtcttga 
ctgacagaggggactggttacatggtttttaaacttgtggatcaatttttatttcaaatg 
ctatccttcatggacaaatttatatgtctatcaaaacctcatttagttgaggaattaatg 
ttaaataacagaccaaaaagcagaatagaaaatgatgcatattggctgggtgcagtggct 
cacacctgtaatcccagctactcaggaggctgaggcaggagaattgcttgaacccaggag 
gtggaggttgcagtgagccgagatcgcaccactgcactccagcctgggcaatggaatgag 
actccatctcaaaaaaaa 

SEQ ID NO: 413. >Z26993_T20 # TY Transcript > LN 977 # Source Gene: Z26993 

# Encoded protein: Z26993_P5 

ttcccgccccgcttccccttccctcccctcccctccccgcaccgcgcgctagcccggggc 
ggctccgcagcccgccgggagctctgaccgaggcgcctcgctggggcggggaccttgcct 
tgcccggggccatttcataattctgaatcatgtctgataacggagaactggaagataagc 
ctccagcacctcctgtgcgaatgagcagcaccatctttagcactggaggcaaagaccctt 
tgtcagccaatcacagtttgaaacctttgccctctgttccagaagagaaaaagcccaggc 
ataaaatcatctccatattctcaggcacagagaaaggaagtaaaaagaaagaaaaggaac 
ggccagaaatttctcctccatctgattttgagcacaccatccatgttggctttgatgctg 
ttactggagaattcactggcatgccagaacagtgggctcgattactacagacctccaata 
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tcaccaaactagagcaaaagaagaatcctcaggctgtgctggatgtcctaaagttctacg 
actccaacacagtgaagcagaaatatctgagctttactcctcctgtttccttcaggaaca 
ccagcactgaatgccaagggaacagaagcacccgcagtagtgacagaggaggaggatgat 
gatgaagagactgctcctcccgttattgccccgcgaccggatcatacgaaatcaatttac 
acacggtctgtaattgaccctgttcctgcaccagttggtgattcacatgttgatggtgct 
gccaagtctttagacaaacagaaaaagaagactaagatgacagatgaagagattatggag 
aaattaagtatgttatctacattttacatcattgttaaattgttcacggctcttaaaaca 
tttggcccagatggattttaacttacactttacattgtgacttaaagtataaagaataaa 
ttgtttccttatagcaa 

SEQ ID NO:414. >Z26993_T21 # TY Transcript # LN 746 # Source Gene: Z26993 

# Encoded protein: 

gtgggcaaactgtaactttaaaatgattcaggatcataacttcagttatttttctgtgct 
gttacgtagcagctttccagttttcttcctaactctcaagctaccagctctagttttatt 
gttagggtgtttaatatttgaattcttcaacatagaataccatgtgtttaggaaaaataa 
aagtgtgactgtatttggtatgttgtatttttagttacctggtaggagatgaattgtttg 
tggtcatggaataccttgctggggggtcactcactgatgtggtaacagaaacgtgcatgg 
atgaagcacagattgctgctgtatgcagagaggtaggtttgcctccttcttgatatgtta 
tggtgatatgtggttagcaagaagtgaacaaaaggaaatttttctgcacaggtaattgtt 
attgtgggaggtgctttctctgtatacattctctgcaaattactccatctccatgccatt 
tcatcctacacaggagtttcagccctttccttcaggtataagataatttgggaatgaatt 
gtggacctttgaagatccagattcacaaggaaattctggtagagtgatgatgatcattta 
ccatccaaacaggacacttttgagagtaaaagggtatatatcattagaaatacaccactc 
aggctgggcgcggtggctcacgcctgtaatcccagcactttgggaggccgaggcgggtgg 
atcacaaggtcaggagatcgagacca 

SEQ ID NO: 415. >Z26993_T22 # TY Transcript # LN 778 # Source Gene: Z26993 

# Encoded protein: 

ttcttgagattacctgattctaaagctgaacatctgagaacctaattaatctctgacaaa 
ctggtacagtcttttcaaaatgtgaccattattgagcaaatgattagttaggaggtctaa 
cctaatggaaataatgtgtcttcttgtttctgttacttttaaacttttaatgtgtcttgt 
ttctgatacttttaaacttttattatgtatttaagtttctttagtgttttgtgtttgaaa 
cctgttatttttggaacttagtattatgtttgaaaatcacttataaaaacagtggagttc 
attaaaaggaataataatatcacaatcaatttacacttaaaatgcttgactgagcttcca 
gctttttgtttattctaggaactatcgtgagcataggtgaccctaagaaaaaatatacaa 
gatatgaaaaaattggacaagggtaagtatttgtgactgtattacgataatattcagtat 
tcagtatttcctggctttatagcatgatgtttattttgcctgccgacatcaagtacttat 
atagtattctgtgagtggtacccttgggggatttgtgtaaataaataatgcagttatttg 
tagatgctgttaaaaaactgtactagttgttagcaatgactgtgtatcattattttttct 
taagtgtcatttcccatctgtgataattagagggaagaattgcccataaaatatatttaa 
aagacaataagaatatttactgcatgactataagcaataaaaaatgcatttaaatcat 

SEQ ID NO: 416. >Z26993_P1 « TY Protein # CC #LN 524 # Source Gene: Z26993 

# Encoding Transcript: 1 

MSDNGELEDKPPAPPVRMSSTIFSTGGKDPLSANHSLKPLPSVPEEKKPRHKIISIFSGT 
EKGSKKKEKERPEISPPSDFEHTIHVGFDAVTGEFTGMPEQWARLLQTSNITKLEQKKNP 
QAVLDVLKFYDSNTVKQKYLSFTPPEKDGFPSGTPALNAKGTEAPAVVTEEEDDDEETAP 
PVIAPRPDHTKSIYTRSVIDPVPAPVGDSHVDGAAKSLDKQKKKTKMTDEEIMEKLRTIV 
SIGDPKKKYTRYEKIGQGASGTVFTATDVALGQEVAIKQINLQKQPKKELIINEILVMKE 
LKNPNIVNFLDSYLVGDELFWMEYLAGGSLTDWTETCMDEAQIAAVCRECLQALEFLH 
ANQVIH RDIKS DNVLLGMEGS VKLT DFGFCAQI T PEQSKRSTMVGT P YWMAPEWTRKAY 
GPKVDIWSLGIMAIEMVEGEPPYLNENPLRALYLIATNGTPELQNPEKLSPIFRDFLNRC 
LEMDVEKRGSAKELLQHPFLKIiAKPLSSLTPLIMAAKEAMKSNR 

SEQ ID NO: 417. >Z26993_P2 # TY Protein # CC #LN 424 # Source Gene: Z26993 

# Encoding Transcript: 3 

QGMSVNSPGNSIEANMDGVLKTDGGEISGRSFSFFLLHFSVPENMEMILCLGFFSSGTEG 
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KGFKLCWLTEEEDDDEETAPPVIAPRPDHTKSIYTRSVIDPVPAPVGDSHVDGAAKSLDK 
QKKKTKMTDEEIMEKLRTIVSIGDPKKKYTRYEKIGQGASGTVFTATDVALGQEVAIKQI 
NLQKQPKKELIINEILVMKELKNPNIVNFLDSYLVGDELFWMEYLAGGSLTDWTETCM 
DEAQ I AAVC REC LQAL E FLH ANQV I H RD I KS DN VLLGMEG S VKLT D FG FCAQ I T PEQ S KR 
STMVGTPYWMAPEWTRKAYGPKVDIWSLGIMAIEMVEGEPPYLNENPLRALYLIATNGT 
PELQN PEKLSP I FRDFLNRCLEMDVEKRGSAKELLQHPFLKLAKPLS SLTPLI MAAKEAM 
KSNR 

SEQ ID NO: 418. >Z26993_P6 # TY Protein # CC #LN 525 # Source Gene: Z26993 

# Encoding Transcript: 14 

MSDNGELEDKPPAPPVRMSSTIFSTGGKDPLSANHSLKPLPSVPEEKKPRHKIISIFSGT 
EKGSKKKEKERPEISPPSDFEHTIHVGFDAVTGEFTGMPEQWARLLQTSNITKLEQKKNP 
QAVLDVLKFYDSNTVKQKYLSFTPPEKDGLPSGTPALNAKGTEAPAVVTEEEDDDEETAP 
PVIAPRPDHTKSIYTRSVIDPVPAPVGDSHVDGAAKSLDKQKKKPKMTDEEIMEKLRTIV 
SIGDPKKKYTRYEKIGQGASGTVFTATDVALGQEVAIKQINLQKQPKKELIINEILVMKE 
LKNPNIVNFLDSYLVGDELFWMEYLAGGSLTDWTETACMDEAQIAAVCRECLQALEFL 
HANQVIHRDIKSDNVLLGMEGSVKLTDFGFCAQITPEQSKRSTMVGTPYWMAPEWTRKA 
YGPKVDIWSLGIMAIEMVEGEPPYLNENPLRALYLIATNGTPELQNPEKLSPIFRDFLNR 
CLEMDVEKRGSAKELLQHPFLKLAKPLSSLTPLIMAAKEAMKSNR 

SEQ ID NO: 419. >Z26993_P3 # TY Protein # CC #LN 386 8 Source Gene: Z26993 

# Encoding Transcript: 18 

MSDNGELEDKPPAPPVRMSSTIFSTGGKDPLSANHSLKPLPSVPEEKKPRHKIISIFSGT 
EKGSKKKEKERPEISPPSDFEHTIHVGFDAVTGEFTGMPEQWARLLQTSNITKLEQKKNP 
QAVLDVLKFYDSNTVKQKYLSFTPPEKDGFPSGTPALNAKGTEAPAWTEEEDDDEETAP 
PVIAPRPDHTKSIYTRSVIDPVPAPVGDSHVDGAAKSLDKQKKKTKMTDEEIMEKLRTIV 
SIGDPKKKYTRYEKIGQGASGTVFTATDVALGQEVAIKQINLQKQPKKELIINEILVMKE 
LKNPNIVNFLDSYLVGDELFVVMEYLAGGSLTDWTETCMDEAQIAAVCRECLQALEFLH 
ANQV I HRDI KS DN VLLGMEGS VKLS E 

SEQ ID NO: 420. >Z26993_P4 # TY Protein # CC #LN 312 # Source Gene: Z26993 

# Encoding Transcript: 19 

MSDNGELEDKPPAPPVRMSSTIFSTGGKDPLSANHSLKPLPSVPEEKKPRHKIISIFSGT 
EKGSKKKEKERPEISPPSDFEHTIHVGFDAVTGEFTGMPEQWARLLQTSNITKLEQKKNP 
QAVLDVLKFYDSNTVKQKYLSFTPPEKDGFPSGTPALNAKGTEAPAWTEEEDDDEETAP 
PVI APRPDHTKSI YTRSVI DPVPAPVGDSHVDGAAKSLDKQKKKTKMTDEEIMEKLRTIV 
SIGDPKKKYTRYEKIGQGASGTVFTATDVALGQEVAIKQINLQKQPKKELIINEILVMKE 
LKNPNIVNFLDR 

SEQ ID NO: 421. >Z26993_P5 # TY Protein # CC #LN 257 # Source Gene: Z26993 

# Encoding Transcript: 20 

MSDNGELEDKPPAPPVRMSSTIFSTGGKDPLSANHSLKPLPSVPEEKKPRHKIISIFSGT 
EKGSKKKEKERPEISPPSDFEHTIHVGFDAVTGEFTGMPEQWARLLQTSNITKLEQKKNP 
QAVLDVLKFYDSNTVKQKYLSFTPPVSFRNTMQLNAKGTEAPAWTEEEDDDEETAPPVI 
APRPDHTKSI YTRSVI DPVPAPVGDSHVDGAAKSLDKQKKKTKMTDEEIMEKLSMLSTFY 
IIVKLFTALKTFGPDGF 



SEQ ID NO: 422. >Z38709 # TY Consensus # Length 15608 # Number of exons 77 

gcggccgcacgcgcccgctcccccgccggggtcccccgccgcccctccagcgccgcccag 

cgtcgcgtccccgcgcgggccggtgcggcggcagcagaggcttcccgggtggcccggcct 

cccgagggtcgccccgccccggcccagctccgcccccggccgctgctccctgattggccg 

cccccccacctctccagggctcctctcctcctcctcgcccaatgggcaccggcacctggt 
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ccaaaaattcccctgaggggaccgagggggctcggccacttccagctcccgccggcggct 
gtgacggccgcagcgggtcgcagagcagggagtggacacgagagtggggagcggagaggg 
aaggaggaggaggaggagcaaaggtgttggagagaaaacttcagaaaggaacaggaaacc 
tgccggggagcgcggcggcggctgcggcttctctgggcgcctgggctgcgctcttcgcgg 
ggtgtcccgagctggcgcttggccccggtctggcttccccggcgcgcaccacggctgagc 
gcgacgctcagtggctcgggccgtgcctccgccgtggctcttggccgctgtagtcccgcg 
atccgatccgcttctgccgcggcggctccctggagagagggcggcgagggcgcagggaag 
aagagggacgtccactgtggaacatgaagcagcatgaagcagcatgactgagaaaatgtc 
cagcttcctctacataggggacatcgtgtccctgtacgcggagggctcggtcaacggctt 
catcagcaccttggggttagtggatgacagatgtgtggtgcacccagaggccggggacct 
tgccaaccctcccaagaagttcagagactgccttttcaaggtgtgccctatgaacagata 
ttctgcccagaagcaatattggaaagcaaagcaagccaaacaagggaaccacaccgaggc 
agccttgctgaagaaactacagcacgctgcagaactggaacaaaaacaaaatgaatcgga 
gaataagaaactgttgggagaaattgtaaaatacagtaatgttatacaactactgcatat 
aaaaagcaacaaatatcttactgtcaacaagagattacctgctttactggagaagaatgc 
catgcgtgtgtccttggatgctgcaggaaatgaagggtcttggttttatattcatccatt 
ctggaaactgagaagcgagggtgacaatttagcacatggacgatgctcaggaaatgctgc 
tagacactgagcgctgaataagtttcataaatacctgcaagtattgcatcaggggatcaa 
tagttaagatttaaaaaaattaacatgaattcgaagcaatataaaagtataattattctg 
agagagttttttcagttgaaactttgattacattttaaatgtgttttggatatttcttgt 
cttaggaatgtctttatcgtgaatctaatttttccagtgaattttgaaatacaaattcaa 
ttccacaggatgcaagcttctggatttaatttggttaataaattatctggtgatatcttc 
ctttggtaaggtttggtagaagcatggcatgcaggaatgtgatgcactttaagtcttgtg 
ttggttggatttcttaaaaaaaaaattgttgtaggagataaagttgttttgatgcctgtg 
aatgcagggcagccactacatgccagcaacatagagcttcttgataacccagggtgtaaa 
gaggtgaatgctgtcaattgcaacaccagctggaaaatcactttattcatgaaatatagt 
tcctatcgagaggatgtattaaaaggagtatgtgctacacaggacatttgctgggagaaa 
tgatggtagtcttactgaaattctagatgcaaaacctatgtattcactggtgttcataat 
ttagaatgtgactacacatgttaatacttactctaagtgttttactgaatttccccccca 
catttttaaatgtttaaataagcattaaataaatttaaataagctttaaaagtcaaatag 
ctagtagaagggcccagagtagcaaaaccagaaagagacacctgcttctcacatctgttt 
gttcagatgtctcttgacctttttgctagggggacgttgttagattatttcatgcggaac 
aagagaagtttttgacttgtgatgaatatgagaaaaaacagcacattttccttcgtacga 
ccttgcgccaatcagctacttctgctactagttctaaagcactctgggaaatagaggtgg 
ttcatcatgacccatgccgtgggggtgcaggacagtggaacagcttgttcagatttaagc 
atcttgcaactggaaactatttagctgcagaggcttatcaatcacaataatagatttgtc 
ctgaaatgccagtcgttggaaagagaaacatataaatataatcatttaaaaatacacaca 
tgatgtttatgatttcagtgttatatataataaaaattgatcaaaacacatttcattgta 
aaaaaaagtaaagcttctcttttaagaaaataagtaggtattatggacttaaaaacttac 
ggtctgcttcttgcttttgtgttagatggaggggtgcagttgttttttaattgattgcgg 
tacaagcatgactttcactgcccaggtatttatgatgtttctctctctctatcattttgc 
accatacctgacttctttatattcttctttggccagttaattgaagactgtcttcaaaga 
ttagtaatgggtctctgtattctctctctgttactcactgccatggctgcagtggaccct 
gcctttgatcttggatatgccattgccatacaaaatcaataaatttatctaaaatttact 
taatcctgattatcgagatgcccaaaatgaaggaaaaaatgtgagagatggagtccctcc 
aacttcaaagaaaaaacgccaggcaggggagaagatcatgtatactttggtttcagtccc 
gcatggcaatgacattgcatccctttttgaactagatgccacaactcttcagagagctga 
ctgcctggttccaaggaactcatatgttcggttaaggcatttatgcaccaacacatgggt 
aaccagtactagtatccccatagacacagatgaagagaggcctgttatgttaaaggtaaa 
tgtcactgaagatggtttaatattggaacctgccaaaccaaagaagataaagaagcgttc 
gcaatcgtgtctgttccactgtctgaagttcgagacttagactttgccaatgatgccaat 
aaagtactagcgaccacagttaaaaagctagaaaacggcacaataactcagaatgaaagg 
aggtttgtaaccaaattattggaagatctcatattctttgttgctgatgtgcctaataat 
ggacaagaagttctggatgtggttatcactaagccaaaccgagagcgtcaaaaattgatg 
agggaacaaaacatactggcacaggtatttggaattcttaaagcaccctttaaagagaaa 
gcaggagaaggctcgatgctgagacttgaagatctgggggatcaaagatatgcaccctac 
aagtacatgctgcggctctgttaccgcgtcctgagacactcgcagcaggattaccggaaa 
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aatcaggaatatattgctaagaatttctgtgtcatgcagtcccagattggctatgatatt 
ttggcagaagatactatcacagctttgttgcacaacaacagaaaactactagagaaacat 
atcacagcaaaagaaatagaaacatttgtcagtttactcaggagaaatcgggagccaagg 
tttttggattatttgtcagatctgtgtgtgtctaataccactgctatccctgtaactcaa 
gaactcatctgtaaatttatgttgagtccaggcaatgcagacattctcattcaaactaag 
gtggtctcaatgcaagcagacaaccccatggagagctccatcctttcagatgacattgat 
gatgaagaagtttggctctattggattgacagcaacaaggaacctcatggcaaagctatc 
aggcaccttgctcaagaggcaaaagaaggcaccaaagctgacttagaagttcttacctat 
tacaggtaccagctaaacctctttgcaaggatgtgcttggatcgccagtatctggccata 
aaccagatttctacacagctgtctgtagacctgatcctgcggtgtgtgtcggatgagagc 
ctgccgttcgacctccgagcgtccttctgtcgcctcatgctccacatgcacgttgaccgg 
gatccccaggagtccgtggtgcctgttcgctatgccaggctctggacagaaatccccaca 
aagatcacaattcatgaatatgattctataacagactcttccagaaatgatatgaagagg 
aaatttgccctgacaatggaatttgttgaagaatatttgaaagaagttgtaaaccagccc 
tttccttttggggataaagaaaaaaataaactgacatttgaggtggtccacttggctcgg 
aatcttatatactttggattttatagtttcagtgagttattaaggctaacaagaacactt 
ctggctattttagacattgtacaggcccccatgtcatcatactttgaaagattaagcaaa 
tttcaagatggaggaaacaatgtgatgagaaccattcatggggtgggagagatgatgacc 
cagatggtactcagtagaggctccatcttccccatgagcgtgccggatgtgccacccagc 
atccacccgagcaagcaagggagccccaccgagcatgaggatgtgactgtgatggacacc 
aagctgaagatcattgagattttgcagtttatcctgagtgtcagactggattataggatc 
tcatatatgctgtcaatatataagaaggagtttggagaggacaatgacaatgcggagaca 
tctgccagtggatctccagacactttactaccatcagctattgttcctgatatagatgaa 
attgcagctcaggcagaaactatgtttgcgggaaggtatagtaacatttatatatcttta 
tct tact tgtgaataacattt tact tagaattctattgatactttgat at tgttcactct 
atatctatttatttttagaccctctattgttgctaacataatttctgccagaaatgaagc 
tgctaatttactctggaggagaaagcatagcttttctttaggtcaggtaaaaaaaaggac 
aagagagatgagatcagtgtattgaactttcctctgaattctttgcttttctttcctcaa 
ataatgacagcccaagaatgccctttaactcccctggtataaatatcactgagtcttgta 
aaacattgcttccctgaatcttagatatttgaggcaagaaagtaacttctttaatgttat 
cattaactttatatgtaagaaataaagccattttctgccaattggaaatacccatgatta 
aataaaacaagacacctttgacatgactagcctcctttgccatcaggaatcctgccgttt 
attgttttaatattgtttgttcttcagaaatgaagttaatagtgatcaaaaagaaaaatt 
aaaatgtttaagctacaaaagaaaaaaatccagttcaacttgacgatgaaggaggcagga 
cgtttttacgggtcctcattcatctgatcatgcacgactacccgcctttgctgtctggag 
ccctgcagctgttgtttaagcacttcagccagagggcagaggttttacaggcatttaagc 
aggtgcaattactggtgtctaatcaagacgtagataactacaagcaaatcaaggcagatc 
tagaccagcttcgactgacagtagaaaagtctgagctatgggtggagaagagcagcaact 
atgagaatggagaaataggggaaagtcaagtgaaaggtggtgaagagccaattgaggaat 
caaacattttaagtccagtgcaggatggaacaaagaaacctcagattgacagcaacaaga 
gcaataactaccggattgtaaaggagattttgatcaggctaagtaaactctgtgtgcaga 
ataaaaagtgtcggaatcaacatcaacgattactgaaaaatatgggggcgcattcggtgg 
tgttggatcttctgcagataccctatgaaaagaatgatgaaaagatgaatgaagtaatga 
atctagcccatacatttctgcagaatttctgtcgaggaaatccacagaatcaagttcttc 
ttcataaacatctgaatttgtttttaactccaggtctccttgaagcagaaaccatgcggc 
acatcttcatgaacaattaccatctgtgcaacgaaattagcgagagagttgtacaacact 
ttgtgcactgcattgagacacatggccgccacgtggagtacctgaggtttttgcaaacaa 
ttgtaaaagcagatggtaaatatgtgaagaaatgccaggatatggtaatgacagagttga 
taaatgggggtgaagacgtgctgatattttacaatgatagagcatcatttccaatccttc 
tccatatgatgtgttcagagagagaccgaggggatgagagtggccccttagcctaccaca 
tcaccctggtggagttgctggcagcatgcacagaggggaaaaatgtctacactgaaatca 
agtgtaattcccttctcccgctggacgacatagtgagggtggtgacccatgacgactgca 
tccctgaggttaaaattgcttatgtgaactttgttaatcactgttatgttgacactgaag 
tggaaatgaaagaaatctatacaagtaaccacatttggaaattatttgagaacttcttgg 
tggatatggcaagggtttgcaacacaactacagacaggaaacatgcagacatctttttgg 
aaaagtgtgttactgagtcaataatgaatattgtgagcggcttctttaattctccctttt 
cagacaatagtaccagcctccagacacatcagccagtttttattcagctactgcaatctg 
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ccttcagaatttacaattgcacctggccaaacccagcgcagaaagcctcagtggaatcct 
gtatcagaactttggctgaagtggcaaaaaatcgtggaattgccattccagtggatttgg 
acagccaagttaatactcttttcatgaagagccattcaaatatggtgcagagagcagcaa 
tgggttggagactatcagctcgctctgggccacgctttaaggaagctcttggagggcctg 
cttgggattacagaaatattattgaaaagttacaggatgtagtggcctccttggagcacc 
agttcagcccaatgatgcaggctgaattctcagtgttggttgatgtattgtacagtccag 
aactgctgttccctgagggaagcgatgcaagaataagatgtggcgctttcatgtcgaagt 
tgattaatcatacaaagaaactaatggagaaagaagaaaaactgtgcattaaaattcttc 
agacattacgagaaatgttagagaagaaagacagctttgtggaagaggtatttttttaaa 
aaactttattatttaaaaataaagtaggatttccccaatactgtaacctatgaattacaa 
taggctattacgtgattacttgcattatatctggtaacacattaagaaagatacttctga 
atcgatactttaaaggtgattatagtattggtgtgaatggacacctatcaggagcctact 
ccaaaactgcacaggtgggaggaagcttttctggacaagattcagataagatggggatat 
caatgtcagacattcagtgtctgctggataaagaaggtgcatcagaacttgtcatcgatg 
ttatagtgaacaccaaaaatgacagaattttttcagaaggcattttcctcggcattgcct 
tgcttgaaggaggaaatacacaaacacaggtacttgttcatcaatcatggggattacatt 
tattattggtatttcttgtttacttgtcacagaaaccaaggaaaagtaatgtgtttcttt 
tagctcttattactatctaaaaaaaaatattctttctaccagcagttgcatgaacaaaaa 
aagtcagaaaaattctttaaagttctctatgatcgaatgaaggctgctcagaaagaaata 
agatcaacagtgacagttaataccatagatttaggtaacaaaaaaagggacgatgacaat 
gaattgatgacatctggtccacgaatgagagaattttggtcatggggtgaccctaaaacg 
gctctgcaggactaaacttaaatgagacataatcagtgcaaagcaaccaacggcatgaaa 
gatgtttgatttcattgtggcatgcatgtaagagattcaacactacatttaaaagaggga 
atgaaagggcaattaacagaagcttcttcagcaacatccaaagcatattgtgtatacaga 
agagaaatggatccagaaatagacattatgtgcacaggaccagaagcgggaaacactgag 
gaaaaatccgcagaggaagtaacaatgagtcccgcaattgccatcatgcagccaatactg 
agatttcttcagttactgtgtgagaatcacaaccgggaattgcagaacttcttgaggaat 
caaaacaacaaaacaaattacaacctagtctgtgagacccttcagtttctggactgcatt 
tgtggaagtacaaccggtggcctgggcctgttgggtctctacatcaatgagaagaatgta 
gcgctggtcaaccagaacctggagagcttgactgagtattgccagggcccttgccatgaa 
aatcagacctgtatcgctacacatgagtctaatgggattgatatcatcattgctttgatt 
ctgaatgacataaaccctcttggtaaataccgaatggacctggtgctccagctaaagaac 
aatgcatctaaacttttgctggccattatggaaagcagacatgacagtgagaatgcagaa 
agaattctttttaacatgagacccagagaactggtggatgtgatgaagaatgcctataac 
caaggattggaatgtgaccatggggatgatgagggtggagatgatggtgtttctccaaaa 
gatgttggacacaatatctatattctggcccatcagttggcccgccacaataaactgttg 
cagcagatgctcaaaccaggatcggatccagatgaaggagatgaagccttaaagtattat 
gccaaccacactgcacagattgagattgtccggcatgataggaccatggaacaaatagtt 
tttcctgtccccaatatatgtgaatacctcactcgagaatccaagtgccgtgtgttcaat 
acaactgaaagggatgaacaaggaagtaaagtgaatgactttttccagcaaacagaagat 
ctctacaatgaaatgaagtggcagaagaaaatcaggaataaccctgcactgttctggttc 
tcgaggcacatctctctctgggggagcatttccttcaacctggctgtgttcatcaattta 
gctgttgctctcttctacccatttggggatgatggagatgaaggtacactttctccattg 
ttctcggttcttctttggatagcagttgcgatctgcacatctatgctgtttttcttctcc 
aagcctgtgggtattcggccgtttcttgtatcaataatgctcagatcaatatatacaata 
ggtcttgggcctacattaatacttcttggtgcagctaatctttgtaataaaattgttttt 
ctggtgagttttgttggaaatcgtggcacgttcacccgtgggtaccgagcagtcatcctg 
gatatggcctttctctatcacgtggcgtatgtcctggtttgcatgctgggcctttttgtc 
catgaattcttctatagcttcctgctttttgatttggtgtacagggaagagactttgctg 
aatgtcataaaaagtgtcacacgaaatggccgctctattattctaactgcagtcctggct 
ctcatcctcgtctacctgttttccattattgggttcctttttttgaaggatgacttcact 
atggaagttgataggctgaaaaaccgaactcctgttacaggcagtcatcaagtgcctact 
atgactttaactaccatgatggaagcatgtgccaaggagaactgttcacccacaattcca 
gcttcaaatacagctgatgaagagtatgaagatggaattgaaaggacgtgtgacactctc 
cttatgtgcattgtcaccgtgctgaaccagggcctcaggaatggcggtggtgtgggggat 
gtgctaagaaggccatcgaaagatgagcccttgtttgctgcccgagtggtttatgacctt 
cttttttatttcattgttatcattattgttctgaacttgatttttggtgttatcatcgat 
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acttttgctgatctcagaagcgaaaaacagaaaaaagaagaaattctaaagacaacttgt 
ttcatctgtggacttgagagagacaagtttgataataaaacggtttcatttgaggagcac 
attaagtcagaacacaatatgtggcattatttgtacttcatagtcctggtgaaagttaaa 
gacccaacagaatacactggacctgaaagttatgtggctcaaatgattgtggagaagaat 
ttggattggtttcctcggatgcgagccatgtccctcgttagcaatgaaggcgacagtgag 
caaaatgaaattcggagccttcaggagaagttggaatcgaccatgagtctggtcaaacag 
ctgtcgggtcagctggcggagctcaaggagcagatgacagaacaaaggaagaataagcag 
agactgggcttcctcggatcaaacacaccccatgtgaatcatcacatgccaccacactga 
taccatggggggaagccgtgactagcctttcatcagtgtcctgcctgatcactgaataaa 
gaactgagatggaggggagtgaacagtgcctattgttgaaaagttaaaaacaaccaagtg 
ccaagatgttgagtgggttagctccgagaacaatttataactgtgttttcatggttgcga 
agacctaacctcaaatgcatctgctagaaagcgtacatcacacattcgcaatgcatcagg 
aagaaaaggcttgcccaaaaggctggagagggcagggagcggcaggatggaaggagacac 
ggggcagggagaactctcttctgctaaatcgataggagtcagttttgtcttaaatgctga 
ctacagccactgacatggttggctggaatttctttcttttaattgtggcatataggtttg 
tgacacaagaagtcatactttggtggctaagttttactaaggaaaataactgaaaagatt 
aaaagtgagagctgaaaagagaaatgataatgcttccaaactgtagctgtcacagggcaa 
tttctttatttataacatgaagcacaatggatttacagctctaggaacttagtactttgg 
agcttttgcctctcacactgacaacataacaggatgtgattgccttctctgggattcaga 
caggctctgtcaatgtggagcacaaaaggagattttcatataacttgttaaaaacatgtt 
ctaagtcatgtataggctaagattttaagaagatctgggggaataaaaagccaacagtaa 
gccccaggaaagggttttttgagaccatatgtatgctattaaaatatattgagattaata 
catgaaaatgcttaaaagtgataggaattattgagaaacttatgatggtggcattgcctt 
ttataaacatggagtgaaagccatttgactcaacgtttgctgtgcttaaagaattgcttc 
aggacccgagcgtttttatgtatgctgttcctgcagttaggaaaaaaaaatccaagaaat 
gtattgaatactcaagaaaatgccacatttcaattacatatttaaaactgtatctgtaag 
ggcttttcaaaatgtagcaagataaaaatctcttattcaaattgtttttgtattaaatca 
tgcactcagaatttgtcgagggagagaataacctggtgtgttcaggttattcttagagac 
tacacatttggaatagcagagcaaaatatggataatgaaagtgttgggaaaaaagtgatg 
tgacagggagtgaagacacttgataccagactctgggagatactctgcaagttgacctgg 
cctctcccccacaggaacaaaacactgcctccagagtctttaaattctcagttatcaacg 
ccaaggtttaaggtctagatagggtttgctatagggtttgctctgacaatttttaaagct 
tttggattgttctaacatagcttaagctattggttcctaaaaatcccaaatcaagatcta 
tgtagaatataaaggaagcctgaaccaatccttccacatactgttaagatgtagacttgg 
aacaaagctgttgggacccagagcaatgaatttttgaactgaagctactggtactcccca 
gcaccacctcatactaagaattcctcactctgacatgacagtattttttctgaccaggag 
ctgaaagaccctgacattcatgatccaaagatataagaaattttgatatgtctgcatgca 
gcacagaactttgaacctaggaggcagtcaataaatacatgagaaattcagctgtcattc 
agttactcttaattctttataaaactttaaatgataccacatatttgtttgtttaaaatg 
gctttcccaaaatcaaagtagactaaagcagccatctttaaaatccaggatcatcaatgc 
tattaacagtatcaggtaaaaagtacactttaaaatattttaatcaggcagagttttatg 
gatagagaatagaagagaaaggtagtaaatattgaacatattccaatataggaacctatc 
tctgttttagtacaaaatatttctgacatctgaactagaggtcaagagaataaattcatt 
tgtatacatctgagcaacctgtctttcagatgataaagtatctagccttttctgacacca 
taatagttcattttgtagggaataagccattaggtgtatataattgctttctagaaatga 
cctaatgtccccaaccactttgtagtggcagatcactgtttcacagcatattttctccca 
aggaaagtattcaaaagagactgcaactaacaagactcttatttcatcaaaatttaaata 
tttctgagttgtatttttaatgcctctttcttttctgcctaaatgcttagaaattataaa 
gcaaaaaaaaaaaaaaaaacagcaacaaaaaatcgaagcagacaaaaaaggcacttttca 
gaacatcaaattcctaatgaagaagaggagaagataatggggaaatttgacatttgatat 
aaatttatatttgttatgtgtatgtttgttaatgcaactggaatatttgacttaggtgag 
tatcagttaacatccttgtatttatatagtgacatcaaaataaatgcaatcatcttgaac 
tttgatgttaagggagattttgaaaaaatatttagttatttcaaattcatattggttcaa 
aagtatcagtttctgctgaataatatttttaatttcaaaagggttttgttccctctgtgc 
tccattctgaggctcaagagcttaatgcaagtatgttttctgagttaaaataacactttt 
agatgagaaaatgcttgtatcatacagggctataatataaataaataaattgtatgtatg 
caaaatttatcatattttctaccccttaaaaaattaagttagaaataagcttattttctt 
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gcacatcaatatttttctgttggtaaagaggacacaatttctagtagattgttaataaca 
tcaggaagatatttctttcgtcagaactaattgtgtgcttattcccatattctcagctca 
taactccctgttttggctgctctctttattttacaattgtttcattgcaaacagcaaggc 
agtatagcccacactcagccactagcccccagtccccaggactgagaatgagtggggagg 
ctggggagttggtgagagaaggtggagatagggatttctgcttcccagtactctatcatt 
agagaaaaggaggggtaaatattagctaagaagaagaaaaaagcttattcttcagggacc 
cagtagattggacgtaagcaagtaaatattgcacaggcatcatgatgtaatagttgatgg 
accccagagatacccctcacccatgcagagaacaaagctgtgtggaagagcaactaccat 
aagatggtgtgttccttccagcggctgttaaagccatcctgaataacaaatcagctattc 
ctacctacagraaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 
aaaaaaaaaaaaaaaaaacaaaaaggaggaagcgacccaaacaacaaataacaacaacac 
acaacgagcggcgcaccagaaactcaaacgtccccagcaccacacgacacccccgacacc 
ccaagccgcccccgccaccaacagtgcgccgcactcgctacccacaaccgcaaccgcgcc 
cgcagcgcacaccctacaagacccacccctacgcccaacctccgacgacccacgcgcccc 
ttctggacgatagctcccacactacccctacgccaccccccaacaagccagaccactcac 
tcacacgtcccttcgcatcactccactccctcctcctatcccatcacaatcacttatccg 
tgcattcacccacacacatcgccccgcgcacactgtttcctcccgctcttgccccccaaa 
acactatccctgcacctcacacccattccaatatgctgtgatacacgacacggtgcaccg 
cagaggagacagacatcttcagcacaaacacagatgcctgcaggagcttggcaagtcact 
aaattgcatgaaattgtgaggtgcacaccaaatgcagggatgggggaggccgtgggaagc 
tctgtattctcaaaatttgacagctaattccggtttttagaaaatgcctgaggccagtag 
aggccctctagtcactcactgctgctgtttctgatataattattgagaaagctatctcac 
ttaatagaagaaaacacgcactatcaaaaccagatagcctaacgtgcatgtgaaaatcga 
gaaagctgaaaacaaatccaggtacccttctctgaactggagtgtttccacagacttgaa 
taatttatgaaattatcacaccagtatttctcatatcaccaagaagactttctctcctgc 
agtagaggattgttatatttgcctaaaaaacacgattccaatatatgacaagggcagata 
atttataagtgaatgttaataaaattggatgtgtataaagacaggttgctcagatgtata 
caaatgaatttattctcttgacctctagttcagatgtcaaaaatattttggtccccccaa 
ccgcgaat 

SEQ ID NO: 423. >Z38709JT1 # TY Transcript # LN 12857 # Source Gene: Z38709 
# Encoded protein: Z38709_P11 

gcggccgcacgcgcccgctcccccgccggggtcccccgccgcccctccagcgccgcccag 
cgtcgcgtccccgcgcgggccggtgcggcggcagcagaggcttcccgggtggcccggcct 
cccgagggtcgccccgccccggcccagctccgcccccggccgctgctccctgattggccg 
cccccccacctctccagggctcctctcctcctcctcgcccaatgggcaccggcacctggt 
ccaaaaattcccctgaggggaccgagggggctcggccacttccagctcccgccggcggct 
gtgacggccgcagcgggtcgcagagcagggagtggacacgagagtggggagcggagaggg 
aaggaggaggaggaggagcaaaggtgttggagagaaaacttcagaaaggaacaggaaacc 
tgccggggagcgcggcggcggctgcggcttctctgggcgcctgggctgcgctcttcgcgg 
ggtgtcccgagctggcgcttggccccggtctggcttccccggcgcgcaccacggctgagc 
gcgacgctcagtggctcgggccgtgcctccgccgtggctcttggccgctgtagtcccgcg 
atccgatccgcttctgccgcggcggctccctggagagagggcggcgagggcgcagggaag 
aagagggacgtccactgtggaacatgaagcagcatgaagcagcatgactgagaaaatgtc 
cagcttcctctacataggggacatcgtgtccctgtacgcggagggctcggtcaacggctt 
catcagcaccttggggttagtggatgacagatgtgtggtgcacccagaggccggggacct 
tgccaaccctcccaagaagttcagagactgccttttcaaggtgtgccctatgaacagata 
ttctgcccagaagcaatattggaaagcaaagcaagccaaacaagggaaccacaccgaggc 
agccttgctgaagaaactacagcacgctgcagaactggaacaaaaacaaaatgaatcgga 
gaataagaaactgttgggagaaattgtaaaatacagtaatgttatacaactactgcatat 
aaaaagcaacaaatatcttactgtcaacaagagattacctgctttactggagaagaatgc 
catgcgtgtgtccttggatgctgcaggaaatgaagggtcttggttttatattcatccatt 
ctggaaactgagaagcgagggtgacaatattgttgtaggagataaagttgttttgatgcc 
tgtgaatgcagggcagccactacatgccagcaacatagagcttcttgataacccagggtg 
taaagaggtgaatgctgtcaattgcaacaccagctggaaaatcactttattcatgaaata 
tagttcctatcgagaggatgtattaaaaggaggggacgttgttagattatttcatgcgga 
acaagagaagtttttgacttgtgatgaatatgagaaaaaacagcacattttccttcgtac 
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gaccttgcgccaatcagctacttctgctactagttctaaagcactctgggaaatagaggt 
ggttcatcatgacccatgccgtgggggtgcaggacagtggaacagcttgttcagatttaa 
gcatcttgcaactggaaactatttagctgcagagcttaatcctgattatcgagatgccca 
aaatgaaggaaaaaatgtgagagatggagtccctccaacttcaaagaaaaaacgccaggc 
aggggagaagatcatgtatactttggtttcagtcccgcatggcaatgacattgcatccct 
ttttgaactagatgccacaactcttcagagagctgactgcctggttccaaggaactcata 
tgttcggttaaggcatttatgcaccaacacatgggtaaccagtactagtatccccataga 
cacagatgaagagaggcctgttatgttaaagattggaacctgccaaaccaaagaagataa 
agaagcgttcgcaatcgtgtctgttccactgtctgaagttcgagacttagactttgccaa 
tgatgccaataaagtactagcgaccacagttaaaaagctagaaaacggcacaataactca 
gaatgaaaggaggtttgtaaccaaattattggaagatctcatattctttgttgctgatgt 
gcctaataatggacaagaagttctggatgtggttatcactaagccaaaccgagagcgtca 
aaaattgatgagggaacaaaacatactggcacaggtatttggaattcttaaagcaccctt 
taaagagaaagcaggagaaggctcgatgctgagacttgaagatctgggggatcaaagata 
tgcaccctacaagtacatgctgcggctctgttaccgcgtcctgagacactcgcagcagga 
ttaccggaaaaatcaggaatatattgctaagaatttctgtgtcatgcagtcccagattgg 
ctatgatattttggcagaagatactatcacagctttgttgcacaacaacagaaaactact 
agagaaacatatcacagcaaaagaaatagaaacatttgtcagtttactcaggagaaatcg 
ggagccaaggtttttggattatttgtcagatctgtgtgtgtctaataccactgctatccc 
tgtaactcaagaactcatctgtaaatttatgttgagtccaggcaatgcagacattctcat 
tcaaactaaggtggtctcaatgcaagcagacaaccccatggagagctccatcctttcaga 
tgacattgatgatgaagaagtttggctctattggattgacagcaacaaggaacctcatgg 
caaagctatcaggcaccttgctcaagaggcaaaagaaggcaccaaagctgacttagaagt 
tcttacctattacaggtaccagctaaacctctttgcaaggatgtgcttggatcgccagta 
tctggccataaaccagatttctacacagctgtctgtagacctgatcctgcggtgtgtgtc 
ggatgagagcctgccgttcgacctccgagcgtccttctgtcgcctcatgctccacatgca 
cgttgaccgggatccccaggagtccgtggtgcctgttcgctatgccaggctctggacaga 
aatccccacaaagatcacaattcatgaatatgattctataacagactcttccagaaatga 
tatgaagaggaaatttgccctgacaatggaatttgttgaagaatatttgaaagaagttgt 
aaaccagccctttccttttggggataaagaaaaaaataaactgacatttgaggtggtcca 
cttggctcggaatcttatatactttggattttatagtttcagtgagttattaaggctaac 
aagaacacttctggctattttagacattgtacaggcccccatgtcatcatactttgaaag 
attaagcaaatttcaagatggaggaaacaatgtgatgagaaccattcatggggtgggaga 
gatgatgacccagatggtactcagtagaggctccatcttccccatgagcgtgccggatgt 
gccacccagcatccacccgagcaagcaagggagccccaccgagcatgaggatgtgactgt 
gatggacaccaagctgaagatcattgagattttgcagtttatcctgagtgtcagactgga 
ttataggatctcatatatgctgtcaatatataagaaggagtttggagaggacaatgacaa 
tgcggagacatctgccagtggatctccagacactttactaccatcagctattgttcctga 
tatagatgaaattgcagctcaggcagaaactatgtttgcgggaagaaaagaaaaaaatcc 
agttcaacttgacgatgaaggaggcaggacgtttttacgggtcctcattcatctgatcat 
gcacgactacccgcctttgctgtctggagccctgcagctgttgtttaagcacttcagcca 
gagggcagaggttttacaggcatttaagcaggtgcaattactggtgtctaatcaagacgt 
agataactacaagcaaatcaaggcagatctagaccagcttcgactgacagtagaaaagtc 
tgagctatgggtggagaagagcagcaactatgagaatggagaaataggggaaagtcaagt 
gaaaggtggtgaagagccaattgaggaatcaaacattttaagtccagtgcaggatggaac 
aaagaaacctcagattgacagcaacaagagcaataactaccggattgtaaaggagatttt 
gatcaggctaagtaaactctgtgtgcagaataaaaagtgtcggaatcaacatcaacgatt 
actgaaaaatatgggggcgcattcggtggtgttggatcttctgcagataccctatgaaaa 
gaatgatgaaaagatgaatgaagtaatgaatctagcccatacatttctgcagaatttctg 
tcgaggaaatccacagaatcaagttcttcttcataaacatctgaatttgtttttaactcc 
aggtctccttgaagcagaaaccatgcggcacatcttcatgaacaattaccatctgtgcaa 
cgaaattagcgagagagttgtacaacactttgtgcactgcattgagacacatggccgcca 
cgtggagtacctgaggtttttgcaaacaattgtaaaagcagatggtaaatatgtgaagaa 
atgccaggatatggtaatgacagagttgataaatgggggtgaagacgtgctgatatttta 
caatgatagagcatcatttccaatccttctccatatgatgtgttcagagagagaccgagg 
ggatgagagtggccccttagcctaccacatcaccctggtggagttgctggcagcatgcac 
agaggggaaaaatgtctacactgaaatcaagtgtaattcccttctcccgctggacgacat 
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agtgagggtggtgacccatgacgactgcatccctgaggttaaaattgcttatgtgaactt 
tgttaatcactgttatgttgacactgaagtggaaatgaaagaaatctatacaagtaacca 
catttggaaattatttgagaacttcttggtggatatggcaagggtttgcaacacaactac 
agacaggaaacatgcagacatctttttggaaaagtgtgttactgagtcaataatgaatat 
tgtgagcggcttctttaattctcccttttcagacaatagtaccagcctccagacacatca 
gccagtttttattcagctactgcaatctgccttcagaatttacaattgcacctggccaaa 
cccagcgcagaaagcctcagtggaatcctgtatcagaactttggctgaagtggcaaaaaa 
tcgtggaattgccattccagtggatttggacagccaagttaatactcttttcatgaagag 
ccattcaaatatggtgcagagagcagcaatgggttggagactatcagctcgctctgggcc 
acgctttaaggaagctcttggagggcctgcttgggattacagaaatattattg'aaaagtt 
acaggatgtagtggcctccttggagcaccagttcagcccaatgatgcaggctgaattctc 
agtgttggttgatgtattgtacagtccagaactgctgttccctgagggaagcgatgcaag 
aataagatgtggcgctttcatgtcgaagttgattaatcatacaaagaaactaatggagaa 
agaagaaaaactgtgcattaaaattcttcagacattacgagaaatgttagagaagaaaga 
cagctttgtggaagagggtaacacattaagaaagatacttctgaatcgatactttaaagg 
tgattatagtattggtgtgaatggacacctatcaggagcctactccaaaactgcacaggt 
gggaggaagcttttctggacaagattcagataagatggggatatcaatgtcagacattca 
gtgtctgctggataaagaaggtgcatcagaacttgtcatcgatgttatagtgaacaccaa 
aaatgacagaattttttcagaaggcattttcctcggcattgccttgcttgaaggaggaaa 
tacacaaacacagtattctttctaccagcagttgcatgaacaaaaaaagtcagaaaaatt 
ctttaaagttctctatgatcgaatgaaggctgctcagaaagaaataagatcaacagtgac 
agttaataccatagatttaggtaacaaaaaaagggacgatgacaatgaattgatgacatc 
tggtccacgaatgagagtaagagattcaacactacatttaaaagagggaatgaaagggca 
attaacagaagcttcttcagcaacatccaaagcatattgtgtatacagaagagaaatgga 
tccagaaatagacattatgtgcacaggaccagaagcgggaaacactgaggaaaaatccgc 
agaggaagtaacaatgagtcccgcaattgccatcatgcagccaatactgagatttcttca 
gttactgtgtgagaatcacaaccgggaattgcagaacttcttgaggaatcaaaacaacaa 
aacaaattacaacctagtctgtgagacccttcagtttctggactgcatttgtggaagtac 
aaccggtggcctgggcctgttgggtctctacatcaatgagaagaatgtagcgctggtcaa 
ccagaacctggagagcttgactgagtattgccagggcccttgccatgaaaatcagacctg 
tatcgctacacatgagtctaatgggattgatatcatcattgctttgattctgaatgacat 
aaaccctcttggtaaataccgaatggacctggtgctccagctaaagaacaatgcatctaa 
acttttgctggccattatggaaagcagacatgacagtgagaatgcagaaagaattctttt 
taacatgagacccagagaactggtggatgtgatgaagaatgcctataaccaaggattgga 
atgtgaccatggggatgatgagggtggagatgatggtgtttctccaaaagatgttggaca 
caatatctatattctggcccatcagttggcccgccacaataaactgttgcagcagatgct 
caaaccaggatcggatccagatgaaggagatgaagccttaaagtattatgccaaccacac 
tgcacagattgagattgtccggcatgataggaccatggaacaaatagtttttcctgtccc 
caatatatgtgaatacctcactcgagaatccaagtgccgtgtgttcaatacaactgaaag 
ggatgaacaaggaagtaaagtgaatgactttttccagcaaacagaagatctctacaatga 
aatgaagtggcagaagaaaatcaggaataaccctgcactgttctggttctcgaggcacat 
ctctctctgggggagcatttccttcaacctggctgtgttcatcaatttagctgttgctct 
cttctacccatttggggatgatggagatgaaggtacactttctccattgttctcggttct 
tctttggatagcagttgcgatctgcacatctatgctgtttttcttctccaagcctgtggg 
tattcggccgtttcttgtatcaataatgctcagatcaatatatacaataggtcttgggcc 
tacattaatacttcttggtgcagctaatctttgtaataaaattgtttttctggtgagttt 
tgttggaaatcgtggcacgttcacccgtgggtaccgagcagtcatcctggatatggcctt 
tctctatcacgtggcgtatgtcctggtttgcatgctgggcctttttgtccatgaattctt 
ctatagcttcctgctttttgatttggtgtacagggaagagactttgctgaatgtcataaa 
aagtgtcacacgaaatggccgctctattattctaactgcagtcctggctctcatcctcgt 
ctacctgttttccattattgggttcctttttttgaaggatgacttcactatggaagttga 
taggctgaaaaaccgaactcctgttacaggcagtcatcaagtgcctactatgactttaac 
taccatgatggaagcatgtgccaaggagaactgttcacccacaattccagcttcaaatac 
agctgatgaagagtatgaagatggaattgaaaggacgtgtgacactctccttatgtgcat 
tgtcaccgtgctgaaccagggcctcaggaatggcggtggtgtgggggatgtgctaagaag 
gccatcgaaagatgagcccttgtttgctgcccgagtggtttatgaccttcttttttattt 
cattgttatcattattgttctgaacttgatttttggtgttatcatcgatacttttgctga 
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tctcagaagcgaaaaacagaaaaaagaagaaattctaaagacaacttgtttcatctgtgg 
acttgagagagacaagtttgataataaaacggtttcatttgaggagcacattaagtcaga 
acacaatatgtggcattatttgtacttcatagtcctggtgaaagttaaagacccaacaga 
atacactggacctgaaagttatgtggctcaaatgattgtggagaagaatttggattggtt 
tcctcggatgcgagccatgtccctcgttagcaatgaaggcgacagtgagcaaaatgaaat 
tcggagccttcaggagaagttggaatcgaccatgagtctggtcaaacagctgtcgggtca 
gctggcggagctcaaggagcagatgacagaacaaaggaagaataagcagagactgggctt 
cctcggatcaaacacaccccatgtgaatcatcacatgccaccacactgataccatggggg 
gaagccgtgactagcctttcatcagtgtcctgcctgatcactgaataaagaactgagatg 
gaggggagtgaacagtgcctattgttgaaaagttaaaaacaaccaagtgccaagatgttg 
agtgggttagctccgagaacaatttataactgtgttttcatggttgcgaagacctaacct 
caaatgcatctgctagaaagcgtacatcacacattcgcaatgcatcaggaagaaaaggct 
tgcccaaaaggctggagagggcagggagcggcaggatggaaggagacacggggcagggag 
aactctcttctgctaaatcgataggagtcagttttgtcttaaatgctgactacagccact 
gacatggttggctggaatttctttcttttaattgtggcatataggtttgtgacacaagaa 
gtcatactttggtggctaagttttactaaggaaaataactgaaaagattaaaagtgagag 
ctgaaaagagaaatgataatgcttccaaactgtagctgtcacagggcaatttctttattt 
ataacatgaagcacaatggatttacagctctaggaacttagtactttggagcttttgcct 
ctcacactgacaacataacaggatgtgattgccttctctgggattcagacaggctctgtc 
aatgtggagcacaaaaggagattttcatataacttgttaaaaacatgttctaagtcatgt 
ataggctaagattttaagaagatctgggggaataaaaagccaacagtaagccccaggaaa 
gggttttttgagaccatatgtatgctattaaaatatattgagattaatacatgaaaatgc 
ttaaaagtgataggaattattgagaaacttatgatggtggcattgccttttataaacatg 
gagtgaaagccatttgactcaacgtttgctgtgcttaaagaattgcttcaggacccgagc 
gtttttatgtatgctgttcctgcagttaggaaaaaaaaatccaagaaatgtattgaatac 
tcaagaaaatgccacatttcaattacatatttaaaactgtatctgtaagggcttttcaaa 
atgtagcaagataaaaatctcttattcaaattgtttttgtattaaatcatgcactcagaa 
tttgtcgagggagagaataacctggtgtgttcaggttattcttagagactacacatttgg 
aatagcagagcaaaatatggataatgaaagtgttgggaaaaaagtgatgtgacagggagt 
gaagacacttgataccagactctgggagatactctgcaagttgacctggcctctccccca 
caggaacaaaacactgcctccagagtctttaaattctcagttatcaacgccaaggtttaa 
ggtctagatagggtttgctatagggtttgctctgacaatttttaaagcttttggattgtt 
ctaacatagcttaagctattggttcctaaaaatcccaaatcaagatctatgtagaatata 
aaggaagcctgaaccaatccttccacatactgttaagatgtagacttggaacaaagctgt 
tgggacccagagcaatgaatttttgaactgaagctactggtactccccagcaccacctca 
tactaagaattcctcactctgacatgacagtattttttctgaccaggagctgaaagaccc 
tgacattcatgatccaaagatataagaaattttgatatgtctgcatgcagcacagaactt 
tgaacctaggaggcagtcaataaatacatgagaaattcagctgtcattcagttactctta 
attctttataaaactttaaatgataccacatatttgtttgtttaaaatggctttcccaaa 
atcaaagtagactaaagcagccatctttaaaatccaggatcatcaatgctattaacagta 
tcaggtaaaaagtacactttaaaatattttaatcaggcagagttttatggatagagaata 
gaagagaaaggtagtaaatattgaacatattccaatataggaacctatctctgttttagt 
acaaaatatttctgacatctgaactagaggtcaagagaataaattcatttgtatacatct 
gagcaacctgtctttcagatgataaagtatctagccttttctgacaccataatagttcat 
tttgtagggaataagccattaggtgtatataattgctttctagaaatgacctaatgtccc 
caaccactttgtagtggcagatcactgtttcacagcatattttctcccaaggaaagtatt 
caaaagagactgcaactaacaagactcttatttcatcaaaatttaaatatttctgagttg 
tatttttaatgcctctttcttttctgcctaaatgcttagaaattataaagcaaaaaaaaa 
aaaaaaaacagcaacaaaaaatcgaagcagacaaaaaaggcacttttcagaacatcaaat 
tcctaatgaagaagaggagaagataatggggaaatttgacatttgatataaatttatatt 
tgttatgtgtatgtttgttaatgcaactggaatatttgacttaggtgagtatcagttaac 
atccttgtatttatatagtgacatcaaaataaatgcaatcatcttgaactttgatgttaa 
gggagattttgaaaaaatatttagttatttcaaattcatattggttcaaaagtatcagtt 
tctgctgaataatatttttaatttcaaaagggttttgttccctctgtgctccattctgag 
gctcaagagcttaatgcaagtatgttttctgagttaaaataacacttttagatgagaaaa 
tgcttgtatcatacagggctataatataaataaataaattgtatgtatgcaaaatttatc 
atattttctaccccttaaaaaattaagttagaaataagcttattttcttgcacatcaata 
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tttttctgttggtaaagaggacacaatttctagtagattgttaataacatcaggaagata 
tttctttcgtcagaactaattgtgtgcttattcccatattctcagctcataactccctgt 
tttggctgctctctttattttacaattgtttcattgcaaacagcaaggcagtatagccca 
cactcagccactagcccccagtccccaggactgagaatgagtggggaggctggggagttg 
gtgagagaaggtggagatagggatttctgcttcccagtactctatcattagagaaaagga 
ggggtaaatattagctaagaagaagaaaaaagcttattcttcagggacccagtagattgg 
acgtaagcaagtaaatattgcacaggcatcatgatgtaatagttgatggaccccagagat 
acccctcacccatgcagagaacaaagctgtgtggaagagcaactaccataagatggtgtg 
ttccttccagcggctgttaaagccatcctgaataacaaatcagctattcctacctacaga 
cccattccaatatgctgtgatacacgacacggtgcaccgcagaggagacagacatcttca 
gcacaaacacagatgcctgcaggagcttggcaagtcactaaattgcatgaaattgtgagg 
tgcacaccaaatgcagggatgggggaggccgtgggaagctctgtattctcaaaatttgac 
agctaattccggtttttagaaaatgcctgaggccagtagaggccctctagtcactcactg 
ctgctgtttctgatataattattgagaaagctatctcacttaatagaagaaaacacgcac 
tatcaaaaccagatagcctaacgtgcatgtgaaaatcgagaaagctgaaaacaaatccag 
gtacccttctctgaactggagtgtttccacagacttgaataatttatgaaattatcacac 
cagtatttctcatatcaccaagaagactttctctcctgcagtagaggattgttatatttg 
cctaaaaaacacgattccaatatatgacaagggcagataatttataagtgaatgttaata 
aaattggatgtgtataa 

SEQ ID NO:424. >Z38709_T2 # TY Transcript # LN 11942 # Source Gene: Z38709 
# Encoded protein: Z38709_P11 

gcggccgcacgcgcccgctcccccgccggggtcccccgccgcccctccagcgccgcccag 
cgtcgcgtccccgcgcgggccggtgcggcggcagcagaggcttcccgggtggcccggcct 
cccgagggtcgccccgccccggcccagctccgcccccggccgctgctccctgattggccg 
cccccccacctctccagggctcctctcctcctcctcgcccaatgggcaccggcacctggt 
ccaaaaattcccctgaggggaccgagggggctcggccacttccagctcccgccggcggct 
gtgacggccgcagcgggtcgcagagcagggagtggacacgagagtggggagcggagaggg 
aaggaggaggaggaggagcaaaggtgttggagagaaaacttcagaaaggaacaggaaacc 
tgccggggagcgcggcggcggctgcggcttctctgggcgcctgggctgcgctcttcgcgg 
ggtgtcccgagctggcgcttggccccggtctggcttccccggcgcgcaccacggctgagc 
gcgacgctcagtggctcgggccgtgcctccgccgtggctcttggccgctgtagtcccgcg 
atccgatccgcttctgccgcggcggctccctggagagagggcggcgagggcgcagggaag 
aagagggacgtccactgtggaacatgaagcagcatgaagcagcatgactgagaaaatgtc 
cagcttcctctacataggggacatcgtgtccctgtacgcggagggctcggtcaacggctt 
catcagcaccttggggttagtggatgacagatgtgtggtgcacccagaggccggggacct 
tgccaaccctcccaagaagttcagagactgccttttcaaggtgtgccctatgaacagata 
ttctgcccagaagcaatattggaaagcaaagcaagccaaacaagggaaccacaccgaggc 
agccttgctgaagaaactacagcacgctgcagaactggaacaaaaacaaaatgaatcgga 
gaataagaaactgttgggagaaattgtaaaatacagtaatgttatacaactactgcatat 
aaaaagcaacaaatatcttactgtcaacaagagattacctgctttactggagaagaatgc 
catgcgtgtgtccttggatgctgcaggaaatgaagggtcttggttttatattcatccatt 
ctggaaactgagaagcgagggtgacaatattgttgtaggagataaagttgttttgatgcc 
tgtgaatgcagggcagccactacatgccagcaacatagagcttcttgataacccagggtg 
taaagaggtgaatgctgtcaattgcaacaccagctggaaaatcactttattcatgaaata 
tagttcctatcgagaggatgtattaaaaggaggggacgttgttagattatttcatgcgga 
acaagagaagtttttgacttgtgatgaatatgagaaaaaacagcacattttccttcgtac 
gaccttgcgccaatcagctacttctgctactagttctaaagcactctgggaaatagaggt 
ggttcatcatgacccatgccgtgggggtgcaggacagtggaacagcttgttcagatttaa 
gcatcttgcaactggaaactatttagctgcagagcttaatcctgattatcgagatgccca 
aaatgaaggaaaaaatgtgagagatggagtccctccaacttcaaagaaaaaacgccaggc 
aggggagaagatcatgtatactttggtttcagtcccgcatggcaatgacattgcatccct 
ttttgaactagatgccacaactcttcagagagctgactgcctggttccaaggaactcata 
tgttcggttaaggcatttatgcaccaacacatgggtaaccagtactagtatccccataga 
cacagatgaagagaggcctgttatgttaaagattggaacctgccaaaccaaagaagataa 
agaagcgttcgcaatcgtgtctgttccactgtctgaagttcgagacttagactttgccaa 
tgatgccaataaagtactagcgaccacagttaaaaagctagaaaacggcacaataactca 



280/365 



Figure 1 (Cont'd) 



gaatgaaaggaggtttgtaaccaaattattggaagatctcatattctttgttgctgatgt 
gcctaataatggacaagaagttctggatgtggttatcactaagccaaaccgagagcgtca 
aaaattgatgagggaacaaaacatactggcacaggtatttggaattcttaaagcaccctt 
taaagagaaagcaggagaaggctcgatgctgagacttgaagatctgggggatcaaagata 
tgcaccctacaagtacatgctgcggctctgttaccgcgtcctgagacactcgcagcagga 
ttaccggaaaaatcaggaatatattgctaagaatttctgtgtcatgcagtcccagattgg 
ctatgatattttggcagaagatactatcacagctttgttgcacaacaacagaaaactact 
agagaaacatatcacagcaaaagaaatagaaacatttgtcagtttactcaggagaaatcg 
ggagccaaggtttttggattatttgtcagatctgtgtgtgtctaataccactgctatccc 
tgtaactcaagaactcatctgtaaatttatgttgagtccaggcaatgcagacattctcat 
tcaaactaaggtggtctcaatgcaagcagacaaccccatggagagctccatcctttcaga 
tgacattgatgatgaagaagtttggctctattggattgacagcaacaaggaacctcatgg 
caaagctatcaggcaccttgctcaagaggcaaaagaaggcaccaaagctgacttagaagt 
tcttacctattacaggtaccagctaaacctctttgcaaggatgtgcttggatcgccagta 
tctggccataaaccagatttctacacagctgtctgtagacctgatcctgcggtgtgtgtc 
ggatgagagcctgccgttcgacctccgagcgtccttctgtcgcctcatgctccacatgca 
cgttgaccgggatccccaggagtccgtggtgcctgttcgctatgccaggctctggacaga 
aatccccacaaagatcacaattcatgaatatgattctataacagactcttccagaaatga 
tatgaagaggaaatttgccctgacaatggaatttgttgaagaatatttgaaagaagttgt 
aaaccagccctttccttttggggataaagaaaaaaataaactgacatttgaggtggtcca 
cttggctcggaatcttatatactttggattttatagtttcagtgagttattaaggctaac 
aagaacacttctggctattttagacattgtacaggcccccatgtcatcatactttgaaag 
attaagcaaatttcaagatggaggaaacaatgtgatgagaaccattcatggggtgggaga 
gatgatgacccagatggtactcagtagaggctccatcttccccatgagcgtgccggatgt 
gccacccagcatccacccgagcaagcaagggagccccaccgagcatgaggatgtgactgt 
gatggacaccaagctgaagatcattgagattttgcagtttatcctgagtgtcagactgga 
ttataggatctcatatatgctgtcaatatataagaaggagtttggagaggacaatgacaa 
tgcggagacatctgccagtggatctccagacactttactaccatcagctattgttcctga 
tatagatgaaattgcagctcaggcagaaactatgtttgcgggaagaaaagaaaaaaatcc 
agttcaacttgacgatgaaggaggcaggacgtttttacgggtcctcattcatctgatcat 
gcacgactacccgcctttgctgtctggagccctgcagctgttgtttaagcacttcagcca 
gagggcagaggttttacaggcatttaagcaggtgcaattactggtgtctaatcaagacgt 
agataactacaagcaaatcaaggcagatctagaccagcttcgactgacagtagaaaagtc 
tgagctatgggtggagaagagcagcaactatgagaatggagaaataggggaaagtcaagt 
gaaaggtggtgaagagccaattgaggaatcaaacattttaagtccagtgcaggatggaac 
aaagaaacctcagattgacagcaacaagagcaataactaccggattgtaaaggagatttt 
gatcaggctaagtaaactctgtgtgcagaataaaaagtgtcggaatcaacatcaacgatt 
actgaaaaatatgggggcgcattcggtggtgttggatcttctgcagataccctatgaaaa 
gaatgatgaaaagatgaatgaagtaatgaatctagcccatacatttctgcagaatttctg 
tcgaggaaatccacagaatcaagttcttcttcataaacatctgaatttgtttttaactcc 
aggtctccttgaagcagaaaccatgcggcacatcttcatgaacaattaccatctgtgcaa 
cgaaattagcgagagagttgtacaacactttgtgcactgcattgagacacatggccgcca 
cgtggagtacctgaggtttttgcaaacaattgtaaaagcagatggtaaatatgtgaagaa 
atgccaggatatggtaatgacagagttgataaatgggggtgaagacgtgctgatatttta 
caatgatagagcatcatttccaatccttctccatatgatgtgttcagagagagaccgagg 
ggatgagagtggccccttagcctaccacatcaccctggtggagttgctggcagcatgcac 
agaggggaaaaatgtctacactgaaatcaagtgtaattcccttctcccgctggacgacat 
agtgagggtggtgacccatgacgactgcatccctgaggttaaaattgcttatgtgaactt 
tgttaatcactgttatgttgacactgaagtggaaatgaaagaaatctatacaagtaacca 
catttggaaattatttgagaacttcttggtggatatggcaagggtttgcaacacaactac 
agacaggaaacatgcagacatctttttggaaaagtgtgttactgagtcaataatgaatat 
tgtgagcggcttctttaattctcccttttcagacaatagtaccagcctccagacacatca 
gccagtttttattcagctactgcaatctgccttcagaatttacaattgcacctggccaaa 
cccagcgcagaaagcctcagtggaatcctgtatcagaactttggctgaagtggcaaaaaa 
tcgtggaattgccattccagtggatttggacagccaagttaatactcttttcatgaagag 
ccattcaaatatggtgcagagagcagcaatgggttggagactatcagctcgctctgggcc 
acgctttaaggaagctcttggagggcctgcttgggattacagaaatattattgaaaagtt 
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acaggatgtagtggcctccttggagcaccagttcagcccaatgatgcaggctgaattctc 
agtgttggttgatgtattgtacagtccagaactgctgttccctgagggaagcgatgcaag 
aataagatgtggcgctttcatgtcgaagttgattaatcatacaaagaaactaatggagaa 
agaagaaaaactgtgcattaaaattcttcagacattacgagaaatgttagagaagaaaga 
cagctttgtggaagagggtaacacattaagaaagatacttctgaatcgatactttaaagg 
tgattatagtattggtgtgaatggacacctatcaggagcctactccaaaactgcacaggt 
gggaggaagcttttctggacaagattcagataagatggggatatcaatgtcagacattca 
gtgtctgctggataaagaaggtgcatcagaacttgtcatcgatgttatagtgaacaccaa 
aaatgacagaattttttcagaaggcattttcctcggcattgccttgcttgaaggaggaaa 
tacacaaacacagtattctttctaccagcagttgcatgaacaaaaaaagtcagaaaaatt 
ctttaaagttctctatgatcgaatgaaggctgctcagaaagaaataagatcaacagtgac 
agttaataccatagatttaggtaacaaaaaaagggacgatgacaatgaattgatgacatc 
tggtccacgaatgagagtaagagattcaacactacatttaaaagagggaatgaaagggca 
attaacagaagcttcttcagcaacatccaaagcatattgtgtatacagaagagaaatgga 
tccagaaatagacattatgtgcacaggaccagaagcgggaaacactgaggaaaaatccgc 
agaggaagtaacaatgagtcccgcaattgccatcatgcagccaatactgagatttcttca 
gttactgtgtgagaatcacaaccgggaattgcagaacttcttgaggaatcaaaacaacaa 
aacaaattacaacctagtctgtgagacccttcagtttctggactgcatttgtggaagtac 
aaccggtggcctgggcctgttgggtctctacatcaatgagaagaatgtagcgctggtcaa 
ccagaacctggagagcttgactgagtattgccagggcccttgccatgaaaatcagacctg 
tatcgctacacatgagtctaatgggattgatatcatcattgctttgattctgaatgacat 
aaaccctcttggtaaataccgaatggacctggtgctccagctaaagaacaatgcatctaa 
acttttgctggccattatggaaagcagacatgacagtgagaatgcagaaagaattctttt 
taacatgagacccagagaactggtggatgtgatgaagaatgcctataaccaaggattgga 
atgtgaccatggggatgatgagggtggagatgatggtgtttctccaaaagatgttggaca 
caatatctatattctggcccatcagttggcccgccacaataaactgttgcagcagatgct 
caaaccaggatcggatccagatgaaggagatgaagccttaaagtattatgccaaccacac 
tgcacagattgagattgtccggcatgataggaccatggaacaaatagtttttcctgtccc 
caatatatgtgaatacctcactcgagaatccaagtgccgtgtgttcaatacaactgaaag 
ggatgaacaaggaagtaaagtgaatgactttttccagcaaacagaagatctctacaatga 
aatgaagtggcagaagaaaatcaggaataaccctgcactgttctggttctcgaggcacat 
ctctctctgggggagcatttccttcaacctggctgtgttcatcaatttagctgttgctct 
cttctacccatttggggatgatggagatgaaggtacactttctccattgttctcggttct 
tctttggatagcagttgcgatctgcacatctatgctgtttttcttctccaagcctgtggg 
tattcggccgtttcttgtatcaataatgctcagatcaatatatacaataggtcttgggcc 
tacattaatacttcttggtgcagctaatctttgtaataaaattgtttttctggtgagttt 
tgttggaaatcgtggcacgttcacccgtgggtaccgagcagtcatcctggatatggcctt 
tctctatcacgtggcgtatgtcctggtttgcatgctgggcctttttgtccatgaattctt 
ctatagcttcctgctttttgatttggtgtacagggaagagactttgctgaatgtcataaa 
aagtgtcacacgaaatggccgctctattattctaactgcagtcctggctctcatcctcgt 
ctacctgttttccattattgggttcctttttttgaaggatgacttcactatggaagttga 
taggctgaaaaaccgaactcctgttacaggcagtcatcaagtgcctactatgactttaac 
taccatgatggaagcatgtgccaaggagaactgttcacccacaattccagcttcaaatac 
agctgatgaagagtatgaagatggaattgaaaggacgtgtgacactctccttatgtgcat 
tgtcaccgtgctgaaccagggcctcaggaatggcggtggtgtgggggatgtgctaagaag 
gccatcgaaagatgagcccttgtttgctgcccgagtggtttatgaccttcttttttattt 
cattgttatcattattgttctgaacttgatttttggtgttatcatcgatacttttgctga 
tctcagaagcgaaaaacagaaaaaagaagaaattctaaagacaacttgtttcatctgtgg 
acttgagagagacaagtttgataataaaacggtttcatttgaggagcacattaagtcaga 
acacaatatgtggcattatttgtacttcatagtcctggtgaaagttaaagacccaacaga 
atacactggacctgaaagttatgtggctcaaatgattgtggagaagaatttggattggtt 
tcctcggatgcgagccatgtccctcgttagcaatgaaggcgacagtgagcaaaatgaaat 
tcggagccttcaggagaagttggaatcgaccatgagtctggtcaaacagctgtcgggtca 
gctggcggagctcaaggagcagatgacagaacaaaggaagaataagcagagactgggctt 
cctcggatcaaacacaccccatgtgaatcatcacatgccaccacactgataccatggggg 
gaagccgtgactagcctttcatcagtgtcctgcctgatcactgaataaagaactgagatg 
gaggggagtgaacagtgcctattgttgaaaagttaaaaacaaccaagtgccaagatgttg 
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agtgggttagctccgagaacaatttataactgtgttttcatggttgcgaagacctaacct 
caaatgcatctgctagaaagcgtacatcacacattcgcaatgcatcaggaagaaaaggct 
tgcccaaaaggctggagagggcagggagcggcaggatggaaggagacacggggcagggag 
aactctcttctgctaaatcgataggagtcagttttgtcttaaatgctgactacagccact 
gacatggttggctggaatttctttcttttaattgtggcatataggtttgtgacacaagaa 
gtcatactttggtggctaagttttactaaggaaaataactgaaaagattaaaagtgagag 
ctgaaaagagaaatgataatgcttccaaactgtagctgtcacagggcaatttctttattt 
ataacatgaagcacaatggatttacagctctaggaacttagtactttggagcttttgcct 
ctcacactgacaacataacaggatgtgattgccttctctgggattcagacaggctctgtc 
aatgtggagcacaaaaggagattttcatataacttgttaaaaacatgttctaagtcatgt 
ataggctaagattttaagaagatctgggggaataaaaagccaacagtaagccccaggaaa 
gggttttttgagaccatatgtatgctattaaaatatattgagattaatacatgaaaatgc 
ttaaaagtgataggaattattgagaaacttatgatggtggcattgccttttataaacatg 
gagtgaaagccatttgactcaacgtttgctgtgcttaaagaattgcttcaggacccgagc 
gtttttatgtatgctgttcctgcagttaggaaaaaaaaatccaagaaatgtattgaatac 
tcaagaaaatgccacatttcaattacatatttaaaactgtatctgtaagggcttttcaaa 
atgtagcaagataaaaatctcttattcaaattgtttttgtattaaatcatgcactcagaa 
tttgtcgagggagagaataacctggtgtgttcaggttattcttagagactacacatttgg 
aatagcagagcaaaatatggataatgaaagtgttgggaaaaaagtgatgtgacagggagt 
gaagacacttgataccagactctgggagatactctgcaagttgacctggcctctccccca 
caggaacaaaacactgcctccagagtctttaaattctcagttatcaacgccaaggtttaa 
ggtctagatagggtttgctatagggtttgctctgacaatttttaaagcttttggattgtt 
ctaacatagcttaagctattggttcctaaaaatcccaaatcaagatctatgtagaatata 
aaggaagcctgaaccaatccttccacatactgttaagatgtagacttggaacaaagctgt 
tgggacccagagcaatgaatttttgaactgaagctactggtactccccagcaccacctca 
tactaagaattcctcactctgacatgacagtattttttctgaccaggagctgaaagaccc 
tgacattcatgatccaaagatataagaaattttgatatgtctgcatgcagcacagaactt 
tgaacctaggaggcagtcaataaatacatgagaaattcagctgtcattcagttactctta 
attctttataaaactttaaatgataccacatatttgtttgtttaaaatggctttcccaaa 
atcaaagtagactaaagcagccatctttaaaatccaggatcatcaatgctattaacagta 
tcaggtaaaaagtacactttaaaatattttaatcaggcagagttttatggatagagaata 
gaagagaaaggtagtaaatattgaacatattccaatataggaacctatctctgttttagt 
acaaaatatttctgacatctgaactagaggtcaagagaataaattcatttgtatacatct 
gagcaacctgtctttcagatgataaagtatctagccttttctgacaccataatagttcat 
tttgtagggaataagccattaggtgtatataattgctttctagaaatgacctaatgtccc 
caaccactttgtagtggcagatcactgtttcacagcatattttctcccaaggaaagtatt 
caaaagagactgcaactaacaagactcttatttcatcaaaatttaaatatttctgagttg 
tatttttaatgcctctttcttttctgcctaaatgcttagaaattataaagcaaaaaaaaa 
aaaaaaaacagcaacaaaaaatcgaagcagacaaaaaaggcacttttcagaacatcaaat 
tcctaatgaagaagaggagaagataatggggaaatttgacatttgatataaatttatatt 
tgttatgtgtatgtttgttaatgcaactggaatatttgacttagacccattccaatatgc 
tgtgatacacgacacggtgcaccgcagaggagacagacatcttcagcacaaacacagatg 
cctgcaggagcttggcaagtcactaaattgcatgaaattgtgaggtgcacaccaaatgca 
gggatgggggaggccgtgggaagctctgtattctcaaaatttgacagctaattccggttt 
ttagaaaatgcctgaggccagtagaggccctctagtcactcactgctgctgtttctgata 
taattattgagaaagctatctcacttaatagaagaaaacacgcactatcaaaaccagata 
gcctaacgtgcatgtgaaaatcgagaaagctgaaaacaaatccaggtacccttctctgaa 
ctggagtgtttccacagacttgaataatttatgaaattatcacaccagtatttctcatat 
caccaagaagactttctctcctgcagtagaggattgttatatttgcctaaaaaacacgat 
tccaatatatgacaagggcagataatttataagtgaatgttaataaaattggatgtgtat 
aa 

SEQ ID NO: 425. >Z38709_T3 # TY Transcript # LN 11694 # Source Gene: Z38709 
# Encoded protein: Z38709_P11 

gcggccgcacgcgcccgctcccccgccggggtcccccgccgcccctccagcgccgcccag 
cgtcgcgtccccgcgcgggccggtgcggcggcagcagaggcttcccgggtggcccggcct 
cccgagggtcgccccgccccggcccagctccgcccccggccgctgctccctgattggccg 



283/365 



Figur 1 (Cont'd) 



cccccccacctctccagggctcctctcctcctcctcgcccaatgggcaccggcacctggt 
ccaaaaattcccctgaggggaccgagggggctcggccacttccagctcccgccggcggct 
gtgacggccgcagcgggtcgcagagcagggagtggacacgagagtggggagcggagaggg 
aaggaggaggaggaggagcaaaggtgttggagagaaaacttcagaaaggaacaggaaacc 
tgccggggagcgcggcggcggctgcggcttctctgggcgcctgggctgcgctcttcgcgg 
ggtgtcccgagctggcgcttggccccggtctggcttccccggcgcgcaccacggctgagc 
gcgacgctcagtggctcgggccgtgcctccgccgtggctcttggccgctgtagtcccgcg 
atccgatccgcttctgccgcggcggctccctggagagagggcggcgagggcgcagggaag 
aagagggacgtccactgtggaacatgaagcagcatgaagcagcatgactgagaaaatgtc 
cagcttcctctacataggggacatcgtgtccctgtacgcggagggctcggtcaacggctt 
catcagcaccttggggttagtggatgacagatgtgtggtgcacccagaggccggggacct 
tgccaaccctcccaagaagttcagagactgccttttcaaggtgtgccctatgaacagata 
ttctgcccagaagcaatattggaaagcaaagcaagccaaacaagggaaccacaccgaggc 
agccttgctgaagaaactacagcacgctgcagaactggaacaaaaacaaaatgaatcgga 
gaataagaaactgttgggagaaattgtaaaatacagtaatgttatacaactactgcatat 
aaaaagcaacaaatatcttactgtcaacaagagattacctgctttactggagaagaatgc 
catgcgtgtgtccttggatgctgcaggaaatgaagggtcttggttttatattcatccatt 
ctggaaactgagaagcgagggtgacaatattgttgtaggagataaagttgttttgatgcc 
tgtgaatgcagggcagccactacatgccagcaacatagagcttcttgataacccagggtg 
taaagaggtgaatgctgtcaattgcaacaccagctggaaaatcactttattcatgaaata 
tagttcctatcgagaggatgtattaaaaggaggggacgttgttagattatttcatgcgga 
acaagagaagtttttgacttgtgatgaatatgagaaaaaacagcacattttccttcgtac 
gaccttgcgccaatcagctacttctgctactagttctaaagcactctgggaaatagaggt 
ggttcatcatgacccatgccgtgggggtgcaggacagtggaacagcttgttcagatttaa 
gcatcttgcaactggaaactatttagctgcagagcttaatcctgattatcgagatgccca 
aaatgaaggaaaaaatgtgagagatggagtccctccaacttcaaagaaaaaacgccaggc 
aggggagaagatcatgtatactttggtttcagtcccgcatggcaatgacattgcatccct 
ttttgaactagatgccacaactcttcagagagctgactgcctggttccaaggaactcata 
tgttcggttaaggcatttatgcaccaacacatgggtaaccagtactagtatccccataga 
cacagatgaagagaggcctgttatgttaaagattggaacctgccaaaccaaagaagataa 
agaagcgttcgcaatcgtgtctgttccactgtctgaagttcgagacttagactttgccaa 
tgatgccaataaagtactagcgaccacagttaaaaagctagaaaacggcacaataactca 
gaatgaaaggaggtttgtaaccaaattattggaagatctcatattctttgttgctgatgt 
gcctaataatggacaagaagttctggatgtggttatcactaagccaaaccgagagcgtca 
aaaattgatgagggaacaaaacatactggcacaggtatttggaattcttaaagcaccctt 
taaagagaaagcaggagaaggctcgatgctgagacttgaagatctgggggatcaaagata 
tgcaccctacaagtacatgctgcggctctgttaccgcgtcctgagacactcgcagcagga 
ttaccggaaaaatcaggaatatattgctaagaatttctgtgtcatgcagtcccagattgg 
ctatgatattttggcagaagatactatcacagctttgttgcacaacaacagaaaactact 
agagaaacatatcacagcaaaagaaatagaaacatttgtcagtttactcaggagaaatcg 
ggagccaaggtttttggattatttgtcagatctgtgtgtgtctaataccactgctatccc 
tgtaactcaagaactcatctgtaaatttatgttgagtccaggcaatgcagacattctcat 
tcaaactaaggtggtctcaatgcaagcagacaaccccatggagagctccatcctttcaga 
tgacattgatgatgaagaagtttggctctattggattgacagcaacaaggaacctcatgg 
caaagctatcaggcaccttgctcaagaggcaaaagaaggcaccaaagctgacttagaagt 
tcttacctattacaggtaccagctaaacctctttgcaaggatgtgcttggatcgccagta 
tctggccataaaccagatttctacacagctgtctgtagacctgatcctgcggtgtgtgtc 
ggatgagagcctgccgttcgacctccgagcgtccttctgtcgcctcatgctccacatgca 
cgttgaccgggatccccaggagtccgtggtgcctgttcgctatgccaggctctggacaga 
aatccccacaaagatcacaattcatgaatatgattctataacagactcttccagaaatga 
tatgaagaggaaatttgccctgacaatggaatttgttgaagaatatttgaaagaagttgt 
aaaccagccctttccttttggggataaagaaaaaaataaactgacatttgaggtggtcca 
cttggctcggaatcttatatactttggattttatagtttcagtgagttattaaggctaac 
aagaacacttctggctattttagacattgtacaggcccccatgtcatcatactttgaaag 
attaagcaaatttcaagatggaggaaacaatgtgatgagaaccattcatggggtgggaga 
gatgatgacccagatggtactcagtagaggctccatcttccccatgagcgtgccggatgt 
gccacccagcatccacccgagcaagcaagggagccccaccgagcatgaggatgtgactgt 
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Figure 1 (Cont'd) 



gatggacacca.agctgaagatcattgagattttgcagtttatcctgagtgtcagactgga 
ttataggatctcatatatgctgtcaatatataagaaggagtttggagaggacaatgacaa 
tgcggagacatctgccagtggatctccagacactttactaccatcagctattgttcctga 
tatagatgaaattgcagctcaggcagaaactatgtttgcgggaagaaaagaaaaaaatcc 
agttcaacttgacgatgaaggaggcaggacgtttttacgggtcctcattcatctgatcat 
gcacgactacccgcctttgctgtctggagccctgcagctgttgtttaagcacttcagcca 
gagggcagaggttttacaggcatttaagcaggtgcaattactggtgtctaatcaagacgt 
agataactacaagcaaatcaaggcagatctagaccagcttcgactgacagtagaaaagtc 
tgagctatgggtggagaagagcagcaactatgagaatggagaaataggggaaagtcaagt 
gaaaggtggtgaagagccaattgaggaatcaaacattttaagtccagtgcaggatggaac 
aaagaaacctcagattgacagcaacaagagcaataactaccggattgtaaaggagatttt 
gatcaggctaagtaaactctgtgtgcagaataaaaagtgtcggaatcaacatcaacgatt 
actgaaaaatatgggggcgcattcggtggtgttggatcttctgcagataccctatgaaaa 
gaatgatgaaaagatgaatgaagtaatgaatctagcccatacatttctgcagaatttctg 
tcgaggaaatccacagaatcaagttcttcttcataaacatctgaatttgtttttaactcc 
aggtctccttgaagcagaaaccatgcggcacatcttcatgaacaattaccatctgtgcaa 
cgaaattagcgagagagttgtacaacactttgtgcactgcattgagacacatggccgcca 
cgtggagtacctgaggtttttgcaaacaattgtaaaagcagatggtaaatatgtgaagaa 
atgccaggatatggtaatgacagagttgataaatgggggtgaagacgtgctgatatttta 
caatgatagagcatcatttccaatccttctccatatgatgtgttcagagagagaccgagg 
ggatgagagtggccccttagcctaccacatcaccctggtggagttgctggcagcatgcac 
agaggggaaaaatgtctacactgaaatcaagtgtaattcccttctcccgctggacgacat 
agtgagggtggtgacccatgacgactgcatccctgaggttaaaattgcttatgtgaactt 
tgttaatcactgttatgttgacactgaagtggaaatgaaagaaatctatacaagtaacca 
catttggaaattatttgagaacttcttggtggatatggcaagggtttgcaacacaactac 
agacaggaaacatgcagacatctttttggaaaagtgtgttactgagtcaataatgaatat 
tgtgagcggcttctttaattctcccttttcagacaatagtaccagcctccagacacatca 
gccagtttttattcagctactgcaatctgccttcagaatttacaattgcacctggccaaa 
cccagcgcagaaagcctcagtggaatcctgtatcagaactttggctgaagtggcaaaaaa 
tcgtggaattgccattccagtggatttggacagccaagttaatactcttttcatgaagag 
ccattcaaatatggtgcagagagcagcaatgggttggagactatcagctcgctctgggcc 
acgctttaaggaagctcttggagggcctgcttgggattacagaaatattattgaaaagtt 
acaggatgtagtggcctccttggagcaccagttcagcccaatgatgcaggctgaattctc 
agtgttggttgatgtattgtacagtccagaactgctgttccctgagggaagcgatgcaag 
aataagatgtggcgctttcatgtcgaagttgattaatcatacaaagaaactaatggagaa 
agaagaaaaactgtgcattaaaattcttcagacattacgagaaatgttagagaagaaaga 
cagctttgtggaagagggtaacacattaagaaagatacttctgaatcgatactttaaagg 
tgattatagtattggtgtgaatggacacctatcaggagcctactccaaaactgcacaggt 
gggaggaagcttttctggacaagattcagataagatggggatatcaatgtcagacattca 
gtgtctgctggataaagaaggtgcatcagaacttgtcatcgatgttatagtgaacaccaa 
aaatgacagaattttttcagaaggcattttcctcggcattgccttgcttgaaggaggaaa 
tacacaaacacagtattctttctaccagcagttgcatgaacaaaaaaagtcagaaaaatt 
ctttaaagttctctatgatcgaatgaaggctgctcagaaagaaataagatcaacagtgac 
agttaataccatagatttaggtaacaaaaaaagggacgatgacaatgaattgatgacatc 
tggtccacgaatgagagtaagagattcaacactacatttaaaagagggaatgaaagggca 
attaacagaagcttcttcagcaacatccaaagcatattgtgtatacagaagagaaatgga 
tccagaaatagacattatgtgcacaggaccagaagcgggaaacactgaggaaaaatccgc 
agaggaagtaacaatgagtcccgcaattgccatcatgcagccaatactgagatttcttca 
gttactgtgtgagaatcacaaccgggaattgcagaacttcttgaggaatcaaaacaacaa 
aacaaattacaacctagtctgtgagacccttcagtttctggactgcatttgtggaagtac 
aaccggtggcctgggcctgttgggtctctacatcaatgagaagaatgtagcgctggtcaa 
ccagaacctggagagcttgactgagtattgccagggcccttgccatgaaaatcagacctg 
tatcgctacacatgagtctaatgggattgatatcatcattgctttgattctgaatgacat 
aaaccctcttggtaaataccgaatggacctggtgctccagctaaagaacaatgcatctaa 
acttttgctggccattatggaaagcagacatgacagtgagaatgcagaaagaattctttt 
taacatgagacccagagaactggtggatgtgatgaagaatgcctataaccaaggattgga 
atgtgaccatggggatgatgagggtggagatgatggtgtttctccaaaagatgttggaca 
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Figur 1 (Confd) 



caatatctatattctggcccatcagttggcccgccacaataaactgttgcagcagatgct 
caaaccaggatcggatccagatgaaggagatgaagccttaaagtattatgccaaccacac 
tgcacagattgagattgtccggcatgataggaccatggaacaaatagtttttcctgtccc 
caatatatgtgaatacctcactcgagaatccaagtgccgtgtgttcaatacaactgaaag 
ggatgaacaaggaagtaaagtgaatgactttttccagcaaacagaagatctctacaatga 
aatgaagtggcagaagaaaatcaggaataaccctgcactgttctggttctcgaggcacat 
ctctctctgggggagcatttccttcaacctggctgtgttcatcaatttagctgttgctct 
cttctacccatttggggatgatggagatgaaggtacactttctccattgttctcggttct 
tctttggatagcagttgcgatctgcacatctatgctgtttttcttctccaagcctgtggg 
tattcggccgtttcttgtatcaataatgctcagatcaatatatacaataggtcttgggcc 
tacattaatacttcttggtgcagctaatctttgtaataaaattgtttttctggtgagttt 
tgttggaaatcgtggcacgttcacccgtgggtaccgagcagtcatcctggatatggcctt 
tctctatcacgtggcgtatgtcctggtttgcatgctgggcctttttgtccatgaattctt 
ctatagcttcctgctttttgatttggtgtacagggaagagactttgctgaatgtcataaa 
aagtgtcacacgaaatggccgctctattattctaactgcagtcctggctctcatcctcgt 
ctacctgttttccattattgggttcctttttttgaaggatgacttcactatggaagttga 
taggctgaaaaaccgaactcctgttacaggcagtcatcaagtgcctactatgactttaac 
taccatgatggaagcatgtgccaaggagaactgttcacccacaattccagcttcaaatac 
agctgatgaagagtatgaagatggaattgaaaggacgtgtgacactctccttatgtgcat 
tgtcaccgtgctgaaccagggcctcaggaatggcggtggtgtgggggatgtgctaagaag 
gccatcgaaagatgagcccttgtttgctgcccgagtggtttatgaccttcttttttattt 
cattgttatcattattgttctgaacttgatttttggtgttatcatcgatacttttgctga 
tctcagaagcgaaaaacagaaaaaagaagaaattctaaagacaacttgtttcatctgtgg 
acttgagagagacaagtttgataataaaacggtttcatttgaggagcacattaagtcaga 
acacaatatgtggcattatttgtacttcatagtcctggtgaaagttaaagacccaacaga 
atacactggacctgaaagttatgtggctcaaatgattgtggagaagaatttggattggtt 
tcctcggatgcgagccatgtccctcgttagcaatgaaggcgacagtgagcaaaatgaaat 
tcggagccttcaggagaagttggaatcgaccatgagtctggtcaaacagctgtcgggtca 
gctggcggagctcaaggagcagatgacagaacaaaggaagaataagcagagactgggctt 
cctcggatcaaacacaccccatgtgaatcatcacatgccaccacactgataccatggggg 
gaagccgtgactagcctttcatcagtgtcctgcctgatcactgaataaagaactgagatg 
gaggggagtgaacagtgcctattgttgaaaagttaaaaacaaccaagtgccaagatgttg 
agtgggttagctccgagaacaatttataactgtgttttcatggttgcgaagacctaacct 
caaatgcatctgctagaaagcgtacatcacacattcgcaatgcatcaggaagaaaaggct 
tgcccaaaaggctggagagggcagggagcggcaggatggaaggagacacggggcagggag 
aactctcttctgctaaatcgataggagtcagttttgtcttaaatgctgactacagccact 
gacatggttggctggaatttctttcttttaattgtggcatataggtttgtgacacaagaa 
gtcatactttggtggctaagttttactaaggaaaataactgaaaagattaaaagtgagag 
ctgaaaagagaaatgataatgcttccaaactgtagctgtcacagggcaatttctttattt 
ataacatgaagcacaatggatttacagctctaggaacttagtactttggagcttttgcct 
ctcacactgacaacataacaggatgtgattgccttctctgggattcagacaggctctgtc 
aatgtggagcacaaaaggagattttcatataacttgttaaaaacatgttctaagtcatgt 
ataggctaagattttaagaagatctgggggaataaaaagccaacagtaagccccaggaaa 
gggttttttgagaccatatgtatgctattaaaatatattgagattaatacatgaaaatgc 
ttaaaagtgataggaattattgagaaacttatgatggtggcattgccttttataaacatg 
gagtgaaagccatttgactcaacgtttgctgtgcttaaagaattgcttcaggacccgagc 
gtttttatgtatgctgttcctgcagttaggaaaaaaaaatccaagaaatgtattgaatac 
tcaagaaaatgccacatttcaattacatatttaaaactgtatctgtaagggcttttcaaa 
atgtagcaagataaaaatctcttattcaaattgtttttgtattaaatcatgcactcagaa 
tttgtcgagggagagaataacctggtgtgttcaggttattcttagagactacacatttgg 
aatagcagagcaaaatatggataatgaaagtgttgggaaaaaagtgatgtgacagggagt 
gaagacacttgataccagactctgggagatactctgcaagttgacctggcctctccccca 
caggaacaaaacactgcctccagagtctttaaattctcagttatcaacgccaaggt tt aa 
ggtctagatagggtttgctatagggtttgctctgacaatttttaaagcttttggattgtt 
ctaacatagcttaagctattggttcctaaaaatcccaaatcaagatctatgtagaatata 
aaggaagcctgaaccaatccttccacatactgttaagatgtagacttggaacaaagctgt 
tgggacccagagcaatgaatttttgaactgaagctactggtactccccagcaccacctca 
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Figur 1 (Cont'd) 



tactaagaattcctcactctgacatgacagtattttttctgaccaggagctgaaagaccc 
tgacattcatgatccaaagatataagaaattttgatatgtctgcatgcagcacagaactt 
tgaacctaggaggcagtcaataaatacatgagaaattcagctgtcattcagttactctta 
attctttataaaactttaaatgataccacatatttgtttgtttaaaatggctttcccaaa 
atcaaagtagactaaagcagccatctttaaaatccaggatcatcaatgctattaacagta 
tcaggtaaaaagtacactttaaaatattttaatcaggcagagttttatggatagagaata 
gaagagaaaggtagtaaatattgaacatattccaatataggaacctatctctgttttagt 
acaaaatatttctgacatctgaactagaggtcaagagaataaattcatttgtatacatct 
gagcaacctgtctttcagatgataaagtatctagccttttctgacaccataatagttcat 
tttgtagggaataagccattaggtgtatataattgctttctagaaatgacctaatgtccc 
caaccactttgtagtggcagatcactgtttcacagcatattttctcccaaggaaagtatt 
caaaagagactgcaactaacaagactcttatttcatcaaaatttaaatatttctgagttg 
tatttttaatgcctctttcttttctgcctaaatgcttagaaattataaagcaaaaaaaaa 
aaaaaaaacagcaacaaaaaatcgaagcagacaaaaaaggcacttttcagaacatcaaat 
tcctaatgaagaagaggagaagataatggggaaatttgacatttgatataaatttatatt 
tgttatgtgtatgtttgttaatgcaactggaatatttgacttaggtgagtatcagttaac 
atccttgtatttatatagtgacatcaaaataaatgcaatcatcttgaactttgatgttaa 
gggagattttgaaaaaatatttagttatttcaaattcatattggttcaaaagtatcagtt 
tctgctgaataatatttttaatttcaaaagggttttgttccctctgtgctccattctgag 
gctcaagagcttaatgcaagtatgttttctgagttaaaataacacttttagatgagaaaa 
tgcttgtatcatacagggctataatataaataaataaattgtatgtatgcaaaa 

SEQ ID NO: 426. >Z38709_T4 # TY Transcript # LN 12184 # Source Gene: Z38709 
# Encoded protein: Z38709_P11 

gcggccgcacgcgcccgctcccccgccggggtcccccgccgcccctccagcgccgcccag 
cgtcgcgtccccgcgcgggccggtgcggcggcagcagaggcttcccgggtggcccggcct 
cccgagggtcgccccgccccggcccagctccgcccccggccgctgctccctgattggccg 
cccccccacctctccagggctcctctcctcctcctcgcccaatgggcaccggcacctggt 
ccaaaaattcccctgaggggaccgagggggctcggccacttccagctcccgccggcggct 
gtgacggccgcagcgggtcgcagagcagggagtggacacgagagtggggagcggagaggg 
aaggaggaggaggaggagcaaaggtgttggagagaaaacttcagaaaggaacaggaaacc 
tgccggggagcgcggcggcggctgcggcttctctgggcgcctgggctgcgctcttcgcgg 
ggtgtcccgagctggcgcttggccccggtctggcttccccggcgcgcaccacggctgagc 
gcgacgctcagtggctcgggccgtgcctccgccgtggctcttggccgctgtagtcccgcg 
atccgatccgcttctgccgcggcggctccctggagagagggcggcgagggcgcagggaag 
aagagggacgtccactgtggaacatgaagcagcatgaagcagcatgactgagaaaatgtc 
cagcttcctctacataggggacatcgtgtccctgtacgcggagggctcggtcaacggctt 
catcagcaccttggggttagtggatgacagatgtgtggtgcacccagaggccggggacct 
tgccaaccctcccaagaagttcagagactgccttttcaaggtgtgccctatgaacagata 
ttctgcccagaagcaatattggaaagcaaagcaagccaaacaagggaaccacaccgaggc 
agccttgctgaagaaactacagcacgctgcagaactggaacaaaaacaaaatgaatcgga 
gaataagaaactgttgggagaaattgtaaaatacagtaatgttatacaactactgcatat 
aaaaagcaacaaatatcttactgtcaacaagagattacctgctttactggagaagaatgc 
catgcgtgtgtccttggatgctgcaggaaatgaagggtcttggttttatattcatccatt 
ctggaaactgagaagcgagggtgacaatattgttgtaggagataaagttgttttgatgcc 
tgtgaatgcagggcagccactacatgccagcaacatagagcttcttgataacccagggtg 
taaagaggtgaatgctgtcaattgcaacaccagctggaaaatcactttattcatgaaata 
tagttcctatcgagaggatgtattaaaaggaggggacgttgttagattatttcatgcgga 
acaagagaagtttttgacttgtgatgaatatgagaaaaaacagcacattttccttcgtac 
gaccttgcgccaatcagctacttctgctactagttctaaagcactctgggaaatagaggt 
ggttcatcatgacccatgccgtgggggtgcaggacagtggaacagcttgttcagatttaa 
gcatcttgcaactggaaactatttagctgcagagcttaatcctgattatcgagatgccca 
aaatgaaggaaaaaatgtgagagatggagtccctccaacttcaaagaaaaaacgccaggc 
aggggagaagatcatgtatactttggtttcagtcccgcatggcaatgacattgcatccct 
ttttgaactagatgccacaactcttcagagagctgactgcctggttccaaggaactcata 
tgttcggttaaggcatttatgcaccaacacatgggtaaccagtactagtatccccataga 
cacagatgaagagaggcctgttatgttaaagattggaacctgccaaaccaaagaagataa 
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Figure 1 (Cont'd) 



agaagcgttcgcaatcgtgtctgttccactgtctgaagttcgagacttagactttgccaa 
tgatgccaataaagtactagcgaccacagttaaaaagctagaaaacggcacaataactca 
gaatgaaaggaggtttgtaaccaaattattggaagatctcatattctttgttgctgatgt 
gcctaataatggacaagaagttctggatgtggttatcactaagccaaaccgagagcgtca 
aaaattgatgagggaacaaaacatactggcacaggtatttggaattcttaaagcaccctt 
taaagagaaagcaggagaaggctcgatgctgagacttgaagatctgggggatcaaagata 
tgcaccctacaagtacatgctgcggctctgttaccgcgtcctgagacactcgcagcagga 
ttaccggaaaaatcaggaatatattgctaagaatttctgtgtcatgcagtcccagattgg 
ctatgatattttggcagaagatactatcacagctttgttgcacaacaacagaaaactact 
agagaaacatatcacagcaaaagaaatagaaacatttgtcagtttactcaggagaaatcg 
ggagccaaggtttttggattatttgtcagatctgtgtgtgtctaataccactgctatccc 
tgtaactcaagaactcatctgtaaatttatgttgagtccaggcaatgcagacattctcat 
tcaaactaaggtggtctcaatgcaagcagacaaccccatggagagctccatcctttcaga 
tgacattgatgatgaagaagtttggctctattggattgacagcaacaaggaacctcatgg 
caaagctatcaggcaccttgctcaagaggcaaaagaaggcaccaaagctgacttagaagt 
tcttacctattacaggtaccagctaaacctctttgcaaggatgtgcttggatcgccagta 
tctggccataaaccagatttctacacagctgtctgtagacctgatcctgcggtgtgtgtc 
ggatgagagcctgccgttcgacctccgagcgtccttctgtcgcctcatgctccacatgca 
cgttgaccgggatccccaggagtccgtggtgcctgttcgctatgccaggctctggacaga 
aatccccacaaagatcacaattcatgaatatgattctataacagactcttccagaaatga 
tatgaagaggaaatttgccctgacaatggaatttgttgaagaatatttgaaagaagttgt 
aaaccagccctttccttttggggataaagaaaaaaataaactgacatttgaggtggtcca 
cttggctcggaatcttatatactttggattttatagtttcagtgagttattaaggctaac 
aagaacacttctggctattttagacattgtacaggcccccatgtcatcatactttgaaag 
attaagcaaatttcaagatggaggaaacaatgtgatgagaaccattcatggggtgggaga 
gatgatgacccagatggtactcagtagaggctccatcttccccatgagcgtgccggatgt 
gccacccagcatccacccgagcaagcaagggagccccaccgagcatgaggatgtgactgt 
gatggacaccaagctgaagatcattgagattttgcagtttatcctgagtgtcagactgga 
ttataggatctcatatatgctgtcaatatataagaaggagtttggagaggacaatgacaa 
tgcggagacatctgccagtggatctccagacactttactaccatcagctattgttcctga 
tatagatgaaattgcagctcaggcagaaactatgtttgcgggaagaaaagaaaaaaatcc 
agttcaacttgacgatgaaggaggcaggacgtttttacgggtcctcattcatctgatcat 
gcacgactacccgcctttgctgtctggagccctgcagctgttgtttaagcacttcagcca 
gagggcagaggttttacaggcatttaagcaggtgcaattactggtgtctaatcaagacgt 
agataactacaagcaaatcaaggcagatctagaccagcttcgactgacagtagaaaagtc 
tgagctatgggtggagaagagcagcaactatgagaatggagaaataggggaaagtcaagt 
gaaaggtggtgaagagccaattgaggaatcaaacattttaagtccagtgcaggatggaac 
aaagaaacctcagattgacagcaacaagagcaataactaccggattgtaaaggagatttt 
gatcaggctaagtaaactctgtgtgcagaataaaaagtgtcggaatcaacatcaacgatt 
actgaaaaatatgggggcgcattcggtggtgttggatcttctgcagataccctatgaaaa 
gaatgatgaaaagatgaatgaagtaatgaatctagcccatacatttctgcagaatttctg 
tcgaggaaatccacagaatcaagttcttcttcataaacatctgaatttgtttttaactcc 
aggtctccttgaagcagaaaccatgcggcacatcttcatgaacaattaccatctgtgcaa 
cgaaattagcgagagagttgtacaacactttgtgcactgcattgagacacatggccgcca 
cgtggagtacctgaggtttttgcaaacaattgtaaaagcagatggtaaatatgtgaagaa 
atgccaggatatggtaatgacagagttgataaatgggggtgaagacgtgctgatatttta 
caatgatagagcatcatttccaatccttctccatatgatgtgttcagagagagaccgagg 
ggatgagagtggccccttagcctaccacatcaccctggtggagttgctggcagcatgcac 
agaggggaaaaatgtctacactgaaatcaagtgtaattcccttctcccgctggacgacat 
agtgagggtggtgacccatgacgactgcatccctgaggttaaaattgcttatgtgaactt 
tgttaatcactgttatgttgacactgaagtggaaatgaaagaaatctatacaagtaacca 
catttggaaattatttgagaacttcttggtggatatggcaagggtttgcaacacaactac 
agacaggaaacatgcagacatctttttggaaaagtgtgttactgagtcaataatgaatat 
tgtgagcggcttctttaattctcccttttcagacaatagtaccagcctccagacacatca 
gccagtttttattcagctactgcaatctgccttcagaatttacaattgcacctggccaaa 
cccagcgcagaaagcctcagtggaatcctgtatcagaactttggctgaagtggcaaaaaa 
tcgtggaattgccattccagtggatttggacagccaagttaatactcttttcatgaagag 
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Figure 1 (Cont'd) 



ccattcaaatatggtgcagagagcagcaatgggttggagactatcagctcgctctgggcc 
acgctttaaggaagctcttggagggcctgcttgggattacagaaatattattgaaaagtt 
acaggatgtagtggcctccttggagcaccagttcagcccaatgatgcaggctgaattctc 
agtgttggttgatgtattgtacagtccagaactgctgttccctgagggaagcgatgcaag 
aataagatgtggcgctttcatgtcgaagttgattaatcatacaaagaaactaatggagaa 
agaagaaaaactgtgcattaaaattcttcagacattacgagaaatgttagagaagaaaga 
cagctttgtggaagagggtaacacattaagaaagatacttctgaatcgatactttaaagg 
tgattatagtattggtgtgaatggacacctatcaggagcctactccaaaactgcacaggt 
gggaggaagcttttctggacaagattcagataagatggggatatcaatgtcagacattca 
gtgtctgctggataaagaaggtgcatcagaacttgtcatcgatgttatagtgaacaccaa 
aaatgacagaattttttcagaaggcattttcctcggcattgccttgcttgaaggaggaaa 
tacacaaacacagtattctttctaccagcagttgcatgaacaaaaaaagtcagaaaaatt 
ctttaaagttctctatgatcgaatgaaggctgctcagaaagaaataagatcaacagtgac 
agttaataccatagatttaggtaacaaaaaaagggacgatgacaatgaattgatgacatc 
tggtccacgaatgagagtaagagattcaacactacatttaaaagagggaatgaaagggca 
attaacagaagcttcttcagcaacatccaaagcatattgtgtatacagaagagaaatgga 
tccagaaatagacattatgtgcacaggaccagaagcgggaaacactgaggaaaaatccgc 
agaggaagtaacaatgagtcccgcaattgccatcatgcagccaatactgagatttcttca 
gttactgtgtgagaatcacaaccgggaattgcagaacttcttgaggaatcaaaacaacaa 
aacaaattacaacctagtctgtgagacccttcagtttctggactgcatttgtggaagtac 
aaccggtggcctgggcctgttgggtctctacatcaatgagaagaatgtagcgctggtcaa 
ccagaacctggagagcttgactgagtattgccagggcccttgccatgaaaatcagacctg 
tatcgctacacatgagtctaatgggattgatatcatcattgctttgattctgaatgacat 
aaaccctcttggtaaataccgaatggacctggtgctccagctaaagaacaatgcatctaa 
acttttgctggccattatggaaagcagacatgacagtgagaatgcagaaagaattctttt 
taacatgagacccagagaactggtggatgtgatgaagaatgcctataaccaaggattgga 
atgtgaccatggggatgatgagggtggagatgatggtgtttctccaaaagatgttggaca 
caatatctatattctggcccatcagttggcccgccacaataaactgttgcagcagatgct 
caaaccaggatcggatccagatgaaggagatgaagccttaaagtattatgccaaccacac 
tgcacagattgagattgtccggcatgataggaccatggaacaaatagtttttcctgtccc 
caatatatgtgaatacctcactcgagaatccaagtgccgtgtgttcaatacaactgaaag 
ggatgaacaaggaagtaaagtgaatgactttttccagcaaacagaagatctctacaatga 
aatgaagtggcagaagaaaatcaggaataaccctgcactgttctggttctcgaggcacat 
ctctctctgggggagcatttccttcaacctggctgtgttcatcaatttagctgttgctct 
cttctacccatttggggatgatggagatgaaggtacactttctccattgttctcggttct 
tctttggatagcagttgcgatctgcacatctatgctgtttttcttctccaagcctgtggg 
tattcggccgtttcttgtatcaataatgctcagatcaatatatacaataggtcttgggcc 
tacattaatacttcttggtgcagctaatctttgtaataaaattgtttttctggtgagttt 
tgttggaaatcgtggcacgttcacccgtgggtaccgagcagtcatcctggatatggcctt 
tctctatcacgtggcgtatgtcctggtttgcatgctgggcctttttgtccatgaattctt 
ctatagcttcctgctttttgatttggtgtacagggaagagactttgctgaatgtcataaa 
aagtgtcacacgaaatggccgctctattattctaactgcagtcctggctctcatcctcgt 
ctacctgttttccattattgggttcctttttttgaaggatgacttcactatggaagttga 
taggctgaaaaaccgaactcctgttacaggcagtcatcaagtgcctactatgactttaac 
taccatgatggaagcatgtgccaaggagaactgttcacccacaattccagcttcaaatac 
agctgatgaagagtatgaagatggaattgaaaggacgtgtgacactctccttatgtgcat 
tgtcaccgtgctgaaccagggcctcaggaatggcggtggtgtgggggatgtgctaagaag 
gccatcgaaagatgagcccttgtttgctgcccgagtggtttatgaccttcttttttattt 
cattgttatcattattgttctgaacttgatttttggtgttatcatcgatacttttgctga 
tctcagaagcgaaaaacagaaaaaagaagaaattctaaagacaacttgtttcatctgtgg 
acttgagagagacaagtttgataataaaacggtttcatttgaggagcacattaagtcaga 
acacaatatgtggcattatttgtacttcatagtcctggtgaaagttaaagacccaacaga 
atacactggacctgaaagttatgtggctcaaatgattgtggagaagaatttggattggtt 
tcctcggatgcgagccatgtccctcgttagcaatgaaggcgacagtgagcaaaatgaaat 
tcggagccttcaggagaagttggaatcgaccatgagtctggtcaaacagctgtcgggtca 
gctggcggagctcaaggagcagatgacagaacaaaggaagaataagcagagactgggctt 
cctcggatcaaacacaccccatgtgaatcatcacatgccaccacactgataccatggggg 
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gaagccgtgactagcctttcatcagtgtcctgcctgatcactgaataaagaactgagatg 
gaggggagtgaacagtgcctattgttgaaaagttaaaaacaaccaagtgccaagatgttg 
agtgggttagctccgagaacaatttataactgtgttttcatggttgcgaagacctaacct 
caaatgcatctgctagaaagcgtacatcacacattcgcaatgcatcaggaagaaaaggct 
tgcccaaaaggctggagagggcagggagcggcaggatggaaggagacacggggcagggag 
aactctcttctgctaaatcgataggagtcagttttgtcttaaatgctgactacagccact 
gacatggttggctggaatttctttcttttaattgtggcatataggtttgtgacacaagaa 
gtcatactttggtggctaagttttactaaggaaaataactgaaaagattaaaagtgagag 
ctgaaaagagaaatgataatgcttccaaactgtagctgtcacagggcaatttctttattt 
ataacatgaagcacaatggatttacagctctaggaacttagtactttggagcttttgcct 
ctcacactgaeaacataacaggatgtgattgccttctctgggattcagacaggctctgtc 
aatgtggagcacaaaaggagattttcatataacttgttaaaaacatgttctaagtcatgt 
ataggctaagattttaagaagatctgggggaataaaaagccaacagtaagccccaggaaa 
gggttttttgagaccatatgtatgctattaaaatatattgagattaatacatgaaaatgc 
ttaaaagtgataggaattattgagaaacttatgatggtggcattgccttttataaacatg 
gagtgaaagccatttgactcaacgtttgctgtgcttaaagaattgcttcaggacccgagc 
gtttttatgtatgctgttcctgcagttaggaaaaaaaaatccaagaaatgtattgaatac 
tcaagaaaatgccacatttcaattacatatttaaaactgtatctgtaagggcttttcaaa 
atgtagcaagataaaaatctcttattcaaattgtttttgtattaaatcatgcactcagaa 
tttgtcgagggagagaataacctggtgtgttcaggttattcttagagactacacatttgg 
aatagcagagcaaaatatggataatgaaagtgttgggaaaaaagtgatgtgacagggagt 
gaagacacttgataccagactctgggagatactctgcaagttgacctggcctctccccca 
caggaacaaaacactgcctccagagtctttaaattctcagttatcaacgccaaggtttaa 
ggtctagatagggtttgctatagggtttgctctgacaatttttaaagcttttggattgtt 
ctaacatagcttaagctattggttcctaaaaatcccaaatcaagatctatgtagaatata 
aaggaagcctgaaccaatccttccacatactgttaagatgtagacttggaacaaagctgt 
tgggacccagagcaatgaatttttgaactgaagctactggtactccccagcaccacctca 
tactaagaattcctcactctgacatgacagtattttttctgaccaggagctgaaagaccc 
tgacattcatgatccaaagatataagaaattttgatatgtctgcatgcagcacagaactt 
tgaacctaggaggcagtcaataaatacatgagaaattcagctgtcattcagttactctta 
attctttataaaactttaaatgataccacatatttgtttgtttaaaatggctttcccaaa 
atcaaagtagactaaagcagccatctttaaaatccaggatcatcaatgctattaacagta 
tcaggtaaaaagtacactttaaaatattttaatcaggcagagttttatggatagagaata 
gaagagaaaggtagtaaatattgaacatattccaatataggaacctatctctgttttagt 
acaaaatatttctgacatctgaactagaggtcaagagaataaattcatttgtatacatct 
gagcaacctgtctttcagatgataaagtatctagccttttctgacaccataatagttcat 
tttgtagggaataagccattaggtgtatataattgctttctagaaatgacctaatgtccc 
caaccactttgtagtggcagatcactgtttcacagcatattttctcccaaggaaagtatt 
caaaagagactgcaactaacaagactcttatttcatcaaaatttaaatatttctgagttg 
tatttttaatgcctctttcttttctgcctaaatgcttagaaattataaagcaaaaaaaaa 
aaaaaaaacagcaacaaaaaatcgaagcagacaaaaaaggcacttttcagaacatcaaat 
tcctaatgaagaagaggagaagataatggggaaatttgacatttgatataaatttatatt 
tgttatgtgtatgtttgttaatgcaactggaatatttgacttaggtgagtatcagttaac 
atccttgtatttatatagtgacatcaaaataaatgcaatcatcttgaactttgatgttaa 
gggagattttgaaaaaatatttagttatttcaaattcatattggttcaaaagtatcagtt 
tctgctgaataatatttttaatttcaaaagggttttgttccctctgtgctccattctgag 
gctcaagagcttaatgcaagtatgttttctgagttaaaataacacttttagatgagaaaa 
tgcttgtatcatacagggctataatataaataaataaattgtatgtatgcaaaaraaaaa 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 
aacaaaaaggaggaagcgacccaaacaacaaataacaacaacacacaacgagcggcgcac 
cagaaactcaaacgtccccagcaccacacgacacccccgacaccccaagccgcccccgcc 
accaacagtgcgccgcactcgctacccacaaccgcaaccgcgcccgcagcgcacacccta 
caagacccacccctacgcccaacctccgacgacccacgcgccccttctggacgatagctc 
ccacactacccctacgccaccccccaacaagccagaccactcactcacacgtcccttcgc 
atcactccactccctcctcctatcccatcacaatcacttatccgtgcattcacccacaca 
catcgccccgcgcacactgtttcctcccgctcttgccccccaaaacactatccctgcacc 
tcac 
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SEQ ID NO: 427. >Z38709JT5 « TY Transcript # LN 11447 # Source Gene: Z38709 
# Encoded protein: Z38709_P11 

gcggccgcacgcgcccgctcccccgccggggtcccccgccgcccctccagcgccgcccag 
cgtcgcgtccccgcgcgggccggtgcggcggcagcagaggcttcccgggtggcccggcct 
cccgagggtcgccccgccccggcccagctccgcccccggccgctgctccctgattggccg 
cccccccacctctccagggctcctctcctcctcctcgcccaatgggcaccggcacctggt 
ccaaaaattcccctgaggggaccgagggggctcggccacttccagctcccgccggcggct 
gtgacggccgcagcgggtcgcagagcagggagtggacacgagagtggggagcggagaggg 
aaggaggaggaggaggagcaaaggtgttggagagaaaacttcagaaaggaacaggaaacc 
tgccggggagcgcggcggcggctgcggcttctctgggcgcctgggctgcgctcttcgcgg 
ggtgtcccgagctggcgcttggccccggtctggcttccccggcgcgcaccacggctgagc 
gcgacgctcagtggctcgggccgtgcctccgccgtggctcttggccgctgtagtcccgcg 
atccgatccgcttctgccgcggcggctccctggagagagggcggcgagggcgcagggaag 
aagagggacgtccactgtggaacatgaagcagcatgaagcagcatgactgagaaaatgtc 
cagcttcctctacataggggacatcgtgtccctgtacgcggagggctcggtcaacggctt 
catcagcaccttggggttagtggatgacagatgtgtggtgcacccagaggccggggacct 
tgccaaccctcccaagaagttcagagactgccttttcaaggtgtgccctatgaacagata 
ttctgcccagaagcaatattggaaagcaaagcaagccaaacaagggaaccacaccgaggc 
agccttgctgaagaaactacagcacgctgcagaactggaacaaaaacaaaatgaatcgga 
gaataagaaactgttgggagaaattgtaaaatacagtaatgttatacaactactgcatat 
aaaaagcaacaaatatcttactgtcaacaagagattacctgctttactggagaagaatgc 
catgcgtgtgtccttggatgctgcaggaaatgaagggtcttggttttatattcatccatt 
ctggaaactgagaagcgagggtgacaatattgttgtaggagataaagttgttttgatgcc 
tgtgaatgcagggcagccactacatgccagcaacatagagcttcttgataacccagggtg 
taaagaggtgaatgctgtcaattgcaacaccagctggaaaatcactttattcatgaaata 
tagttcctatcgagaggatgtattaaaaggaggggacgttgttagattatttcatgcgga 
acaagagaagtttttgacttgtgatgaatatgagaaaaaacagcacattttccttcgtac 
gaccttgcgccaatcagctacttctgctactagttctaaagcactctgggaaatagaggt 
ggttcatcatgacccatgccgtgggggtgcaggacagtggaacagcttgttcagatttaa 
gcatcttgcaactggaaactatttagctgcagagcttaatcctgattatcgagatgccca 
aaatgaaggaaaaaatgtgagagatggagtccctccaacttcaaagaaaaaacgccaggc 
aggggagaagatcatgtatactttggtttcagtcccgcatggcaatgacattgcatccct 
ttttgaactagatgccacaactcttcagagagctgactgcctggttccaaggaactcata 
tgttcggttaaggcatttatgcaccaacacatgggtaaccagtactagtatccccataga 
cacagatgaagagaggcctgttatgttaaagattggaacctgccaaaccaaagaagataa 
agaagcgttcgcaatcgtgtctgttccactgtctgaagttcgagacttagactttgccaa 
tgatgccaataaagtactagcgaccacagttaaaaagctagaaaacggcacaataactca 
gaatgaaaggaggtttgtaaccaaattattggaagatctcatattctttgttgctgatgt 
gcctaataatggacaagaagttctggatgtggttatcactaagccaaaccgagagcgtca 
aaaattgatgagggaacaaaacatactggcacaggtatttggaattcttaaagcaccctt 
taaagagaaagcaggagaaggctcgatgctgagacttgaagatctgggggatcaaagata 
tgcaccctacaagtacatgctgcggctctgttaccgcgtcctgagacactcgcagcagga 
ttaccggaaaaatcaggaatatattgctaagaatttctgtgtcatgcagtcccagattgg 
ctatgatattttggcagaagatactatcacagctttgttgcacaacaacagaaaactact 
agagaaacatatcacagcaaaagaaatagaaacatttgtcagtttactcaggagaaatcg 
ggagccaaggtttttggattatttgtcagatctgtgtgtgtctaataccactgctatccc 
tgtaactcaagaactcatctgtaaatttatgttgagtccaggcaatgcagacattctcat 
tcaaactaaggtggtctcaatgcaagcagacaaccccatggagagctccatcctttcaga 
tgacattgatgatgaagaagtttggctctattggattgacagcaacaaggaacctcatgg 
caaagctatcaggcaccttgctcaagaggcaaaagaaggcaccaaagctgacttagaagt 
tcttacctattacaggtaccagctaaacctctttgcaaggatgtgcttggatcgccagta 
tctggccataaaccagatttctacacagctgtctgtagacctgatcctgcggtgtgtgtc 
ggatgagagcctgccgttcgacctccgagcgtccttctgtcgcctcatgctccacatgca 
cgttgaccgggatccccaggagtccgtggtgcctgttcgctatgccaggctctggacaga 
aatccccacaaagatcacaattcatgaatatgattctataacagactcttccagaaatga 
tatgaagaggaaatttgccctgacaatggaatttgttgaagaatatttgaaagaagttgt 
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aaaccagccctttccttttggggataaagaaaaaaataaactgacatttgaggtggtcca 
cttggctcggaatcttatatactttggattttatagtttcagtgagttattaaggctaac 
aagaacacttctggctattttagacattgtacaggcccccatgtcatcatactttgaaag 
attaagcaaatttcaagatggaggaaacaatgtgatgagaaccattcatggggtgggaga 
gatgatgacccagatggtactcagtagaggctccatcttccccatgagcgtgccggatgt 
gccacccagcatccacccgagcaagcaagggagccccaccgagcatgaggatgtgactgt 
gatggacaccaagctgaagatcattgagattttgcagtttatcctgagtgtcagactgga 
ttataggatctcatatatgctgtcaatatataagaaggagtttggagaggacaatgacaa 
tgcggagacatctgccagtggatctccagacactttactaccatcagctattgttcctga 
tatagatgaaattgcagctcaggcagaaactatgtttgcgggaagaaaagaaaaaaatcc 
agttcaacttgacgatgaaggaggcaggacgtttttacgggtcctcattcatctgatcat 
gcacgactacccgcctttgctgtctggagccctgcagctgttgtttaagcacttcagcca 
gagggcagaggttttacaggcatttaagcaggtgcaattactggtgtctaatcaagacgt 
agataactacaagcaaatcaaggcagatctagaccagcttcgactgacagtagaaaagtc 
tgagctatgggtggagaagagcagcaactatgagaatggagaaataggggaaagtcaagt 
gaaaggtggtgaagagccaattgaggaatcaaacattttaagtccagtgcaggatggaac 
aaagaaacctcagattgacagcaacaagagcaataactaccggattgtaaaggagatttt 
gatcaggctaagtaaactctgtgtgcagaataaaaagtgtcggaatcaacatcaacgatt 
actgaaaaatatgggggcgcattcggtggtgttggatcttctgcagataccctatgaaaa 
gaatgatgaaaagatgaatgaagtaatgaatctagcccatacatttctgcagaatttctg 
tcgaggaaatccacagaatcaagttcttcttcataaacatctgaatttgtttttaactcc 
aggtctccttgaagcagaaaccatgcggcacatcttcatgaacaattaccatctgtgcaa 
cgaaattagcgagagagttgtacaacactttgtgcactgcattgagacacatggccgcca 
cgtggagtacctgaggtttttgcaaacaattgtaaaagcagatggtaaatatgtgaagaa 
atgccaggatatggtaatgacagagttgataaatgggggtgaagacgtgctgatatttta 
caatgatagagcatcatttccaatccttctccatatgatgtgttcagagagagaccgagg 
ggatgagagtggccccttagcctaccacatcaccctggtggagttgctggcagcatgcac 
agaggggaaaaatgtctacactgaaatcaagtgtaattcccttctcccgctggacgacat 
agtgagggtggtgacccatgacgactgcatccctgaggttaaaattgcttatgtgaactt 
tgttaatcactgttatgttgacactgaagtggaaatgaaagaaatctatacaagtaacca 
catttggaaattatttgagaacttcttggtggatatggcaagggtttgcaacacaactac 
agacaggaaacatgcagacatctttttggaaaagtgtgttactgagtcaataatgaatat 
tgtgagcggcttctttaattctcccttttcagacaatagtaccagcctccagacacatca 
gccagtttttattcagctactgcaatctgccttcagaatttacaattgcacctggccaaa 
cccagcgcagaaagcctcagtggaatcctgtatcagaactttggctgaagtggcaaaaaa 
tcgtggaattgccattccagtggatttggacagccaagttaatactcttttcatgaagag 
ccattcaaatatggtgcagagagcagcaatgggttggagactatcagctcgctctgggcc 
acgctttaaggaagctcttggagggcctgcttgggattacagaaatattattgaaaagtt 
acaggatgtagtggcctccttggagcaccagttcagcccaatgatgcaggctgaattctc 
agtgttggttgatgtattgtacagtccagaactgctgttccctgagggaagcgatgcaag 
aataagatgtggcgctttcatgtcgaagttgattaatcatacaaagaaactaatggagaa 
agaagaaaaactgtgcattaaaattcttcagacattacgagaaatgttagagaagaaaga 
cagctttgtggaagagggtaacacattaagaaagatacttctgaatcgatactttaaagg 
tgattatagtattggtgtgaatggacacctatcaggagcctactccaaaactgcacaggt 
gggaggaagcttttctggacaagattcagataagatggggatatcaatgtcagacattca 
gtgtctgctggataaagaaggtgcatcagaacttgtcatcgatgttatagtgaacaccaa 
aaatgacagaattttttcagaaggcattttcctcggcattgccttgcttgaaggaggaaa 
tacacaaacacagtattctttctaccagcagttgcatgaacaaaaaaagtcagaaaaatt 
ctttaaagttctctatgatcgaatgaaggctgctcagaaagaaataagatcaacagtgac 
agttaataccatagatttaggtaacaaaaaaagggacgatgacaatgaattgatgacatc 
tggtccacgaatgagagtaagagattcaacactacatttaaaagagggaatgaaagggca 
attaacagaagcttcttcagcaacatccaaagcatattgtgtatacagaagagaaatgga 
tccagaaatagacattatgtgcacaggaccagaagcgggaaacactgaggaaaaatccgc 
agaggaagtaacaatgagtcccgcaattgccatcatgcagccaatactgagatttcttca 
gttactgtgtgagaatcacaaccgggaattgcagaacttcttgaggaatcaaaacaacaa 
aacaaattacaacctagtctgtgagacccttcagtttctggactgcatttgtggaagtac 
aaccggtggcctgggcctgttgggtctctacatcaatgagaagaatgtagcgctggtcaa 
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ccagaacctggagagcttgactgagtattgccagggcccttgccatgaaaatcagacctg 
tatcgctacacatgagtctaatgggattgatatcatcattgctttgattctgaatgacat 
aaaccctcttggtaaataccgaatggacctggtgctccagctaaagaacaatgcatctaa 
acttttgctggccattatggaaagcagacatgacagtgagaatgcagaaagaattctttt 
taacatgagacccagagaactggtggatgtgatgaagaatgcctataaccaaggattgga 
atgtgaccatggggatgatgagggtggagatgatggtgtttctccaaaagatgttggaca 
caatatctatattctggcccatcagttggcccgccacaataaactgttgcagcagatgct 
caaaccaggatcggatccagatgaaggagatgaagccttaaagtattatgccaaccacac 
tgcacagattgagattgtccggcatgataggaccatggaacaaatagtttttcctgtccc 
caatatatgtgaatacctcactcgagaatccaagtgccgtgtgttcaatacaactgaaag 
ggatgaacaaggaagtaaagtgaatgactttttccagcaaacagaagatctctacaatga 
aatgaagtggcagaagaaaatcaggaataaccctgcactgttctggttctcgaggcacat 
ctctctctgggggagcatttccttcaacctggctgtgttcatcaatttagctgttgctct 
cttctacccatttggggatgatggagatgaaggtacactttctccattgttctcggttct 
tctttggatagcagttgcgatctgcacatctatgctgtttttcttctccaagcctgtggg 
tattcggccgtttcttgtatcaataatgctcagatcaatatatacaataggtcttgggcc 
tacattaatacttcttggtgcagctaatctttgtaataaaattgtttttctggtgagttt 
tgttggaaatcgtggcacgttcacccgtgggtaccgagcagtcatcctggatatggcctt 
tctctatcacgtggcgtatgtcctggtttgcatgctgggcctttttgtccatgaattctt 
ctatagcttcctgctttttgatttggtgtacagggaagagactttgctgaatgtcataaa 
aagtgtcacacgaaatggccgctctattattctaactgcagtcctggctctcatcctcgt 
ctacctgttttccattattgggttcctttttttgaaggatgacttcactatggaagttga 
taggctgaaaaaccgaactcctgttacaggcagtcatcaagtgcctactatgactttaac 
taccatgatggaagcatgtgccaaggagaactgttcacccacaattccagcttcaaatac 
agctgatgaagagtatgaagatggaattgaaaggacgtgtgacactctccttatgtgcat 
tgtcaccgtgctgaaccagggcctcaggaatggcggtggtgtgggggatgtgctaagaag 
gccatcgaaagatgagcccttgtttgctgcccgagtggtttatgaccttcttttttattt 
cattgttatcattattgttctgaacttgatttttggtgttatcatcgatacttttgctga 
tctcagaagcgaaaaacagaaaaaagaagaaattctaaagacaacttgtttcatctgtgg 
acttgagagagacaagtttgataataaaacggtttcatttgaggagcacattaagtcaga 
acacaatatgtggcattatttgtacttcatagtcctggtgaaagttaaagacccaacaga 
atacactggacctgaaagttatgtggctcaaatgattgtggagaagaatttggattggtt 
tcctcggatgcgagccatgtccctcgttagcaatgaaggcgacagtgagcaaaatgaaat 
tcggagccttcaggagaagttggaatcgaccatgagtctggtcaaacagctgtcgggtca 
gctggcggagctcaaggagcagatgacagaacaaaggaagaataagcagagactgggctt 
cctcggatcaaacacaccccatgtgaatcatcacatgccaccacactgataccatggggg 
gaagccgtgactagcctttcatcagtgtcctgcctgatcactgaataaagaactgagatg 
gaggggagtgaacagtgcctattgttgaaaagttaaaaacaaccaagtgccaagatgttg 
agtgggttagctccgagaacaatttataactgtgttttcatggttgcgaagacctaacct 
caaatgcatctgctagaaagcgtacatcacacattcgcaatgcatcaggaagaaaaggct 
tgcccaaaaggctggagagggcagggagcggcaggatggaaggagacacggggcagggag 
aactctcttctgctaaatcgataggagtcagttttgtcttaaatgctgactacagccact 
gacatggttggctggaatttctttcttttaattgtggcatataggtttgtgacacaagaa 
gtcatactttggtggctaagttttactaaggaaaataactgaaaagattaaaagtgagag 
ctgaaaagagaaatgataatgcttccaaactgtagctgtcacagggcaatttctttattt 
ataacatgaagcacaatggatttacagctctaggaacttagtactttggagcttttgcct 
ctcacactgacaacataacaggatgtgattgccttctctgggattcagacaggctctgtc 
aatgtggagcacaaaaggagattttcatataacttgttaaaaacatgttctaagtcatgt 
ataggctaagattttaagaagatctgggggaataaaaagccaacagtaagccccaggaaa 
gggttttttgagaccatatgtatgctattaaaatatattgagattaatacatgaaaatgc 
ttaaaagtgataggaattattgagaaacttatgatggtggcattgccttttataaacatg 
gagtgaaagccatttgactcaacgtttgctgtgcttaaagaattgcttcaggacccgagc 
gtttttatgtatgctgttcctgcagttaggaaaaaaaaatccaagaaatgtattgaatac 
tcaagaaaatgccacatttcaattacatatttaaaactgtatctgtaagggcttttcaaa 
atgtagcaagataaaaatctcttattcaaattgtttttgtattaaatcatgcactcagaa 
tttgtcgagggagagaataacctggtgtgttcaggttattcttagagactacacatttgg 
aatagcagagcaaaatatggataatgaaagtgttgggaaaaaagtgatgtgacagggagt 



293/365 



Figur 1 (Cont'd) 



gaagacacttgataccagactctgggagatactctgcaagttgacctggcctctccccca 
caggaacaaaacactgcctccagagtctttaaattctcagttatcaacgccaaggtttaa 
ggtctagatagggtttgctatagggtttgctctgacaatttttaaagcttttggattgtt 
ctaacatagcttaagctattggttcctaaaaatcccaaatcaagatctatgtagaatata 
aaggaagcctgaaccaatccttccacatactgttaagatgtagacttggaacaaagctgt 
tgggacccagagcaatgaatttttgaactgaagctactggtactccccagcaccacctca 
tactaagaattcctcactctgacatgacagtattttttctgaccaggagctgaaagaccc 
tgacattcatgatccaaagatataagaaattttgatatgtctgcatgcagcacagaactt 
tgaacctaggaggcagtcaataaatacatgagaaattcagctgtcattcagttactctta 
attctttataaaactttaaatgataccacatatttgtttgtttaaaatggctttcccaaa 
atcaaagtagactaaagcagccatctttaaaatccaggatcatcaatgctattaacagta 
tcaggtaaaaagtacactttaaaatattttaatcaggcagagttttatggatagagaata 
gaagagaaaggtagtaaatattgaacatattccaatataggaacctatctctgttttagt 
acaaaatatttctgacatctgaactagaggtcaagagaataaattcatttgtatacatct 
gagcaacctgtctttcagatgataaagtatctagccttttctgacaccataatagttcat 
tttgtagggaataagccattaggtgtatataattgctttctagaaatgacctaatgtccc 
caaccactttgtagtggcagatcactgtttcacagcatattttctcccaaggaaagtatt 
caaaagagactgcaactaacaagactcttatttcatcaaaatttaaatatttctgagttg 
tatttttaatgcctctttcttttctgcctaaatgcttagaaattataaagcaaaaaaaaa 
aaaaaaaacagcaacaaaaaatcgaagcagacaaaaaaggcacttttcagaacatcaaat 
tcctaatgaagaagaggagaagataatggggaaatttgacatttgatataaatttatatt 
tgttatgtgtatgtttgttaatgcaactggaatatttgacttaggtgagtatcagttaac 
atccttgtatttatatagtgacatcaaaataaatgcaatcatcttga 

SEQ ID NO:428. >Z38709_T6 # TY Transcript # LN 12799 # Source Gene: Z38709 
# Encoded protein: Z38709__P11 

gcggccgcacgcgcccgctcccccgccggggtcccccgccgcccctccagcgccgcccag 
cgtcgcgtccccgcgcgggccggtgcggcggcagcagaggcttcccgggtggcccggcct 
cccgagggtcgccccgccccggcccagctccgcccccggccgctgctccctgattggccg 
cccccccacctctccagggctcctctcctcctcctcgcccaatgggcaccggcacctggt 
ccaaaaattcccctgaggggaccgagggggctcggccacttccagctcccgccggcggct 
gtgacggccgcagcgggtcgcagagcagggagtggacacgagagtggggagcggagaggg 
aaggaggaggaggaggagcaaaggtgttggagagaaaacttcagaaaggaacaggaaacc 
tgccggggagcgcggcggcggctgcggcttctctgggcgcctgggctgcgctcttcgcgg 
ggtgtcccgagctggcgcttggccccggtctggcttccccggcgcgcaccacggctgagc 
gcgacgctcagtggctcgggccgtgcctccgccgtggctcttggccgctgtagtcccgcg 
atccgatccgcttctgccgcggcggctccctggagagagggcggcgagggcgcagggaag 
aagagggacgtccactgtggaacatgaagcagcatgaagcagcatgactgagaaaatgtc 
cagcttcctctacataggggacatcgtgtccctgtacgcggagggctcggtcaacggctt 
catcagcaccttggggttagtggatgacagatgtgtggtgcacccagaggccggggacct 
tgccaaccctcccaagaagttcagagactgccttttcaaggtgtgccctatgaacagata 
ttctgcccagaagcaatattggaaagcaaagcaagccaaacaagggaaccacaccgaggc 
agccttgctgaagaaactacagcacgctgcagaactggaacaaaaacaaaatgaatcgga 
gaataagaaactgttgggagaaattgtaaaatacagtaatgttatacaactactgcatat 
aaaaagcaacaaatatcttactgtcaacaagagattacctgctttactggagaagaatgc 
catgcgtgtgtccttggatgctgcaggaaatgaagggtcttggttttatattcatccatt 
ctggaaactgagaagcgagggtgacaatattgttgtaggagataaagttgttttgatgcc 
tgtgaatgcagggcagccactacatgccagcaacatagagcttcttgataacccagggtg 
taaagaggtgaatgctgtcaattgcaacaccagctggaaaatcactttattcatgaaata 
tagttcctatcgagaggatgtattaaaaggaggggacgttgttagattatttcatgcgga 
acaagagaagtttttgacttgtgatgaatatgagaaaaaacagcacattttccttcgtac 
gaccttgcgccaatcagctacttctgctactagttctaaagcactctgggaaatagaggt 
ggttcatcatgacccatgccgtgggggtgcaggacagtggaacagcttgttcagatttaa 
gcatcttgcaactggaaactatttagctgcagagcttaatcctgattatcgagatgccca 
aaatgaaggaaaaaatgtgagagatggagtccctccaacttcaaagaaaaaacgccaggc 
aggggagaagatcatgtatactttggtttcagtcccgcatggcaatgacattgcatccct 
ttttgaactagatgccacaactcttcagagagctgactgcctggttccaaggaactcata 
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tgttcggttaaggcatttatgcaccaacacatgggtaaccagtactagtatccccataga 
cacagatgaagagaggcctgttatgttaaagattggaacctgccaaaccaaagaagataa 
agaagcgttcgcaatcgtgtctgttccactgtctgaagttcgagacttagactttgccaa 
tgatgccaataaagtactagcgaccacagttaaaaagctagaaaacggcacaataactca 
gaatgaaaggaggtttgtaaccaaattattggaagatctcatattctttgttgctgatgt 
gcctaataatggacaagaagttctggatgtggttatcactaagccaaaccgagagcgtca 
aaaattgatgagggaacaaaacatactggcacaggtatttggaattcttaaagcaccctt 
taaagagaaagcaggagaaggctcgatgctgagacttgaagatctgggggatcaaagata 
tgcaccctacaagtacatgctgcggctctgttaccgcgtcctgagacactcgcagcagga 
ttaccggaaaaatcaggaatatattgctaagaatttctgtgtcatgcagtcccagattgg 
ctatgatattttggcagaagatactatcacagctttgttgcacaacaacagaaaactact 
agagaaacatatcacagcaaaagaaatagaaacatttgtcagtttactcaggagaaatcg 
ggagccaaggtttttggattatttgtcagatctgtgtgtgtctaataccactgctatccc 
tgtaactcaagaactcatctgtaaatttatgttgagtccaggcaatgcagacattctcat 
tcaaactaaggtggtctcaatgcaagcagacaaccccatggagagctccatcctttcaga 
tgacattgatgatgaagaagtttggctctattggattgacagcaacaaggaacctcatgg 
caaagctatcaggcaccttgctcaagaggcaaaagaaggcaccaaagctgacttagaagt 
tcttacctattacaggtaccagctaaacctctttgcaaggatgtgcttggatcgccagta 
tctggccataaaccagatttctacacagctgtctgtagacctgatcctgcggtgtgtgtc 
ggatgagagcctgccgttcgacctccgagcgtccttctgtcgcctcatgctccacatgca 
cgttgaccgggatccccaggagtccgtggtgcctgttcgctatgccaggctctggacaga 
aatccccacaaagatcacaattcatgaatatgattctataacagactcttccagaaatga 
tatgaagaggaaatttgccctgacaatggaatttgttgaagaatatttgaaagaagttgt 
aaaccagccctttccttttggggataaagaaaaaaataaactgacatttgaggtggtcca 
cttggctcggaatcttatatactttggattttatagtttcagtgagttattaaggctaac 
aagaacacttctggctattttagacattgtacaggcccccatgtcatcatactttgaaag 
attaagcaaatttcaagatggaggaaacaatgtgatgagaaccattcatggggtgggaga 
gatgatgacccagatggtactcagtagaggctccatcttccccatgagcgtgccggatgt 
gccacccagcatccacccgagcaagcaagggagccccaccgagcatgaggatgtgactgt 
gatggacaccaagctgaagatcattgagattttgcagtttatcctgagtgtcagactgga 
ttataggatctcatatatgctgtcaatatataagaaggagtttggagaggacaatgacaa 
tgcggagacatctgccagtggatctccagacactttactaccatcagctattgttcctga 
tatagatgaaattgcagctcaggcagaaactatgtttgcgggaagaaaagaaaaaaatcc 
agttcaacttgacgatgaaggaggcaggacgtttttacgggtcctcattcatctgatcat 
gcacgactacccgcctttgctgtctggagccctgcagctgttgtttaagcacttcagcca 
gagggcagaggttttacaggcatttaagcaggtgcaattactggtgtctaatcaagacgt 
agataactacaagcaaatcaaggcagatctagaccagcttcgactgacagtagaaaagtc 
tgagctatgggtggagaagagcagcaactatgagaatggagaaataggggaaagtcaagt 
gaaaggtggtgaagagccaattgaggaatcaaacattttaagtccagtgcaggatggaac 
aaagaaacctcagattgacagcaacaagagcaataactaccggattgtaaaggagatttt 
gatcaggctaagtaaactctgtgtgcagaataaaaagtgtcggaatcaacatcaacgatt 
actgaaaaatatgggggcgcattcggtggtgttggatcttctgcagataccctatgaaaa 
gaatgatgaaaagatgaatgaagtaatgaatctagcccatacatttctgcagaatttctg 
tcgaggaaatccacagaatcaagttcttcttcataaacatctgaatttgtttttaactcc 
aggtctccttgaagcagaaaccatgcggcacatcttcatgaacaattaccatctgtgcaa 
cgaaattagcgagagagttgtacaacactttgtgcactgcattgagacacatggccgcca 
cgtggagtacctgaggtttttgcaaacaattgtaaaagcagatggtaaatatgtgaagaa 
atgccaggatatggtaatgacagagttgataaatgggggtgaagacgtgctgatatttta 
caatgatagagcatcatttccaatccttctccatatgatgtgttcagagagagaccgagg 
ggatgagagtggccccttagcctaccacatcaccctggtggagttgctggcagcatgcac 
agaggggaaaaatgtctacactgaaatcaagtgtaattcccttctcccgctggacgacat 
agtgagggtggtgacccatgacgactgcatccctgaggttaaaattgcttatgtgaactt 
tgttaatcactgttatgttgacactgaagtggaaatgaaagaaatctatacaagtaacca 
catttggaaattatttgagaacttcttggtggatatggcaagggtttgcaacacaactac 
agacaggaaacatgcagacatctttttggaaaagtgtgttactgagtcaataatgaatat 
tgtgagcggcttctttaattctcccttttcagacaatagtaccagcctccagacacatca 
gccagtttttattcagctactgcaatctgccttcagaatttacaattgcacctggccaaa 
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cccagcgcagaaagcctcagtggaatcctgtatcagaactttggctgaagtggcaaaaaa 
tcgtggaattgccattccagtggatttggacagccaagttaatactcttttcatgaagag 
ccattcaaatatggtgcagagagcagcaatgggttggagactatcagctcgctctgggcc 
acgctttaaggaagctcttggagggcctgcttgggattacagaaatattattgaaaagtt 
acaggatgtagtggcctccttggagcaccagttcagcccaatgatgcaggctgaattctc 
agtgttggttgatgtattgtacagtccagaactgctgttccctgagggaagcgatgcaag 
aataagatgtggcgctttcatgtcgaagttgattaatcatacaaagaaactaatggagaa 
agaagaaaaactgtgcattaaaattcttcagacattacgagaaatgttagagaagaaaga 
cagctttgtggaagagggtaacacattaagaaagatacttctgaatcgatactttaaagg 
tgattatagtattggtgtgaatggacacctatcaggagcctactccaaaactgcacaggt 
gggaggaagcttttctggacaagattcagataagatggggatatcaatgtcagacattca 
gtgtctgctggataaagaaggtgcatcagaacttgtcatcgatgttatagtgaacaccaa 
aaatgacagaattttttcagaaggcattttcctcggcattgccttgcttgaaggaggaaa 
tacacaaacacagtattctttctaccagcagttgcatgaacaaaaaaagtcagaaaaatt 
ctttaaagttctctatgatcgaatgaaggctgctcagaaagaaataagatcaacagtgac 
agttaataccatagatttaggtaacaaaaaaagggacgatgacaatgaattgatgacatc 
tggtccacgaatgagagtaagagattcaacactacatttaaaagagggaatgaaagggca 
attaacagaagcttcttcagcaacatccaaagcatattgtgtatacagaagagaaatgga 
tccagaaatagacattatgtgcacaggaccagaagcgggaaacactgaggaaaaatccgc 
agaggaagtaacaatgagtcccgcaattgccatcatgcagccaatactgagatttcttca 
gttactgtgtgagaatcacaaccgggaattgcagaacttcttgaggaatcaaaacaacaa 
aacaaattacaacctagtctgtgagacccttcagtttctggactgcatttgtggaagtac 
aaccggtggcctgggcctgttgggtctctacatcaatgagaagaatgtagcgctggtcaa 
ccagaacctggagagcttgactgagtattgccagggcccttgccatgaaaatcagacctg 
tatcgctacacatgagtctaatgggattgatatcatcattgctttgattctgaatgacat 
aaaccctcttggtaaataccgaatggacctggtgctccagctaaagaacaatgcatctaa 
acttttgctggccattatggaaagcagacatgacagtgagaatgcagaaagaattctttt 
taacatgagacccagagaactggtggatgtgatgaagaatgcctataaccaaggattgga 
atgtgaccatggggatgatgagggtggagatgatggtgtttctccaaaagatgttggaca 
caatatctatattctggcccatcagttggcccgccacaataaactgttgcagcagatgct 
caaaccaggatcggatccagatgaaggagatgaagccttaaagtattatgccaaccacac 
tgcacagattgagattgtccggcatgataggaccatggaacaaatagtttttcctgtccc 
caatatatgtgaatacctcactcgagaatccaagtgccgtgtgttcaatacaactgaaag 
ggatgaacaaggaagtaaagtgaatgactttttccagcaaacagaagatctctacaatga 
aatgaagtggcagaagaaaatcaggaataaccctgcactgttctggttctcgaggcacat 
ctctctctgggggagcatttccttcaacctggctgtgttcatcaatttagctgttgctct 
cttctacccatttggggatgatggagatgaaggtacactttctccattgttctcggttct 
tctttggatagcagttgcgatctgcacatctatgctgtttttcttctccaagcctgtggg 
tattcggccgtttcttgtatcaataatgctcagatcaatatatacaataggtcttgggcc 
tacattaatacttcttggtgcagctaatctttgtaataaaattgtttttctggtgagttt 
tgttggaaatcgtggcacgttcacccgtgggtaccgagcagtcatcctggatatggcctt 
tctctatcacgtggcgtatgtcctggtttgcatgctgggcctttttgtccatgaattctt 
ctatagcttcctgctttttgatttggtgtacagggaagagactttgctgaatgtcataaa 
aagtgtcacacgaaatggccgctctattattctaactgcagtcctggctctcatcctcgt 
ctacctgttttccattattgggttcctttttttgaaggatgacttcactatggaagttga 
taggctgaaaaaccgaactcctgttacaggcagtcatcaagtgcctactatgactttaac 
taccatgatggaagcatgtgccaaggagaactgttcacccacaattccagcttcaaatac 
agctgatgaagagtatgaagatggaattgaaaggacgtgtgacactctccttatgtgcat 
tgtcaccgtgctgaaccagggcctcaggaatggcggtggtgtgggggatgtgctaagaag 
gccatcgaaagatgagcccttgtttgctgcccgagtggtttatgaccttcttttttattt 
cattgttatcattattgttctgaacttgatttttggtgttatcatcgatacttttgctga 
tctcagaagcgaaaaacagaaaaaagaagaaattctaaagacaacttgtttcatctgtgg 
acttgagagagacaagtttgataataaaacggtttcatttgaggagcacattaagtcaga 
acacaatatgtggcattatttgtacttcatagtcctggtgaaagttaaagacccaacaga 
atacactggacctgaaagttatgtggctcaaatgattgtggagaagaatttggattggtt 
tcctcggatgcgagccatgtccctcgttagcaatgaaggcgacagtgagcaaaatgaaat 
tcggagccttcaggagaagttggaatcgaccatgagtctggtcaaacagctgtcgggtca 
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gctggcggagctcaaggagcagatgacagaacaaaggaagaataagcagagactgggctt 
cctcggatcaaacacaccccatgtgaatcatcacatgccaccacactgataccatggggg 
gaagccgtgactagcctttcatcagtgtcctgcctgatcactgaataaagaactgagatg 
gaggggagtgaacagtgcctattgttgaaaagttaaaaacaaccaagtgccaagatgttg 
agtgggttagctccgagaacaatttataactgtgttttcatggttgcgaagacctaacct 
caaatgcatctgctagaaagcgtacatcacacattcgcaatgcatcaggaagaaaaggct 
tgcccaaaaggctggagagggcagggagcggcaggatggaaggagacacggggcagggag 
aactctcttctgctaaatcgataggagtcagttttgtcttaaatgctgactacagccact 
gacatggttggctggaatttctttcttttaattgtggcatataggtttgtgacacaagaa 
gtcatactttggtggctaagttttactaaggaaaataactgaaaagattaaaagtgagag 
ctgaaaagagaaatgataatgcttccaaactgtagctgtcacagggcaatttctttattt 
ataacatgaagcacaatggatttacagctctaggaacttagtactttggagcttttgcct 
ctcacactgacaacataacaggatgtgattgccttctctgggattcagacaggctctgtc 
aatgtggagcacaaaaggagattttcatataacttgttaaaaacatgttctaagtcatgt 
ataggctaagattttaagaagatctgggggaataaaaagccaacagtaagccccaggaaa 
gggttttttgagaccatatgtatgctattaaaatatattgagattaatacatgaaaatgc 
ttaaaagtgataggaattattgagaaacttatgatggtggcattgccttttataaacatg 
gagtgaaagccatttgactcaacgtttgctgtgcttaaagaattgcttcaggacccgagc 
gtttttatgtatgctgttcctgcagttaggaaaaaaaaatccaagaaatgtattgaatac 
tcaagaaaatgccacatttcaattacatatttaaaactgtatctgtaagggcttttcaaa 
atgtagcaagataaaaatctcttattcaaattgtttttgtattaaatcatgcactcagaa 
tttgtcgagggagagaataacctggtgtgttcaggttattcttagagactacacatttgg 
aatagcagagcaaaatatggataatgaaagtgttgggaaaaaagtgatgtgacagggagt 
gaagacacttgataccagactctgggagatactctgcaagttgacctggcctctccccca 
caggaacaaaacactgcctccagagtctttaaattctcagttatcaacgccaaggtttaa 
ggtctagatagggtttgctatagggtttgctctgacaatttttaaagcttttggattgtt 
ctaacatagcttaagctattggttcctaaaaatcccaaatcaagatctatgtagaatata 
aaggaagcctgaaccaatccttccacatactgttaagatgtagacttggaacaaagctgt 
tgggacccagagcaatgaatttttgaactgaagctactggtactccccagcaccacctca 
tactaagaattcctcactctgacatgacagtattttttctgaccaggagctgaaagaccc 
tgacattcatgatccaaagatataagaaattttgatatgtctgcatgcagcacagaactt 
tgaacctaggaggcagtcaataaatacatgagaaattcagctgtcattcagttactctta 
attctttataaaactttaaatgataccacatatttgtttgtttaaaatggctttcccaaa 
atcaaagtagactaaagcagccatctttaaaatccaggatcatcaatgctattaacagta 
tcaggtaaaaagtacactttaaaatattttaatcaggcagagttttatggatagagaata 
gaagagaaaggtagtaaatattgaacatattccaatataggaacctatctctgttttagt 
acaaacaacctgtctttcagatgataaagtatctagccttttctgacaccataatagttc 
attttgtagggaataagccattaggtgtatataattgctttctagaaatgacctaatgtc 
cccaaccactttgtagtggcagatcactgtttcacagcatattttctcccaaggaaagta 
ttcaaaagagactgcaactaacaagactcttatttcatcaaaatttaaatatttctgagt 
tgtatttttaatgcctctttcttttctgcctaaatgcttagaaattataaagcaaaaaaa 
aaaaaaaaaacagcaacaaaaaatcgaagcagacaaaaaaggcacttttcagaacatcaa 
attcctaatgaagaagaggagaagataatggggaaatttgacatttgatataaatttata 
tttgttatgtgtatgtttgttaatgcaactggaatatttgacttaggtgagtatcagtta 
acatccttgtatttatatagtgacatcaaaataaatgcaatcatcttgaactttgatgtt 
aagggagattttgaaaaaatatttagttatttcaaattcatattggttcaaaagtatcag 
tttctgctgaataatatttttaatttcaaaagggttttgttccctctgtgctccattctg 
aggctcaagagcttaatgcaagtatgttttctgagttaaaataacacttttagatgagaa 
aatgcttgtatcatacagggctataatataaataaataaattgtatgtatgcaaaattta 
tcatattttctaccccttaaaaaattaagttagaaataagcttattttcttgcacatcaa 
tatttttctgttggtaaagaggacacaatttctagtagattgttaataacatcaggaaga 
tatttctttcgtcagaactaattgtgtgcttattcccatattctcagctcataactccct 
gttttggctgctctctttattttacaattgtttcattgcaaacagcaaggcagtatagcc 
cacactcagccactagcccccagtccccaggactgagaatgagtggggaggctggggagt 
tggtgagagaaggtggagatagggatttctgcttcccagtactctatcattagagaaaag 
gaggggtaaatattagctaagaagaagaaaaaagcttattcttcagggacccagtagatt 
ggacgtaagcaagtaaatattgcacaggcatcatgatgtaatagttgatggaccccagag 
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Figure 1 (Conf d) 



atacccctcacccatgcagagaacaaagctgtgtggaagagcaactaccataagatggtg 
tgttccttccagcggctgttaaagccatcctgaataacaaatcagctattcctacctaca 
gacccattccaatatgctgtgatacacgacacggtgcaccgcagaggagacagacatctt 
cagcacaaacacagatgcctgcaggagcttggcaagtcactaaattgcatgaaattgtga 
ggtgcacaccaaatgcagggatgggggaggccgtgggaagctctgtattctcaaaatttg 
acagctaattccggtttttagaaaatgcctgaggccagtagaggccctctagtcactcac 
tgctgctgtttctgatataattattgagaaagctatctcacttaatagaagaaaacacgc 
actatcaaaaccagatagcctaacgtgcatgtgaaaatcgagaaagctgaaaacaaatcc 
aggtacccttctctgaactggagtgtttccacagacttgaataatttatgaaattatcac 
accagtatttctcatatcaccaagaagactttctctcctgcagtagaggattgttatatt 
tgcctaaaaaacacgattccaatatatgacaagggcagataatttataagtgaatgttaa 
taaaattggatgtgtataa 

SEQ ID NO: 429. >Z38709_T7 # TY Transcript # LN 10973 # Source Gene: Z38709 
# Encoded protein: Z38709_P11 

gcggccgcacgcgcccgctcccccgccggggtcccccgccgcccctccagcgccgcccag 
cgtcgcgtccccgcgcgggccggtgcggcggcagcagaggcttcccgggtggcccggcct 
cccgagggtcgccccgccccggcccagctccgcccccggccgctgctccctgattggccg 
cccccccacctctccagggctcctctcctcctcctcgcccaatgggcaccggcacctggt 
ccaaaaattcccctgaggggaccgagggggctcggccacttccagctcccgccggcggct 
gtgacggccgcagcgggtcgcagagcagggagtggacacgagagtggggagcggagaggg 
aaggaggaggaggaggagcaaaggtgttggagagaaaacttcagaaaggaacaggaaacc 
tgccggggagcgcggcggcggctgcggcttctctgggcgcctgggctgcgctcttcgcgg 
ggtgtcccgagctggcgcttggccccggtctggcttccccggcgcgcaccacggctgagc 
gcgacgctcagtggctcgggccgtgcctccgccgtggctcttggccgctgtagtcccgcg 
atccgatccgcttctgccgcggcggctccctggagagagggcggcgagggcgcagggaag 
aagagggacgtccactgtggaacatgaagcagcatgaagcagcatgactgagaaaatgtc 
cagcttcctctacataggggacatcgtgtccctgtacgcggagggctcggtcaacggctt 
catcagcaccttggggttagtggatgacagatgtgtggtgcacccagaggccggggacct 
tgccaaccctcccaagaagttcagagactgccttttcaaggtgtgccctatgaacagata 
ttctgcccagaagcaatattggaaagcaaagcaagccaaacaagggaaccacaccgaggc 
agccttgctgaagaaactacagcacgctgcagaactggaacaaaaacaaaatgaatcgga 
gaataagaaactgttgggagaaattgtaaaatacagtaatgttatacaactactgcatat 
aaaaagcaacaaatatcttactgtcaacaagagattacctgctttactggagaagaatgc 
catgcgtgtgtccttggatgctgcaggaaatgaagggtcttggttttatattcatccatt 
ctggaaactgagaagcgagggtgacaatattgttgtaggagataaagttgttttgatgcc 
tgtgaatgcagggcagccactacatgccagcaacatagagcttcttgataacccagggtg 
taaagaggtgaatgctgtcaattgcaacaccagctggaaaatcactttattcatgaaata 
tagttcctatcgagaggatgtattaaaaggaggggacgttgttagattatttcatgcgga 
acaagagaagtttttgacttgtgatgaatatgagaaaaaacagcacattttccttcgtac 
gaccttgcgccaatcagctacttctgctactagttctaaagcactctgggaaatagaggt 
ggttcatcatgacccatgccgtgggggtgcaggacagtggaacagcttgttcagatttaa 
gcatcttgcaactggaaactatttagctgcagagcttaatcctgattatcgagatgccca 
aaatgaaggaaaaaatgtgagagatggagtccctccaacttcaaagaaaaaacgccaggc 
aggggagaagatcatgtatactttggtttcagtcccgcatggcaatgacattgcatccct 
ttttgaactagatgccacaactcttcagagagctgactgcctggttccaaggaactcata 
tgttcggttaaggcatttatgcaccaacacatgggtaaccagtactagtatccccataga 
cacagatgaagagaggcctgttatgttaaagattggaacctgccaaaccaaagaagataa 
agaagcgttcgcaatcgtgtctgttccactgtctgaagttcgagacttagactttgccaa 
tgatgccaataaagtactagcgaccacagttaaaaagctagaaaacggcacaataactca 
gaatgaaaggaggtttgtaaccaaattattggaagatctcatattctttgttgctgatgt 
gcctaataatggacaagaagttctggatgtggttatcactaagccaaaccgagagcgtca 
aaaattgatgagggaacaaaacatactggcacaggtatttggaattcttaaagcaccctt 
taaagagaaagcaggagaaggctcgatgctgagacttgaagatctgggggatcaaagata 
tgcaccctacaagtacatgctgcggctctgttaccgcgtcctgagacactcgcagcagga 
ttaccggaaaaatcaggaatatattgctaagaatttctgtgtcatgcagtcccagattgg 
ctatgatattttggcagaagatactatcacagctttgttgcacaacaacagaaaactact 
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Figure 1 (Cont'd) 



agagaaacatatcacagcaaaagaaatagaaacatttgtcagtttactcaggagaaatcg 
ggagccaaggtttttggattatttgtcagatctgtgtgtgtctaataccactgctatccc 
tgtaactcaagaactcatctgtaaatttatgttgagtccaggcaatgcagacattctcat 
tcaaactaaggtggtctcaatgcaagcagacaaccccatggagagctccatcctttcaga 
tgacattgatgatgaagaagtttggctctattggattgacagcaacaaggaacctcatgg 
caaagctatcaggcaccttgctcaagaggcaaaagaaggcaccaaagctgacttagaagt 
tcttacctattacaggtaccagctaaacctctttgcaaggatgtgcttggatcgccagta 
tctggccataaaccagatttctacacagctgtctgtagacctgatcctgcggtgtgtgtc 
ggatgagagcctgccgttcgacctccgagcgtccttctgtcgcctcatgctccacatgca 
cgttgaccgggatccccaggagtccgtggtgcctgttcgctatgccaggctctggacaga 
aatccccacaaagatcacaattcatgaatatgattctataacagactcttccagaaatga 
tatgaagaggaaatttgccctgacaatggaatttgttgaagaatatttgaaagaagttgt 
aaaccagccctttccttttggggataaagaaaaaaataaactgacatttgaggtggtcca 
cttggctcggaatcttatatactttggattttatagtttcagtgagttattaaggctaac 
aagaacacttctggctattttagacattgtacaggcccccatgtcatcatactttgaaag 
attaagcaaatttcaagatggaggaaacaatgtgatgagaaccattcatggggtgggaga 
gatgatgacccagatggtactcagtagaggctccatcttccccatgagcgtgccggatgt 
gccacccagcatccacccgagcaagcaagggagccccaccgagcatgaggatgtgactgt 
gatggacaccaagctgaagatcattgagattttgcagtttatcctgagtgtcagactgga 
ttataggatctcatatatgctgtcaatatataagaaggagtttggagaggacaatgacaa 
tgcggagacatctgccagtggatctccagacactttactaccatcagctattgttcctga 
tatagatgaaattgcagctcaggcagaaactatgtttgcgggaagaaaagaaaaaaatcc 
agttcaacttgacgatgaaggaggcaggacgtttttacgggtcctcattcatctgatcat 
gcacgactacccgcctttgctgtctggagccctgcagctgttgtttaagcacttcagcca 
gagggcagaggttttacaggcatttaagcaggtgcaattactggtgtctaatcaagacgt 
agataactacaagcaaatcaaggcagatctagaccagcttcgactgacagtagaaaagtc 
tgagctatgggtggagaagagcagcaactatgagaatggagaaataggggaaagtcaagt 
gaaaggtggtgaagagccaattgaggaatcaaacattttaagtccagtgcaggatggaac 
aaagaaacctcagattgacagcaacaagagcaataactaccggattgtaaaggagatttt 
gatcaggctaagtaaactctgtgtgcagaataaaaagtgtcggaatcaacatcaacgatt 
actgaaaaatatgggggcgcattcggtggtgttggatcttctgcagataccctatgaaaa 
gaatgatgaaaagatgaatgaagtaatgaatctagcccatacatttctgcagaatttctg 
tcgaggaaatccacagaatcaagttcttcttcataaacatctgaatttgtttttaactcc 
aggtctccttgaagcagaaaccatgcggcacatcttcatgaacaattaccatctgtgcaa 
cgaaattagcgagagagttgtacaacactttgtgcactgcattgagacacatggccgcca 
cgtggagtacctgaggtttttgcaaacaattgtaaaagcagatggtaaatatgtgaagaa 
atgccaggatatggtaatgacagagttgataaatgggggtgaagacgtgctgatatttta 
caatgatagagcatcatttccaatccttctccatatgatgtgttcagagagagaccgagg 
ggatgagagtggccccttagcctaccacatcaccctggtggagttgctggcagcatgcac 
agaggggaaaaatgtctacactgaaatcaagtgtaattcccttctcccgctggacgacat 
agtgagggtggtgacccatgacgactgcatccctgaggttaaaattgcttatgtgaactt 
tgttaatcactgttatgttgacactgaagtggaaatgaaagaaatctatacaagtaacca 
catttggaaattatttgagaacttcttggtggatatggcaagggtttgcaacacaactac 
agacaggaaacatgcagacatctttttggaaaagtgtgttactgagtcaataatgaatat 
tgtgagcggcttctttaattctcccttttcagacaatagtaccagcctccagacacatca 
gccagtttttattcagctactgcaatctgccttcagaatttacaattgcacctggccaaa 
cccagcgcagaaagcctcagtggaatcctgtatcagaactttggctgaagtggcaaaaaa 
tcgtggaattgccattccagtggatttggacagccaagttaatactcttttcatgaagag 
ccattcaaatatggtgcagagagcagcaatgggttggagactatcagctcgctctgggcc 
acgctttaaggaagctcttggagggcctgcttgggattacagaaatattattgaaaagtt 
acaggatgtagtggcctccttggagcaccagttcagcccaatgatgcaggctgaattctc 
agtgttggttgatgtattgtacagtccagaactgctgttccctgagggaagcgatgcaag 
aataagatgtggcgctttcatgtcgaagttgattaatcatacaaagaaactaatggagaa 
agaagaaaaactgtgcattaaaattcttcagacattacgagaaatgttagagaagaaaga 
cagctttgtggaagagggtaacacattaagaaagatacttctgaatcgatactttaaagg 
tgattatagtattggtgtgaatggacacctatcaggagcctactccaaaactgcacaggt 
gggaggaagcttttctggacaagattcagataagatggggatatcaatgtcagacattca 
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Figure 1 (Cont'd) 



gtgtctgctggataaagaaggtgcatcagaacttgtcatcgatgttatagtgaacaccaa 
aaatgacagaattttttcagaaggcattttcctcggcattgccttgcttgaaggaggaaa 
tacacaaacacagtattctttctaccagcagttgcatgaacaaaaaaagtcagaaaaatt 
ctttaaagttctctatgatcgaatgaaggctgctcagaaagaaataagatcaacagtgac 
agttaataccatagatttaggtaacaaaaaaagggacgatgacaatgaattgatgacatc 
tggtccacgaatgagagtaagagattcaacactacatttaaaagagggaatgaaagggca 
attaacagaagcttcttcagcaacatccaaagcatattgtgtatacagaagagaaatgga 
tccagaaatagacattatgtgcacaggaccagaagcgggaaacactgaggaaaaatccgc 
agaggaagtaacaatgagtcccgcaattgccatcatgcagccaatactgagatttcttca 
gttactgtgtgagaatcacaaccgggaattgcagaacttcttgaggaatcaaaacaacaa 
aacaaattacaacctagtctgtgagacccttcagtttctggactgcatttgtggaagtac 
aaccggtggcctgggcctgttgggtctctacatcaatgagaagaatgtagcgctggtcaa 
ccagaacctggagagcttgactgagtattgccagggcccttgccatgaaaatcagacctg 
tatcgctacacatgagtctaatgggattgatatcatcattgctttgattctgaatgacat 
aaaccctcttggtaaataccgaatggacctggtgctccagctaaagaacaatgcatctaa 
acttttgctggccattatggaaagcagacatgacagtgagaatgcagaaagaattctttt 
taacatgagacccagagaactggtggatgtgatgaagaatgcctataaccaaggattgga 
atgtgaccatggggatgatgagggtggagatgatggtgtttctccaaaagatgttggaca 
caatatctatattctggcccatcagttggcccgccacaataaactgttgcagcagatgct 
caaaccaggatcggatccagatgaaggagatgaagccttaaagtattatgccaaccacac 
tgcacagattgagattgtccggcatgataggaccatggaacaaatagtttttcctgtccc 
caatatatgtgaatacctcactcgagaatccaagtgccgtgtgttcaatacaactgaaag 
ggatgaacaaggaagtaaagtgaatgactttttccagcaaacagaagatctctacaatga 
aatgaagtggcagaagaaaatcaggaataaccctgcactgttctggttctcgaggcacat 
ctctctctgggggagcatttccttcaacctggctgtgttcatcaatttagctgttgctct 
cttctacccatttggggatgatggagatgaaggtacactttctccattgttctcggttct 
tctttggatagcagttgcgatctgcacatctatgctgtttttcttctccaagcctgtggg 
tattcggccgtttcttgtatcaataatgctcagatcaatatatacaataggtcttgggcc 
tacattaatacttcttggtgcagctaatctttgtaataaaattgtttttctggtgagttt 
tgttggaaatcgtggcacgttcacccgtgggtaccgagcagtcatcctggatatggcctt 
tctctatcacgtggcgtatgtcctggtttgcatgctgggcctttttgtccatgaattctt 
ctatagcttcctgctttttgatttggtgtacagggaagagactttgctgaatgtcataaa 
aagtgtcacacgaaatggccgctctattattctaactgcagtcctggctctcatcctcgt 
ctacctgttttccattattgggttcctttttttgaaggatgacttcactatggaagttga 
taggctgaaaaaccgaactcctgttacaggcagtcatcaagtgcctactatgactttaac 
taccatgatggaagcatgtgccaaggagaactgttcacccacaattccagcttcaaatac 
agctgatgaagagtatgaagatggaattgaaaggacgtgtgacactctccttatgtgcat 
tgtcaccgtgctgaaccagggcctcaggaatggcggtggtgtgggggatgtgctaagaag 
gccatcgaaagatgagcccttgtttgctgcccgagtggtttatgaccttcttttttattt 
cattgttatcattattgttctgaacttgatttttggtgttatcatcgatacttttgctga 
tctcagaagcgaaaaacagaaaaaagaagaaattctaaagacaacttgtttcatctgtgg 
acttgagagagacaagtttgataataaaacggtttcatttgaggagcacattaagtcaga 
acacaatatgtggcattatttgtacttcatagtcctggtgaaagttaaagacccaacaga 
atacactggacctgaaagttatgtggctcaaatgattgtggagaagaatttggattggtt 
tcctcggatgcgagccatgtccctcgttagcaatgaaggcgacagtgagcaaaatgaaat 
tcggagccttcaggagaagttggaatcgaccatgagtctggtcaaacagctgtcgggtca 
gctggcggagctcaaggagcagatgacagaacaaaggaagaataagcagagactgggctt 
cctcggatcaaacacaccccatgtgaatcatcacatgccaccacactgataccatggggg 
gaagccgtgactagcctttcatcagtgtcctgcctgatcactgaataaagaactgagatg 
gaggggagtgaacagtgcctattgttgaaaagttaaaaacaaccaagtgccaagatgttg 
agtgggttagctccgagaacaatttataactgtgttttcatggttgcgaagacctaacct 
caaatgcatctgctagaaagcgtacatcacacattcgcaatgcatcaggaagaaaaggct 
tgcccaaaaggctggagagggcagggagcggcaggatggaaggagacacggggcagggag 
aactctcttctgctaaatcgataggagtcagttttgtcttaaatgctgactacagccact 
gacatggttggctggaatttctttcttttaattgtggcatataggtttgtgacacaagaa 
gtcatactttggtggctaagttttactaaggaaaataactgaaaagattaaaagtgagag 
ctgaaaagagaaatgataatgcttccaaactgtagctgtcacagggcaatttctttattt 
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Figure 1 (Cont'd) 



ataacatgaagcacaatggatttacagctctaggaacttagtactttggagcttttgcct 
ctcacactgacaacataacaggatgtgattgccttctctgggattcagacaggctctgtc 
aatgtggagcacaaaaggagattttcatataacttgttaaaaacatgttctaagtcatgt 
ataggctaagattttaagaagatctgggggaataaaaagccaacagtaagccccaggaaa 
gggttttttgagaccatatgtatgctattaaaatatattgagattaatacatgaaaatgc 
ttaaaagtgataggaattattgagaaacttatgatggtggcattgccttttataaacatg 
gagtgaaagccatttgactcaacgtttgctgtgcttaaagaattgcttcaggacccgagc 
gtttttatgtatgctgttcctgcagttaggaaaaaaaaatccaagaaatgtattgaatac 
tcaagaaaatgccacatttcaattacatatttaaaactgtatctgtaagggcttttcaaa 
atgtagcaagataaaaatctcttattcaaattgtttttgtattaaatcatgcactcagaa 
tttgtcgagggagagaataacctggtgtgttcaggttattcttagagactacacatttgg 
aatagcagagcaaaatatggataatgaaagtgttgggaaaaaagtgatgtgacagggagt 
gaagacacttgataccagactctgggagatactctgcaagttgacctggcctctccccca 
caggaacaaaacactgcctccagagtctttaaattctcagttatcaacgccaaggtttaa 
ggtctagatagggtttgctatagggtttgctctgacaatttttaaagcttttggattgtt 
ctaacatagcttaagctattggttcctaaaaatcccaaatcaagatctatgtagaatata 
aaggaagcctgaaccaatccttccacatactgttaagatgtagacttggaacaaagctgt 
tgggacccagagcaatgaatttttgaactgaagctactggtactccccagcaccacctca 
tactaagaattcctcactctgacatgacagtattttttctgaccaggagctgaaagaccc 
tgacattcatgatccaaagatataagaaattttgatatgtctgcatgcagcacagaactt 
tgaacctaggaggcagtcaataaatacatgagaaattcagctgtcattcagttactctta 
attctttataaaactttaaatgataccacatatttgtttgtttaaaatggctttcccaaa 
atcaaagtagactaaagcagccatctttaaaatccaggatcatcaatgctattaacagta 
tcaggtaaaaagtacactttaaaatattttaatcaggcagagttttatggatagagaata 
gaagagaaaggtagtaaatattgaacatattccaatataggaacctatctctgttttagt 
acaaaatatttctgacatctgaaagacaggttgctcagatgtatacaaatgaatttattc 
tcttgacctctagttcagatgtcaaaaatattttggtccccccaaccgcgaat 

SEQ ID NO: 430. >Z38709_T8 # TY Transcript # LN 12974 # Source Gene: Z38709 
# Encoded protein: Z38709_P2 

gcggccgcacgcgcccgctcccccgccggggtcccccgccgcccctccagcgccgcccag 
cgtcgcgtccccgcgcgggccggtgcggcggcagcagaggcttcccgggtggcccggcct 
cccgagggtcgccccgccccggcccagctccgcccccggccgctgctccctgattggccg 
cccccccacctctccagggctcctctcctcctcctcgcccaatgggcaccggcacctggt 
ccaaaaattcccctgaggggaccgagggggctcggccacttccagctcccgccggcggct 
gtgacggccgcagcgggtcgcagagcagggagtggacacgagagtggggagcggagaggg 
aaggaggaggaggaggagcaaaggtgttggagagaaaacttcagaaaggaacaggaaacc 
tgccggggagcgcggcggcggctgcggcttctctgggcgcctgggctgcgctcttcgcgg 
ggtgtcccgagctggcgcttggccccggtctggcttccccggcgcgcaccacggctgagc 
gcgacgctcagtggctcgggccgtgcctccgccgtggctcttggccgctgtagtcccgcg 
atccgatccgcttctgccgcggcggctccctggagagagggcggcgagggcgcagggaag 
aagagggacgtccactgtggaacatgaagcagcatgaagcagcatgactgagaaaatgtc 
cagcttcctctacataggggacatcgtgtccctgtacgcggagggctcggtcaacggctt 
catcagcaccttggggttagtggatgacagatgtgtggtgcacccagaggccggggacct 
tgccaaccctcccaagaagttcagagactgccttttcaaggtgtgccctatgaacagata 
ttctgcccagaagcaatattggaaagcaaagcaagccaaacaagggaaccacaccgaggc 
agccttgctgaagaaactacagcacgctgcagaactggaacaaaaacaaaatgaatcgga 
gaataagaaactgttgggagaaattgtaaaatacagtaatgttatacaactactgcatat 
aaaaagcaacaaatatcttactgtcaacaagagattacctgctttactggagaagaatgc 
catgcgtgtgtccttggatgctgcaggaaatgaagggtcttggttttatattcatccatt 
ctggaaactgagaagcgagggtgacaatattgttgtaggagataaagttgttttgatgcc 
tgtgaatgcagggcagccactacatgccagcaacatagagcttcttgataacccagggtg 
taaagaggtgaatgctgtcaattgcaacaccagctggaaaatcactttattcatgaaata 
tagttcctatcgagaggatgtattaaaaggaggggacgttgttagattatttcatgcgga 
acaagagaagtttttgacttgtgatgaatatgagaaaaaacagcacattttccttcgtac 
gaccttgcgccaatcagctacttctgctactagttctaaagcactctgggaaatagaggt 
ggttcatcatgacccatgccgtgggggtgcaggacagtggaacagcttgttcagatttaa 
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Figur 1 (Cont'd) 



gcatcttgcaactggaaactatttagctgcagagcttaatcctgattatcgagatgccca 
aaatgaaggaaaaaatgtgagagatggagtccctccaacttcaaagaaaaaacgccaggc 
aggggagaagatcatgtatactttggtttcagtcccgcatggcaatgacattgcatccct 
ttttgaactagatgccacaactcttcagagagctgactgcctggttccaaggaactcata 
tgttcggttaaggcatttatgcaccaacacatgggtaaccagtactagtatccccataga 
cacagatgaagagaggcctgttatgttaaagattggaacctgccaaaccaaagaagataa 
agaagcgttcgcaatcgtgtctgttccactgtctgaagttcgagacttagactttgccaa 
tgatgccaataaagtactagcgaccacagttaaaaagctagaaaacggcacaataactca 
gaatgaaaggaggtttgtaaccaaattattggaagatctcatattctttgttgctgatgt 
gcctaataatggacaagaagttctggatgtggttatcactaagccaaaccgagagcgtca 
aaaattgatgagggaacaaaacatactggcacaggtatttggaattcttaaagcaccctt 
taaagagaaagcaggagaaggctcgatgctgagacttgaagatctgggggatcaaagata 
tgcaccctacaagtacatgctgcggctctgttaccgcgtcctgagacactcgcagcagga 
ttaccggaaaaatcaggaatatattgctaagaatttctgtgtcatgcagtcccagattgg 
ctatgatattttggcagaagatactatcacagctttgttgcacaacaacagaaaactact 
agagaaacatatcacagcaaaagaaatagaaacatttgtcagtttactcaggagaaatcg 
ggagccaaggtttttggattatttgtcagatctgtgtgtgtctaataccactgctatccc 
tgtaactcaagaactcatctgtaaatttatgttgagtccaggcaatgcagacattctcat 
tcaaactaaggtggtctcaatgcaagcagacaaccccatggagagctccatcctttcaga 
tgacattgatgatgaagaagtttggctctattggattgacagcaacaaggaacctcatgg 
caaagctatcaggcaccttgctcaagaggcaaaagaaggcaccaaagctgacttagaagt 
tcttacctattacaggtaccagctaaacctctttgcaaggatgtgcttggatcgccagta 
tctggccataaaccagatttctacacagctgtctgtagacctgatcctgcggtgtgtgtc 
ggatgagagcctgccgttcgacctccgagcgtccttctgtcgcctcatgctccacatgca 
cgttgaccgggatccccaggagtccgtggtgcctgttcgctatgccaggctctggacaga 
aatccccacaaagatcacaattcatgaatatgattctataacagactcttccagaaatga 
tatgaagaggaaatttgccctgacaatggaatttgttgaagaatatttgaaagaagttgt 
aaaccagccctttccttttggggataaagaaaaaaataaactgacatttgaggtggtcca 
cttggctcggaatcttatatactttggattttatagtttcagtgagttattaaggctaac 
aagaacacttctggctattttagacattgtacaggcccccatgtcatcatactttgaaag 
attaagcaaatttcaagatggaggaaacaatgtgatgagaaccattcatggggtgggaga 
gatgatgacccagatggtactcagtagaggctccatcttccccatgagcgtgccggatgt 
gccacccagcatccacccgagcaagcaagggagccccaccgagcatgaggatgtgactgt 
gatggacaccaagctgaagatcattgagattttgcagtttatcctgagtgtcagactgga 
ttataggatctcatatatgctgtcaatatataagaaggagtttggagaggacaatgacaa 
tgcggagacatctgccagtggatctccagacactttactaccatcagctattgttcctga 
tatagatgaaattgcagctcaggcagaaactatgtttgcgggaagaaaagaaaaaaatcc 
agttcaacttgacgatgaaggaggcaggacgtttttacgggtcctcattcatctgatcat 
gcacgactacccgcctttgctgtctggagccctgcagctgttgtttaagcacttcagcca 
gagggcagaggttttacaggcatttaagcaggtgcaattactggtgtctaatcaagacgt 
agataactacaagcaaatcaaggcagatctagaccagcttcgactgacagtagaaaagtc 
tgagctatgggtggagaagagcagcaactatgagaatggagaaataggggaaagtcaagt 
gaaaggtggtgaagagccaattgaggaatcaaacattttaagtccagtgcaggatggaac 
aaagaaacctcagattgacagcaacaagagcaataactaccggattgtaaaggagatttt 
gatcaggctaagtaaactctgtgtgcagaataaaaagtgtcggaatcaacatcaacgatt 
actgaaaaatatgggggcgcattcggtggtgttggatcttctgcagataccctatgaaaa 
gaatgatgaaaagatgaatgaagtaatgaatctagcccatacatttctgcagaatttctg 
tcgaggaaatccacagaatcaagttcttcttcataaacatctgaatttgtttttaactcc 
aggtctccttgaagcagaaaccatgcggcacatcttcatgaacaattaccatctgtgcaa 
cgaaattagcgagagagttgtacaacactttgtgcactgcattgagacacatggccgcca 
cgtggagtacctgaggtttttgcaaacaattgtaaaagcagatggtaaatatgtgaagaa 
atgccaggatatggtaatgacagagttgataaatgggggtgaagacgtgctgatatttta 
caatgatagagcatcatttccaatccttctccatatgatgtgttcagagagagaccgagg 
ggatgagagtggccccttagcctaccacatcaccctggtggagttgctggcagcatgcac 
agaggggaaaaatgtctacactgaaatcaagtgtaattcccttctcccgctggacgacat 
agtgagggtggtgacccatgacgactgcatccctgaggttaaaattgcttatgtgaactt 
tgttaatcactgttatgttgacactgaagtggaaatgaaagaaatctatacaagtaacca 
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catttggaaattatttgagaacttcttggtggatatggcaagggtttgcaacacaactac 
agacaggaaacatgcagacatctttttggaaaagtgtgttactgagtcaataatgaatat 
tgtgagcggcttctttaattctcccttttcagacaatagtaccagcctccagacacatca 
gccagtttttattcagctactgcaatctgccttcagaatttacaattgcacctggccaaa 
cccagcgcagaaagcctcagtggaatcctgtatcagaactttggctgaagtggcaaaaaa 
tcgtggaattgccattccagtggatttggacagccaagttaatactcttttcatgaagag 
ccattcaaatatggtgcagagagcagcaatgggttggagactatcagctcgctctgggcc 
acgctttaaggaagctcttggagggcctgcttgggattacagaaatattattgaaaagtt 
acaggatgtagtggcctccttggagcaccagttcagcccaatgatgcaggctgaattctc 
agtgttggttgatgtattgtacagtccagaactgctgttccctgagggaagcgatgcaag 
aataagatgtggcgctttcatgtcgaagttgattaatcatacaaagaaactaatggagaa 
agaagaaaaactgtgcattaaaattcttcagacattacgagaaatgttagagaagaaaga 
cagctttgtggaagagggtaacacattaagaaagatacttctgaatcgatactttaaagg 
tgattatagtattggtgtgaatggacacctatcaggagcctactccaaaactgcacaggt 
gggaggaagcttttctggacaagattcagataagatggggatatcaatgtcagacattca 
gtgtctgctggataaagaaggtgcatcagaacttgtcatcgatgttatagtgaacaccaa 
aaatgacagaattttttcagaaggcattttcctcggcattgccttgcttgaaggaggaaa 
tacacaaacacagtattctttctaccagcagttgcatgaacaaaaaaagtcagaaaaatt 
ctttaaagttctctatgatcgaatgaaggctgctcagaaagaaataagatcaacagtgac 
agttaataccatagatttaggtaacaaaaaaagggacgatgacaatgaattgatgacatc 
tggtccacgaatgagagaattttggtcatggggtgaccctaaaacggctctgcaggacta 
aacttaaatgagacataatcagtgcaaagcaaccaacggcatgaaagatgtttgatttca 
ttgtggcatgcatgtaagagattcaacactacatttaaaagagggaatgaaagggcaatt 
aacagaagcttcttcagcaacatccaaagcatattgtgtatacagaagagaaatggatcc 
agaaatagacattatgtgcacaggaccagaagcgggaaacactgaggaaaaatccgcaga 
ggaagtaacaatgagtcccgcaattgccatcatgcagccaatactgagatttcttcagtt 
actgtgtgagaatcacaaccgggaattgcagaacttcttgaggaatcaaaacaacaaaac 
aaattacaacctagtctgtgagacccttcagtttctggactgcatttgtggaagtacaac 
cggtggcctgggcctgttgggtctctacatcaatgagaagaatgtagcgctggtcaacca 
gaacctggagagcttgactgagtattgccagggcccttgccatgaaaatcagacctgtat 
cgctacacatgagtctaatgggattgatatcatcattgctttgattctgaatgacataaa 
ccctcttggtaaataccgaatggacctggtgctccagctaaagaacaatgcatctaaact 
tttgctggccattatggaaagcagacatgacagtgagaatgcagaaagaattctttttaa 
catgagacccagagaactggtggatgtgatgaagaatgcctataaccaaggattggaatg 
tgaccatggggatgatgagggtggagatgatggtgtttctccaaaagatgttggacacaa 
tatctatattctggcccatcagttggcccgccacaataaactgttgcagcagatgctcaa 
accaggatcggatccagatgaaggagatgaagccttaaagtattatgccaaccacactgc 
acagattgagattgtccggcatgataggaccatggaacaaatagtttttcctgtccccaa 
tatatgtgaatacctcactcgagaatccaagtgccgtgtgttcaatacaactgaaaggga 
tgaacaaggaagtaaagtgaatgactttttccagcaaacagaagatctctacaatgaaat 
gaagtggcagaagaaaatcaggaataaccctgcactgttctggttctcgaggcacatctc 
tctctgggggagcatttccttcaacctggctgtgttcatcaatttagctgttgctctctt 
ctacccatttggggatgatggagatgaaggtacactttctccattgttctcggttcttct 
ttggatagcagttgcgatctgcacatctatgctgtttttcttctccaagcctgtgggtat 
tcggccgtttcttgtatcaataatgctcagatcaatatatacaataggtcttgggcctac 
attaatacttcttggtgcagctaatctttgtaataaaattgtttttctggtgagttttgt 
tggaaatcgtggcacgttcacccgtgggtaccgagcagtcatcctggatatggcctttct 
ctatcacgtggcgtatgtcctggtttgcatgctgggcctttttgtccatgaattcttcta 
tagcttcctgctttttgatttggtgtacagggaagagactttgctgaatgtcataaaaag 
tgtcacacgaaatggccgctctattattctaactgcagtcctggctctcatcctcgtcta 
cctgttttccattattgggttcctttttttgaaggatgacttcactatggaagttgatag 
gctgaaaaaccgaactcctgttacaggcagtcatcaagtgcctactatgactttaactac 
catgatggaagcatgtgccaaggagaactgttcacccacaattccagcttcaaatacagc 
tgatgaagagtatgaagatggaattgaaaggacgtgtgacactctccttatgtgcattgt 
caccgtgctgaaccagggcctcaggaatggcggtggtgtgggggatgtgctaagaaggcc 
atcgaaagatgagcccttgtttgctgcccgagtggtttatgaccttcttttttatttcat 
tgttatcattattgttctgaacttgatttttggtgttatcatcgatacttttgctgatct 
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cagaagcgaaaaacagaaaaaagaagaaattctaaagacaacttgtttcatctgtggact 
tgagagagacaagtttgataataaaacggtttcatttgaggagcacattaagtcagaaca 
caatatgtggcattatttgtacttcatagtcctggtgaaagttaaagacccaacagaata 
cactggacctgaaagttatgtggctcaaatgattgtggagaagaatttggattggtttcc 
tcggatgcgagccatgtccctcgttagcaatgaaggcgacagtgagcaaaatgaaattcg 
gagccttcaggagaagttggaatcgaccatgagtctggtcaaacagctgtcgggtcagct 
ggcggagctcaaggagcagatgacagaacaaaggaagaataagcagagactgggcttcct 
cggatcaaacacaccccatgtgaatcatcacatgccaccacactgataccatggggggaa 
gccgtgactagcctttcatcagtgtcctgcctgatcactgaataaagaactgagatggag 
gggagtgaacagtgcctattgttgaaaagttaaaaacaaccaagtgccaagatgttgagt 
gggttagctccgagaacaatttataactgtgttttcatggttgcgaagacctaacctcaa 
atgcatctgctagaaagcgtacatcacacattcgcaatgcatcaggaagaaaaggcttgc 
ccaaaaggctggagagggcagggagcggcaggatggaaggagacacggggcagggagaac 
tctcttctgctaaatcgataggagtcagttttgtcttaaatgctgactacagccactgac 
atggttggctggaatttctttcttttaattgtggcatataggtttgtgacacaagaagtc 
atactttggtggctaagttttactaaggaaaataactgaaaagattaaaagtgagagctg 
aaaagagaaatgataatgcttccaaactgtagctgtcacagggcaatttctttatttata 
acatgaagcacaatggatttacagctctaggaacttagtactttggagcttttgcctctc 
acactgacaacataacaggatgtgattgccttctctgggattcagacaggctctgtcaat 
gtggagcacaaaaggagattttcatataacttgttaaaaacatgttctaagtcatgtata 
ggctaagattttaagaagatctgggggaataaaaagccaacagtaagccccaggaaaggg 
ttttttgagaccatatgtatgctattaaaatatattgagattaatacatgaaaatgctta 
aaagtgataggaattattgagaaacttatgatggtggcattgccttttataaacatggag 
tgaaagccatttgactcaacgtttgctgtgcttaaagaattgcttcaggacccgagcgtt 
tttatgtatgctgttcctgcagttaggaaaaaaaaatccaagaaatgtattgaatactca 
agaaaatgccacatttcaattacatatttaaaactgtatctgtaagggcttttcaaaatg 
tagcaagataaaaatctcttattcaaattgtttttgtattaaatcatgcactcagaattt 
gtcgagggagagaataacctggtgtgttcaggttattcttagagactacacatttggaat 
agcagagcaaaatatggataatgaaagtgttgggaaaaaagtgatgtgacagggagtgaa 
gacacttgataccagactctgggagatactctgcaagttgacctggcctctcccccacag 
gaacaaaacactgcctccagagtctttaaattctcagttatcaacgccaaggtttaaggt 
ctagatagggtttgctatagggtttgctctgacaatttttaaagcttttggattgttcta 
acatagcttaagctattggttcctaaaaatcccaaatcaagatctatgtagaatataaag 
gaagcctgaaccaatccttccacatactgttaagatgtagacttggaacaaagctgttgg 
gacccagagcaatgaatttttgaactgaagctactggtactccccagcaccacctcatac 
taagaattcctcactctgacatgacagtattttttctgaccaggagctgaaagaccctga 
cattcatgatccaaagatataagaaattttgatatgtctgcatgcagcacagaactttga 
acctaggaggcagtcaataaatacatgagaaattcagctgtcattcagttactcttaatt 
ctttataaaactttaaatgataccacatatttgtttgtttaaaatggctttcccaaaatc 
aaagtagactaaagcagccatctttaaaatccaggatcatcaatgctattaacagtatca 
ggtaaaaagtacactttaaaatattttaatcaggcagagttttatggatagagaatagaa 
gagaaaggtagtaaatattgaacatattccaatataggaacctatctctgttttagtaca 
aaatatttctgacatctgaactagaggtcaagagaataaattcatttgtatacatctgag 
caacctgtctttcagatgataaagtatctagccttttctgacaccataatagttcatttt 
gtagggaataagccattaggtgtatataattgctttctagaaatgacctaatgtccccaa 
ccactttgtagtggcagatcactgtttcacagcatattttctcccaaggaaagtattcaa 
aagagactgcaactaacaagactcttatttcatcaaaatttaaatatttctgagttgtat 
ttttaatgcctctttcttttctgcctaaatgcttagaaattataaagcaaaaaaaaaaaa 
aaaaacagcaacaaaaaatcgaagcagacaaaaaaggcacttttcagaacatcaaattcc 
taatgaagaagaggagaagataatggggaaatttgacatttgatataaatttatatttgt 
tatgtgtatgtttgttaatgcaactggaatatttgacttaggtgagtatcagttaacatc 
cttgtatttatatagtgacatcaaaataaatgcaatcatcttgaactttgatgttaaggg 
agattttgaaaaaatatttagttatttcaaattcatattggttcaaaagtatcagtttct 
gctgaataatatttttaatttcaaaagggttttgttccctctgtgctccattctgaggct 
caagagcttaatgcaagtatgttttctgagttaaaataacacttttagatgagaaaatgc 
ttgtatcatacagggctataatataaataaataaattgtatgtatgcaaaatttatcata 
ttttctaccccttaaaaaattaagttagaaataagcttattttcttgcacatcaatattt 
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ttctgttggtaaagaggacacaatttctagtagattgttaataacatcaggaagatattt 
ctttcgtcagaactaattgtgtgcttattcccatattctcagctcataactccctgtttt 
ggctgctctctttattttacaattgtttcattgcaaacagcaaggcagtatagcccacac 
tcagccactagcccccagtccccaggactgagaatgagtggggaggctggggagttggtg 
agagaaggtggagatagggatttctgcttcccagtactctatcattagagaaaaggaggg 
gtaaatattagctaagaagaagaaaaaagcttattcttcagggacccagtagattggacg 
taagcaagtaaatattgcacaggcatcatgatgtaatagttgatggaccccagagatacc 
cctcacccatgcagagaacaaagctgtgtggaagagcaactaccataagatggtgtgttc 
cttccagcggctgttaaagccatcctgaataacaaatcagctattcctacctacagaccc 
attccaatatgctgtgatacacgacacggtgcaccgcagaggagacagacatcttcagca 
caaacacagatgcctgcaggagcttggcaagtcactaaattgcatgaaattgtgaggtgc 
acaccaaatgcagggatgggggaggccgtgggaagctctgtattctcaaaatttgacagc 
taattccggtttttagaaaatgcctgaggccagtagaggccctctagtcactcactgctg 
ctgtttctgatataattattgagaaagctatctcacttaatagaagaaaacacgcactat 
caaaaccagatagcctaacgtgcatgtgaaaatcgagaaagctgaaaacaaatccaggta 
cccttctctgaactggagtgtttccacagacttgaataatttatgaaattatcacaccag 
tatttctcatatcaccaagaagactttctctcctgcagtagaggattgttatatttgcct 
aaaaaacacgattccaatatatgacaagggcagataatttataagtgaatgttaataaaa 
ttggatgtgtataa 

SEQ ID NO: 431. >Z38709_T9 # TY Transcript # LN 11748 # Source Gene: Z38709 
# Encoded protein: Z38709_P3 

gtatgtgctacacaggacatttgctgggagaaatgatggtagtcttactgaaattctaga 
tgcaaaacctatgtattcactggtgttcataatttagaatgtgactacacatgttaatac 
ttactctaagtgttttactgaatttcccccccacatttttaaatgtttaaataagcatta 
aataaatttaaataagctttaaaagtcaaatagctagtagaagggcccagagtagcaaaa 
ccagaaagagacacctgcttctcacatctgtttgttcagatgtctcttgacctttttgct 
agggggacgttgttagattatttcatgcggaacaagagaagtttttgacttgtgatgaat 
atgagaaaaaacagcacattttccttcgtacgaccttgcgccaatcagctacttctgcta 
ctagttctaaagcactctgggaaatagaggtggttcatcatgacccatgccgtgggggtg 
caggacagtggaacagcttgttcagatttaagcatcttgcaactggaaactatttagctg 
cagagcttaatcctgattatcgagatgcccaaaatgaaggaaaaaatgtgagagatggag 
tccctccaacttcaaagaaaaaacgccaggcaggggagaagatcatgtatactttggttt 
cagtcccgcatggcaatgacattgcatccctttttgaactagatgccacaactcttcaga 
gagctgactgcctggttccaaggaactcatatgttcggttaaggcatttatgcaccaaca 
catgggtaaccagtactagtatccccatagacacagatgaagagaggcctgttatgttaa 
agattggaacctgccaaaccaaagaagataaagaagcgttcgcaatcgtgtctgttccac 
tgtctgaagttcgagacttagactttgccaatgatgccaataaagtactagcgaccacag 
ttaaaaagctagaaaacggcacaataactcagaatgaaaggaggtttgtaaccaaattat 
tggaagatctcatattctttgttgctgatgtgcctaataatggacaagaagttctggatg 
tggttatcactaagccaaaccgagagcgtcaaaaattgatgagggaacaaaacatactgg 
cacaggtatttggaattcttaaagcaccctttaaagagaaagcaggagaaggctcgatgc 
tgagacttgaagatctgggggatcaaagatatgcaccctacaagtacatgctgcggctct 
gttaccgcgtcctgagacactcgcagcaggattaccggaaaaatcaggaatatattgcta 
agaatttctgtgtcatgcagtcccagattggctatgatattttggcagaagatactatca 
cagctttgttgcacaacaacagaaaactactagagaaacatatcacagcaaaagaaatag 
aaacatttgtcagtttactcaggagaaatcgggagccaaggtttttggattatttgtcag 
atctgtgtgtgtctaataccactgctatccctgtaactcaagaactcatctgtaaattta 
tgttgagtccaggcaatgcagacattctcattcaaactaaggtggtctcaatgcaagcag 
acaaccccatggagagctccatcctttcagatgacattgatgatgaagaagtttggctct 
attggattgacagcaacaaggaacctcatggcaaagctatcaggcaccttgctcaagagg 
caaaagaaggcaccaaagctgacttagaagttcttacctattacaggtaccagctaaacc 
tctttgcaaggatgtgcttggatcgccagtatctggccataaaccagatttctacacagc 
tgtctgtagacctgatcctgcggtgtgtgtcggatgagagcctgccgttcgacctccgag 
cgtccttctgtcgcctcatgctccacatgcacgttgaccgggatccccaggagtccgtgg 
tgcctgttcgctatgccaggctctggacagaaatccccacaaagatcacaattcatgaat 
atgattctataacagactcttccagaaatgatatgaagaggaaatttgccctgacaatgg 
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aatttgttgaagaatatttgaaagaagttgtaaaccagccctttccttttggggataaag 
aaaaaaataaactgacatttgaggtggtccacttggctcggaatcttatatactttggat 
tttatagtttcagtgagttattaaggctaacaagaacacttctggctattttagacattg 
tacaggcccccatgtcatcatactttgaaagattaagcaaatttcaagatggaggaaaca 
atgtgatgagaaccattcatggggtgggagagatgatgacccagatggtactcagtagag 
gctccatcttccccatgagcgtgccggatgtgccacccagcatccacccgagcaagcaag 
ggagccccaccgagcatgaggatgtgactgtgatggacaccaagctgaagatcattgaga 
ttttgcagtttatcctgagtgtcagactggattataggatctcatatatgctgtcaatat 
ataagaaggagtttggagaggacaatgacaatgcggagacatctgccagtggatctccag 
acactttactaccatcagctattgttcctgatatagatgaaattgcagctcaggcagaaa 
ctatgtttgcgggaagaaaagaaaaaaatccagttcaacttgacgatgaaggaggcagga 
cgtttttacgggtcctcattcatctgatcatgcacgactacccgcctttgctgtctggag 
ccctgcagctgttgtttaagcacttcagccagagggcagaggttttacaggcatttaagc 
aggtgcaattactggtgtctaatcaagacgtagataactacaagcaaatcaaggcagatc 
tagaccagcttcgactgacagtagaaaagtctgagctatgggtggagaagagcagcaact 
atgagaatggagaaataggggaaagtcaagtgaaaggtggtgaagagccaattgaggaat 
caaacattttaagtccagtgcaggatggaacaaagaaacctcagattgacagcaacaaga 
gcaataactaccggattgtaaaggagattttgatcaggctaagtaaactctgtgtgcaga 
ataaaaagtgtcggaatcaacatcaacgattactgaaaaatatgggggcgcattcggtgg 
tgttggatcttctgcagataccctatgaaaagaatgatgaaaagatgaatgaagtaatga 
atctagcccatacatttctgcagaatttctgtcgaggaaatccacagaatcaagttcttc 
ttcataaacatctgaatttgtttttaactccaggtctccttgaagcagaaaccatgcggc 
acatcttcatgaacaattaccatctgtgcaacgaaattagcgagagagttgtacaacact 
ttgtgcactgcattgagacacatggccgccacgtggagtacctgaggtttttgcaaacaa 
ttgtaaaagcagatggtaaatatgtgaagaaatgccaggatatggtaatgacagagttga 
taaatgggggtgaagacgtgctgatattttacaatgatagagcatcatttccaatccttc 
tccatatgatgtgttcagagagagaccgaggggatgagagtggccccttagcctaccaca 
tcaccctggtggagttgctggcagcatgcacagaggggaaaaatgtctacactgaaatca 
agtgtaattcccttctcccgctggacgacatagtgagggtggtgacccatgacgactgca 
tccctgaggttaaaattgcttatgtgaactttgttaatcactgttatgttgacactgaag 
tggaaatgaaagaaatctatacaagtaaccacatttggaaattatttgagaacttcttgg 
tggatatggcaagggtttgcaacacaactacagacaggaaacatgcagacatctttttgg 
aaaagtgtgttactgagtcaataatgaatattgtgagcggcttctttaattctccctttt 
cagacaatagtaccagcctccagacacatcagccagtttttattcagctactgcaatctg 
ccttcagaatttacaattgcacctggccaaacccagcgcagaaagcctcagtggaatcct 
gtatcagaactttggctgaagtggcaaaaaatcgtggaattgccattccagtggatttgg 
acagccaagttaatactcttttcatgaagagccattcaaatatggtgcagagagcagcaa 
tgggttggagactatcagctcgctctgggccacgctttaaggaagctcttggagggcctg 
cttgggattacagaaatattattgaaaagttacaggatgtagtggcctccttggagcacc 
agttcagcccaatgatgcaggctgaattctcagtgttggttgatgtattgtacagtccag 
aactgctgttccctgagggaagcgatgcaagaataagatgtggcgctttcatgtcgaagt 
tgattaatcatacaaagaaactaatggagaaagaagaaaaactgtgcattaaaattcttc 
agacattacgagaaatgttagagaagaaagacagctttgtggaagagggtaacacattaa 
gaaagatacttctgaatcgatactttaaaggtgattatagtattggtgtgaatggacacc 
tatcaggagcctactccaaaactgcacaggtgggaggaagcttttctggacaagattcag 
ataagatggggatatcaatgtcagacattcagtgtctgctggataaagaaggtgcatcag 
aacttgtcatcgatgttatagtgaacaccaaaaatgacagaattttttcagaaggcattt 
tcctcggcattgccttgcttgaaggaggaaatacacaaacacagtattctttctaccagc 
agttgcatgaacaaaaaaagtcagaaaaattctttaaagttctctatgatcgaatgaagg 
ctgctcagaaagaaataagatcaacagtgacagttaataccatagatttaggtaacaaaa 
aaagggacgatgacaatgaattgatgacatctggtccacgaatgagagtaagagattcaa 
cactacatttaaaagagggaatgaaagggcaattaacagaagcttcttcagcaacatcca 
aagcatattgtgtatacagaagagaaatggatccagaaatagacattatgtgcacaggac 
cagaagcgggaaacactgaggaaaaatccgcagaggaagtaacaatgagtcccgcaattg 
ccatcatgcagccaatactgagatttcttcagttactgtgtgagaatcacaaccgggaat 
tgcagaacttcttgaggaatcaaaacaacaaaacaaattacaacctagtctgtgagaccc 
ttcagtttctggactgcatttgtggaagtacaaccggtggcctgggcctgttgggtctct 
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acatcaatgagaagaatgtagcgctggtcaaccagaacctggagagcttgactgagtatt 
gccagggcccttgccatgaaaatcagacctgtatcgctacacatgagtctaatgggattg 
atatcatcattgctttgattctgaatgacataaaccctcttggtaaataccgaatggacc 
tggtgctccagctaaagaacaatgcatctaaacttttgctggccattatggaaagcagac 
atgacagtgagaatgcagaaagaattctttttaacatgagacccagagaactggtggatg 
tgatgaagaatgcctataaccaaggattggaatgtgaccatggggatgatgagggtggag 
atgatggtgtttctccaaaagatgttggacacaatatctatattctggcccatcagttgg 
cccgccacaataaactgttgcagcagatgctcaaaccaggatcggatccagatgaaggag 
atgaagccttaaagtattatgccaaccacactgcacagattgagattgtccggcatgata 
ggaccatggaacaaatagtttttcctgtccccaatatatgtgaatacctcactcgagaat 
ccaagtgccgtgtgttcaatacaactgaaagggatgaacaaggaagtaaagtgaatgact 
ttttccagcaaacagaagatctctacaatgaaatgaagtggcagaagaaaatcaggaata 
accctgcactgttctggttctcgaggcacatctctctctgggggagcatttccttcaacc 
tggctgtgttcatcaatttagctgttgctctcttctacccatttggggatgatggagatg 
aaggtacactttctccattgttctcggttcttctttggatagcagttgcgatctgcacat 
ctatgctgtttttcttctccaagcctgtgggtattcggccgtttcttgtatcaataatgc 
tcagatcaatatatacaataggtcttgggcctacattaatacttcttggtgcagctaatc 
tttgtaataaaattgtttttctggtgagttttgttggaaatcgtggcacgttcacccgtg 
ggtaccgagcagtcatcctggatatggcctttctctatcacgtggcgtatgtcctggttt 
gcatgctgggcctttttgtccatgaattcttctatagcttcctgctttttgatttggtgt 
acagggaagagactttgctgaatgtcataaaaagtgtcacacgaaatggccgctctatta 
ttctaactgcagtcctggctctcatcctcgtctacctgttttccattattgggttccttt 
ttttgaaggatgacttcactatggaagttgataggctgaaaaaccgaactcctgttacag 
gcagtcatcaagtgcctactatgactttaactaccatgatggaagcatgtgccaaggaga 
actgttcacccacaattccagcttcaaatacagctgatgaagagtatgaagatggaattg 
aaaggacgtgtgacactctccttatgtgcattgtcaccgtgctgaaccagggcctcagga 
atggcggtggtgtgggggatgtgctaagaaggccatcgaaagatgagcccttgtttgctg 
cccgagtggtttatgaccttcttttttatttcattgttatcattattgttctgaacttga 
tttttggtgttatcatcgatacttttgctgatctcagaagcgaaaaacagaaaaaagaag 
aaattctaaagacaacttgtttcatctgtggacttgagagagacaagtttgataataaaa 
cggtttcatttgaggagcacattaagtcagaacacaatatgtggcattatttgtacttca 
tagtcctggtgaaagttaaagacccaacagaatacactggacctgaaagttatgtggctc 
aaatgattgtggagaagaatttggattggtttcctcggatgcgagccatgtccctcgtta 
gcaatgaaggcgacagtgagcaaaatgaaattcggagccttcaggagaagttggaatcga 
ccatgagtctggtcaaacagctgtcgggtcagctggcggagctcaaggagcagatgacag 
aacaaaggaagaataagcagagactgggcttcctcggatcaaacacaccccatgtgaatc 
atcacatgccaccacactgataccatggggggaagccgtgactagcctttcatcagtgtc 
ctgcctgatcactgaataaagaactgagatggaggggagtgaacagtgcctattgttgaa 
aagttaaaaacaaccaagtgccaagatgttgagtgggttagctccgagaacaatttataa 
ctgtgttttcatggttgcgaagacctaacctcaaatgcatctgctagaaagcgtacatca 
cacattcgcaatgcatcaggaagaaaaggcttgcccaaaaggctggagagggcagggagc 
ggcaggatggaaggagacacggggcagggagaactctcttctgctaaatcgataggagtc 
agttttgtcttaaatgctgactacagccactgacatggttggctggaatttctttctttt 
aattgtggcatataggtttgtgacacaagaagtcatactttggtggctaagttttactaa 
ggaaaataactgaaaagattaaaagtgagagctgaaaagagaaatgataatgcttccaaa 
ctgtagctgtcacagggcaatttctttatttataacatgaagcacaatggatttacagct 
ctaggaacttagtactttggagcttttgcctctcacactgacaacataacaggatgtgat 
tgccttctctgggattcagacaggctctgtcaatgtggagcacaaaaggagattttcata 
taacttgttaaaaacatgttctaagtcatgtataggctaagattttaagaagatctgggg 
gaataaaaagccaacagtaagccccaggaaagggttttttgagaccatatgtatgctatt 
aaaatatattgagattaatacatgaaaatgcttaaaagtgataggaattattgagaaact 
tatgatggtggcattgccttttataaacatggagtgaaagccatttgactcaacgtttgc 
tgtgcttaaagaattgcttcaggacccgagcgtttttatgtatgctgttcctgcagttag 
gaaaaaaaaatccaagaaatgtattgaatactcaagaaaatgccacatttcaattacata 
tttaaaactgtatctgtaagggcttttcaaaatgtagcaagataaaaatctcttattcaa 
attgtttttgtattaaatcatgcactcagaatttgtcgagggagagaataacctggtgtg 
ttcaggttattcttagagactacacatttggaatagcagagcaaaatatggataatgaaa 
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gtgttgggaaaaaagtgatgtgacagggagtgaagacacttgataccagactctgggaga 
tactctgcaagttgacctggcctctcccccacaggaacaaaacactgcctccagagtctt 
taaattctcagttatcaacgccaaggtttaaggtctagatagggtttgctatagggtttg 
ctctgacaatttttaaagcttttggattgttctaacatagcttaagctattggttcctaa 
aaatcccaaatcaagatctatgtagaatataaaggaagcctgaaccaatccttccacata 
ctgttaagatgtagacttggaacaaagctgttgggacccagagcaatgaatttttgaact 
gaagctactggtactccccagcaccacctcatactaagaattcctcactctgacatgaca 
gtattttttctgaccaggagctgaaagaccctgacattcatgatccaaagatataagaaa 
ttttgatatgtctgcatgcagcacagaactttgaacctaggaggcagtcaataaatacat 
gagaaattcagctgtcattcagttactcttaattctttataaaactttaaatgataccac 
atatttgtttgtttaaaatggctttcccaaaatcaaagtagactaaagcagccatcttta 
aaatccaggatcatcaatgctattaacagtatcaggtaaaaagtacactttaaaatattt 
taatcaggcagagttttatggatagagaatagaagagaaaggtagtaaatattgaacata 
ttccaatataggaacctatctctgttttagtacaaaatatttctgacatctgaactagag 
gtcaagagaataaattcatttgtatacatctgagcaacctgtctttcagatgataaagta 
tctagccttttctgacaccataatagttcattttgtagggaataagccattaggtgtata 
taattgctttctagaaatgacctaatgtccccaaccactttgtagtggcagatcactgtt 
tcacagcatattttctcccaaggaaagtattcaaaagagactgcaactaacaagactctt 
atttcatcaaaatttaaatatttctgagttgtatttttaatgcctctttcttttctgcct 
aaatgcttagaaattataaagcaaaaaaaaaaaaaaaaacagcaacaaaaaatcgaagca 
gacaaaaaaggcacttttcagaacatcaaattcctaatgaagaagaggagaagataatgg 
ggaaatttgacatttgatataaatttatatttgttatgtgtatgtttgttaatgcaactg 
gaatatttgacttaggtgagtatcagttaacatccttgtatttatatagtgacatcaaaa 
taaatgcaatcatcttgaactttgatgttaagggagattttgaaaaaatatttagttatt 
tcaaattcatattggttcaaaagtatcagtttctgctgaataatatttttaatttcaaaa 
gggttttgttccctctgtgctccattctgaggctcaagagcttaatgcaagtatgttttc 
tgagttaaaataacacttttagatgagaaaatgcttgtatcatacagggctataatataa 
ataaataaattgtatgtatgcaaaatttatcatattttctaccccttaaaaaattaagtt 
agaaataagcttattttcttgcacatcaatatttttctgttggtaaagaggacacaattt 
ctagtagattgttaataacatcaggaagatatttctttcgtcagaactaattgtgtgctt 
attcccatattctcagctcataactccctgttttggctgctctctttattttacaattgt 
ttcattgcaaacagcaaggcagtatagcccacactcagccactagcccccagtccccagg 
actgagaatgagtggggaggctggggagttggtgagagaaggtggagatagggatttctg 
cttcccagtactctatcattagagaaaaggaggggtaaatattagctaagaagaagaaaa 
aagcttattcttcagggacccagtagattggacgtaagcaagtaaatattgcacaggcat 
catgatgtaatagttgatggaccccagagatacccctcacccatgcagagaacaaagctg 
tgtggaagagcaactaccataagatggtgtgttccttccagcggctgttaaagccatcct 
gaataacaaatcagctattcctacctacagacccattccaatatgctgtgatacacgaca 
cggtgcaccgcagaggagacagacatcttcagcacaaacacagatgcctgcaggagcttg 
gcaagtcactaaattgcatgaaattgtgaggtgcacaccaaatgcagggatgggggaggc 
cgtgggaagctctgtattctcaaaatttgacagctaattccggtttttagaaaatgcctg 
aggccagtagaggccctctagtcactcactgctgctgtttctgatataattattgagaaa 
gctatctcacttaatagaagaaaacacgcactatcaaaaccagatagcctaacgtgcatg 
tgaaaatcgagaaagctgaaaacaaatccaggtacccttctctgaactggagtgtttcca 
cagacttgaataatttatgaaattatcacaccagtatttctcatatcaccaagaagactt 
tctctcctgcagtagaggattgttatatttgcctaaaaaacacgattccaatatatgaca 
agggcagataatttataagtgaatgttaataaaattggatgtgtataa 

SEQ ID NO: 4 32. >Z38709_T10 # TY Transcript # LN 12514 # Source Gene: 

Z38709 # Encoded protein: Z38709_P4 

gcggccgcacgcgcccgctcccccgccggggtcccccgccgcccctccagcgccgcccag 
cgtcgcgtccccgcgcgggccggtgcggcggcagcagaggcttcccgggtggcccggcct 
cccgagggtcgccccgccccggcccagctccgcccccggccgctgctccctgattggccg 
cccccccacctctccagggctcctctcctcctcctcgcccaatgggcaccggcacctggt 
ccaaaaattcccctgaggggaccgagggggctcggccacttccagctcccgccggcggct 
gtgacggccgcagcgggtcgcagagcagggagtggacacgagagtggggagcggagaggg 
aaggaggaggaggaggagcaaaggtgttggagagaaaacttcagaaaggaacaggaaacc 
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tgccggggagcgcggcggcggctgcggcttctctgggcgcctgggctgcgctcttcgcgg 
ggtgtcccgagctggcgcttggccccggtctggcttccccggcgcgcaccacggctgagc 
gcgacgctcagtggctcgggccgtgcctccgccgtggctcttggccgctgtagtcccgcg 
atccgatccgcttctgccgcggcggctccctggagagagggcggcgagggcgcagggaag 
aagagggacgtccactgtggaacatgaagcagcatgaagcagcatgactgagaaaatgtc 
cagcttcctctacataggggacatcgtgtccctgtacgcggagggctcggtcaacggctt 
catcagcaccttggggttagtggatgacagatgtgtggtgcacccagaggccggggacct 
tgccaaccctcccaagaagttcagagactgccttttcaaggtgtgccctatgaacagata 
ttctgcccagaagcaatattggaaagcaaagcaagccaaacaagggaaccacaccgaggc 
agccttgctgaagaaactacagcacgctgcagaactggaacaaaaacaaaatgaatcgga 
gaataagaaactgttgggagaaattgtaaaatacagtaatgttatacaactactgcatat 
aaaaagcaacaaatatcttactgtcaacaagagattacctgctttactggagaagaatgc 
catgcgtgtgtccttggatgctgcaggaaatgaagggtcttggttttatattcatccatt 
ctggaaactgagaagcgagggtgacaatattgttgtaggagataaagttgttttgatgcc 
tgtgaatgcagggcagccactacatgccagcaacatagagcttcttgataacccagggtg 
taaagaggtgaatgctgtcaattgcaacaccagctggaaaatcactttattcatgaaata 
tagttcctatcgagaggatgtattaaaaggaggggacgttgttagattatttcatgcgga 
acaagagaagtttttgacttgtgatgaatatgagaaaaaacagcacattttccttcgtac 
gaccttgcgccaatcagctacttctgctactagttctaaagcactctgggaaatagaggt 
ggttcatcatgacccatgccgtgggggtgcaggacagtggaacagcttgttcagatttaa 
gcatcttgcaactggaaactatttagctgcagagcttaatcctgattatcgagatgccca 
aaatgaaggaaaaaatgtgagagatggagtccctccaacttcaaagaaaaaacgccaggc 
aggggagaagatcatgtatactttggtttcagtcccgcatggcaatgacattgcatccct 
ttttgaactagatgccacaactcttcagagagctgactgcctggttccaaggaactcata 
tgttcggttaaggcatttatgcaccaacacatgggtaaccagtactagtatccccataga 
cacagatgaagagaggcctgttatgttaaagattggaacctgccaaaccaaagaagataa 
agaagcgttcgcaatcgtgtctgttccactgtctgaagttcgagacttagactttgccaa 
tgatgccaataaagtactagcgaccacagttaaaaagctagaaaacggcacaataactca 
gaatgaaaggaggtttgtaaccaaattattggaagatctcatattctttgttgctgatgt 
gcctaataatggacaagaagttctggatgtggttatcactaagccaaaccgagagcgtca 
aaaattgatgagggaacaaaacatactggcacaggtatttggaattcttaaagcaccctt 
taaagagaaagcaggagaaggctcgatgctgagacttgaagatctgggggatcaaagata 
tgcaccctacaagtacatgctgcggctctgttaccgcgtcctgagacactcgcagcagga 
ttaccggaaaaatcaggaatatattgctaagaatttctgtgtcatgcagtcccagattgg 
ctatgatattttggcagaagatatatcaggcaccttgctcaagaggcaaaagaaggcacc 
aaagctgacttagaagttcttacctattacaggtaccagctaaacctctttgcaaggatg 
tgcttggatcgccagtatctggccataaaccagatttctacacagctgtctgtagacctg 
atcctgcggtgtgtgtcggatgagagcctgccgttcgacctccgagcgtccttctgtcgc 
ctcatgctccacatgcacgttgaccgggatccccaggagtccgtggtgcctgttcgctat 
gccaggctctggacagaaatccccacaaagatcacaattcatgaatatgattctataaca 
gactcttccagaaatgatatgaagaggaaatttgccctgacaatggaatttgttgaagaa 
tatttgaaagaagttgtaaaccagccctttccttttggggataaagaaaaaaataaactg 
acatttgaggtggtccacttggctcggaatcttatatactttggattttatagtttcagt 
gagttattaaggctaacaagaacacttctggctattttagacattgtacaggcccccatg 
tcatcatactttgaaagattaagcaaatttcaagatggaggaaacaatgtgatgagaacc 
attcatggggtgggagagatgatgacccagatggtactcagtagaggctccatcttcccc 
atgagcgtgccggatgtgccacccagcatccacccgagcaagcaagggagccccaccgag 
catgaggatgtgactgtgatggacaccaagctgaagatcattgagattttgcagtttatc 
ctgagtgtcagactggattataggatctcatatatgctgtcaatatataagaaggagttt 
ggagaggacaatgacaatgcggagacatctgccagtggatctccagacactttactacca 
tcagctattgttcctgatatagatgaaattgcagctcaggcagaaactatgtttgcggga 
agaaaagaaaaaaatccagttcaacttgacgatgaaggaggcaggacgtttttacgggtc 
ctcattcatctgatcatgcacgactacccgcctttgctgtctggagccctgcagctgttg 
tttaagcacttcagccagagggcagaggttttacaggcatttaagcaggtgcaattactg 
gtgtctaatcaagacgtagataactacaagcaaatcaaggcagatctagaccagcttcga 
ctgacagtagaaaagtctgagctatgggtggagaagagcagcaactatgagaatggagaa 
ataggggaaagtcaagtgaaaggtggtgaagagccaattgaggaatcaaacattttaagt 
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ccagtgcaggatggaacaaagaaacctcagattgacagcaacaagagcaataactaccgg 
attgtaaaggagattttgatcaggctaagtaaactctgtgtgcagaataaaaagtgtcgg 
aatcaacatcaacgattactgaaaaatatgggggcgcattcggtggtgttggatcttctg 
cagataccctatgaaaagaatgatgaaaagatgaatgaagtaatgaatctagcccataca 
tttctgcagaatttctgtcgaggaaatccacagaatcaagttcttcttcataaacatctg 
aatttgtttttaactccaggtctccttgaagcagaaaccatgcggcacatcttcatgaac 
aattaccatctgtgcaacgaaattagcgagagagttgtacaacactttgtgcactgcatt 
gagacacatggccgccacgtggagtacctgaggtttttgcaaacaattgtaaaagcagat 
ggtaaatatgtgaagaaatgccaggatatggtaatgacagagttgataaatgggggtgaa 
gacgtgctgatattttacaatgatagagcatcatttccaatccttctccatatgatgtgt 
tcagagagagaccgaggggatgagagtggccccttagcctaccacatcaccctggtggag 
ttgctggcagcatgcacagaggggaaaaatgtctacactgaaatcaagtgtaattccctt 
ctcccgctggacgacatagtgagggtggtgacccatgacgactgcatccctgaggttaaa 
attgcttatgtgaactttgttaatcactgttatgttgacactgaagtggaaatgaaagaa 
atctatacaagtaaccacatttggaaattatttgagaacttcttggtggatatggcaagg 
gtttgcaacacaactacagacaggaaacatgcagacatctttttggaaaagtgtgttact 
gagtcaataatgaatattgtgagcggcttctttaattctcccttttcagacaatagtacc 
agcctccagacacatcagccagtttttattcagctactgcaatctgccttcagaatttac 
aattgcacctggccaaacccagcgcagaaagcctcagtggaatcctgtatcagaactttg 
gctgaagtggeaaaaaatcgtggaattgccattccagtggatttggacagccaagttaat 
actcttttcatgaagagccattcaaatatggtgcagagagcagcaatgggttggagacta 
tcagctcgctctgggccacgctttaaggaagctcttggagggcctgcttgggattacaga 
aatattattgaaaagttacaggatgtagtggcctccttggagcaccagttcagcccaatg 
atgcaggctgaattctcagtgttggttgatgtattgtacagtccagaactgctgttccct 
gagggaagcgatgcaagaataagatgtggcgctttcatgtcgaagttgattaatcataca 
aagaaactaatggagaaagaagaaaaactgtgcattaaaattcttcagacattacgagaa 
atgttagagaagaaagacagctttgtggaagagggtaacacattaagaaagatacttctg 
aatcgatactttaaaggtgattatagtattggtgtgaatggacacctatcaggagcctac 
tccaaaactgcacaggtgggaggaagcttttctggacaagattcagataagatggggata 
tcaatgtcagacattcagtgtctgctggataaagaaggtgcatcagaacttgtcatcgat 
gttatagtgaacaccaaaaatgacagaattttttcagaaggcattttcctcggcattgcc 
ttgcttgaaggaggaaatacacaaacacagtattctttctaccagcagttgcatgaacaa 
aaaaagtcagaaaaattctttaaagttctctatgatcgaatgaaggctgctcagaaagaa 
ataagatcaacagtgacagttaataccatagatttaggtaacaaaaaaagggacgatgac 
aatgaattgatgacatctggtccacgaatgagagtaagagattcaacactacatttaaaa 
gagggaatgaaagggcaattaacagaagcttcttcagcaacatccaaagcatattgtgta 
tacagaagagaaatggatccagaaatagacattatgtgcacaggaccagaagcgggaaac 
actgaggaaaaatccgcagaggaagtaacaatgagtcccgcaattgccatcatgcagcca 
atactgagatttcttcagttactgtgtgagaatcacaaccgggaattgcagaacttcttg 
aggaatcaaaacaacaaaacaaattacaacctagtctgtgagacccttcagtttctggac 
tgcatttgtggaagtacaaccggtggcctgggcctgttgggtctctacatcaatgagaag 
aatgtagcgctggtcaaccagaacctggagagcttgactgagtattgccagggcccttgc 
catgaaaatcagacctgtatcgctacacatgagtctaatgggattgatatcatcattgct 
ttgattctgaatgacataaaccctcttggtaaataccgaatggacctggtgctccagcta 
aagaacaatgcatctaaacttttgctggccattatggaaagcagacatgacagtgagaat 
gcagaaagaattctttttaacatgagacccagagaactggtggatgtgatgaagaatgcc 
tataaccaaggattggaatgtgaccatggggatgatgagggtggagatgatggtgtttct 
ccaaaagatgttggacacaatatctatattctggcccatcagttggcccgccacaataaa 
ctgttgcagcagatgctcaaaccaggatcggatccagatgaaggagatgaagccttaaag 
tattatgccaaccacactgcacagattgagattgtccggcatgataggaccatggaacaa 
atagtttttcctgtccccaatatatgtgaatacctcactcgagaatccaagtgccgtgtg 
ttcaatacaactgaaagggatgaacaaggaagtaaagtgaatgactttttccagcaaaca 
gaagatctctacaatgaaatgaagtggcagaagaaaatcaggaataaccctgcactgttc 
tggttctcgaggcacatctctctctgggggagcatttccttcaacctggctgtgttcatc 
aatttagctgttgctctcttctacccatttggggatgatggagatgaaggtacactttct 
ccattgttctcggttcttctttggatagcagttgcgatctgcacatctatgctgtttttc 
ttctccaagcctgtgggtattcggccgtttcttgtatcaataatgctcagatcaatatat 
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acaataggtcttgggcctacattaatacttcttggtgcagctaatctttgtaataaaatt 
gtttttctggtgagttttgttggaaatcgtggcacgttcacccgtgggtaccgagcagtc 
atcctggatatggcctttctctatcacgtggcgtatgtcctggtttgcatgctgggcctt 
tttgtccatgaattcttctatagcttcctgctttttgatttggtgtacagggaagagact 
ttgctgaatgtcataaaaagtgtcacacgaaatggccgctctattattctaactgcagtc 
ctggctctcatcctcgtctacctgttttccattattgggttcctttttttgaaggatgac 
ttcactatggaagttgataggctgaaaaaccgaactcctgttacaggcagtcatcaagtg 
cctactatgactttaactaccatgatggaagcatgtgccaaggagaactgttcacccaca 
attccagcttcaaatacagctgatgaagagtatgaagatggaattgaaaggacgtgtgac 
actctccttatgtgcattgtcaccgtgctgaaccagggcctcaggaatggcggtggtgtg 
ggggatgtgctaagaaggccatcgaaagatgagcccttgtttgctgcccgagtggtttat 
gaccttcttttttatttcattgttatcattattgttctgaacttgatttttggtgttatc 
atcgatacttttgctgatctcagaagcgaaaaacagaaaaaagaagaaattctaaagaca 
acttgtttcatctgtggacttgagagagacaagtttgataataaaacggtttcatttgag 
gagcacattaagtcagaacacaatatgtggcattatttgtacttcatagtcctggtgaaa 
gttaaagacccaacagaatacactggacctgaaagttatgtggctcaaatgattgtggag 
aagaatttggattggtttcctcggatgcgagccatgtccctcgttagcaatgaaggcgac 
agtgagcaaaatgaaattcggagccttcaggagaagttggaatcgaccatgagtctggtc 
aaacagctgtcgggtcagctggcggagctcaaggagcagatgacagaacaaaggaagaat 
aagcagagactgggcttcctcggatcaaacacaccccatgtgaatcatcacatgccacca 
cactgataccatggggggaagccgtgactagcctttcatcagtgtcctgcctgatcactg 
aataaagaactgagatggaggggagtgaacagtgcctattgttgaaaagttaaaaacaac 
caagtgccaagatgttgagtgggttagctccgagaacaatttataactgtgttttcatgg 
ttgcgaagacctaacctcaaatgcatctgctagaaagcgtacatcacacattcgcaatgc 
atcaggaagaaaaggcttgcccaaaaggctggagagggcagggagcggcaggatggaagg 
agacacggggcagggagaactctcttctgctaaatcgataggagtcagttttgtcttaaa 
tgctgactacagccactgacatggttggctggaatttctttcttttaattgtggcatata 
ggtttgtgacacaagaagtcatactttggtggctaagttttactaaggaaaataactgaa 
aagattaaaagtgagagctgaaaagagaaatgataatgcttccaaactgtagctgtcaca 
gggcaatttctttatttataacatgaagcacaatggatttacagctctaggaacttagta 
ctttggagcttttgcctctcacactgacaacataacaggatgtgattgccttctctggga 
ttcagacaggctctgtcaatgtggagcacaaaaggagattttcatataacttgttaaaaa 
catgttctaagtcatgtataggctaagattttaagaagatctgggggaataaaaagccaa 
cagtaagccccaggaaagggttttttgagaccatatgtatgctattaaaatatattgaga 
ttaatacatgaaaatgcttaaaagtgataggaattattgagaaacttatgatggtggcat 
tgccttttataaacatggagtgaaagccatttgactcaacgtttgctgtgcttaaagaat 
tgcttcaggacccgagcgtttttatgtatgctgttcctgcagttaggaaaaaaaaatcca 
agaaatgtattgaatactcaagaaaatgccacatttcaattacatatttaaaactgtatc 
tgtaagggcttttcaaaatgtagcaagataaaaatctcttattcaaattgtttttgtatt 
aaatcatgcactcagaatttgtcgagggagagaataacctggtgtgttcaggttattctt 
agagactacacatttggaatagcagagcaaaatatggataatgaaagtgttgggaaaaaa 
gtgatgtgacagggagtgaagacacttgataccagactctgggagatactctgcaagttg 
acctggcctctcccccacaggaacaaaacactgcctccagagtctttaaattctcagtta 
tcaacgccaaggtttaaggtctagatagggtttgctatagggtttgctctgacaattttt 
aaagcttttggattgttctaacatagcttaagctattggttcctaaaaatcccaaatcaa 
gatctatgtagaatataaaggaagcctgaaccaatccttccacatactgttaagatgtag 
acttggaacaaagctgttgggacccagagcaatgaatttttgaactgaagctactggtac 
tccccagcaccacctcatactaagaattcctcactctgacatgacagtattttttctgac 
caggagctgaaagaccctgacattcatgatccaaagatataagaaattttgatatgtctg 
catgcagcacagaactttgaacctaggaggcagtcaataaatacatgagaaattcagctg 
tcattcagttactcttaattctttataaaactttaaatgataccacatatttgtttgttt 
aaaatggctttcccaaaatcaaagtagactaaagcagccatctttaaaatccaggatcat 
caatgctattaacagtatcaggtaaaaagtacactttaaaatattttaatcaggcagagt 
tttatggatagagaatagaagagaaaggtagtaaatattgaacatattccaatataggaa 
cctatctctgttttagtacaaaatatttctgacatctgaactagaggtcaagagaataaa 
ttcatttgtatacatctgagcaacctgtctttcagatgataaagtatctagccttttctg 
acaccataatagttcattttgtagggaataagccattaggtgtatataattgctttctag 
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aaatgacctaatgtccccaaccactttgtagtggcagatcactgtttcacagcatatttt 
ctcccaaggaaagtattcaaaagagactgcaactaacaagactcttatttcatcaaaatt 
taaatatttctgagttgtatttttaatgcctctttcttttctgcctaaatgcttagaaat 
tataaagcaaaaaaaaaaaaaaaaacagcaacaaaaaatcgaagcagacaaaaaaggcac 
ttttcagaacatcaaattcctaatgaagaagaggagaagataatggggaaatttgacatt 
tgatataaatttatatttgttatgtgtatgtttgttaatgcaactggaatatttgactta 
ggtgagtatcagttaacatccttgtatttatatagtgacatcaaaataaatgcaatcatc 
ttgaactttgatgttaagggagattttgaaaaaatatttagttatttcaaattcatattg 
gttcaaaagtatcagtttctgctgaataatatttttaatttcaaaagggttttgttccct 
ctgtgctccattctgaggctcaagagcttaatgcaagtatgttttctgagttaaaataac 
acttttagatgagaaaatgcttgtatcatacagggctataatataaataaataaattgta 
tgtatgcaaaatttatcatattttctaccccttaaaaaattaagttagaaataagcttat 
tttcttgcacatcaatatttttctgttggtaaagaggacacaatttctagtagattgtta 
ataacatcaggaagatatttctttcgtcagaactaattgtgtgcttattcccatattctc 
agctcataactccctgttttggctgctctctttattttacaattgtttcattgcaaacag 
caaggcagtatagcccacactcagccactagcccccagtccccaggactgagaatgagtg 
gggaggctggggagttggtgagagaaggtggagatagggatttctgcttcccagtactct 
atcattagagaaaaggaggggtaaatattagctaagaagaagaaaaaagcttattcttca 
gggacccagtagattggacgtaagcaagtaaatattgcacaggcatcatgatgtaatagt 
tgatggaccccagagatacccctcacccatgcagagaacaaagctgtgtggaagagcaac 
taccataagatggtgtgttccttccagcggctgttaaagccatcctgaataacaaatcag 
ctattcctacctacagacccattccaatatgctgtgatacacgacacggtgcaccgcaga 
ggagacagacatcttcagcacaaacacagatgcctgcaggagcttggcaagtcactaaat 
tgcatgaaattgtgaggtgcacaccaaatgcagggatgggggaggccgtgggaagctctg 
tattctcaaaatttgacagctaattccggtttttagaaaatgcctgaggccagtagaggc 
cctctagtcactcactgctgctgtttctgatataattattgagaaagctatctcacttaa 
tagaagaaaacacgcactatcaaaaccagatagcctaacgtgcatgtgaaaatcgagaaa 
gctgaaaacaaatccaggtacccttctctgaactggagtgtttccacagacttgaataat 
ttatgaaattatcacaccagtatttctcatatcaccaagaagactttctctcctgcagta 
gaggattgttatatttgcctaaaaaacacgattccaatatatgacaagggcagataattt 
ataagtgaatgttaataaaattggatgtgtataa 

SEQ ID NO: 433 . >Z38709_T11 # TY Transcript # LN 5881 # Source Gene: Z38709 
# Encoded protein: Z3B709_P5 

gcggccgcacgcgcccgctcccccgccggggtcccccgccgcccctccagcgccgcccag 
cgtcgcgtccccgcgcgggccggtgcggcggcagcagaggcttcccgggtggcccggcct 
cccgagggtcgccccgccccggcccagctccgcccccggccgctgctccctgattggccg 
cccccccacctctccagggctcctctcctcctcctcgcccaatgggcaccggcacctggt 
ccaaaaattcccctgaggggaccgagggggctcggccacttccagctcccgccggcggct 
gtgacggccgcagcgggtcgcagagcagggagtggacacgagagtggggagcggagaggg 
aaggaggaggaggaggagcaaaggtgttggagagaaaacttcagaaaggaacaggaaacc 
tgccggggagcgcggcggcggctgcggcttctctgggcgcctgggctgcgctcttcgcgg 
ggtgtcccgagctggcgcttggccccggtctggcttccccggcgcgcaccacggctgagc 
gcgacgctcagtggctcgggccgtgcctccgccgtggctcttggccgctgtagtcccgcg 
atccgatccgcttctgccgcggcggctccctggagagagggcggcgagggcgcagggaag 
aagagggacgtccactgtggaacatgaagcagcatgaagcagcatgactgagaaaatgtc 
cagcttcctctacataggggacatcgtgtccctgtacgcggagggctcggtcaacggctt 
catcagcaccttggggttagtggatgacagatgtgtggtgcacccagaggccggggacct 
tgccaaccctcccaagaagttcagagactgccttttcaaggtgtgccctatgaacagata 
ttctgcccagaagcaatattggaaagcaaagcaagccaaacaagggaaccacaccgaggc 
agccttgctgaagaaactacagcacgctgcagaactggaacaaaaacaaaatgaatcgga 
gaataagaaactgttgggagaaattgtaaaatacagtaatgttatacaactactgcatat 
aaaaagcaacaaatatcttactgtcaacaagagattacctgctttactggagaagaatgc 
catgcgtgtgtccttggatgctgcaggaaatgaagggtcttggttttatattcatccatt 
ctggaaactgagaagcgagggtgacaatattgttgtaggagataaagttgttttgatgcc 
tgtgaatgcagggcagccactacatgccagcaacatagagcttcttgataacccagggtg 
taaagaggtgaatgctgtcaattgcaacaccagctggaaaatcactttattcatgaaata 
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tagttcctatcgagaggatgtattaaaaggaggggacgttgttagattatttcatgcgga 
acaagagaagtttttgacttgtgatgaatatgagaaaaaacagcacattttccttcgtac 
gaccttgcgccaatcagctacttctgctactagttctaaagcactctgggaaatagaggt 
ggttcatcatgacccatgccgtgggggtgcaggacagtggaacagcttgttcagatttaa 
gcatcttgcaactggaaactatttagctgcagagcttaatcctgattatcgagatgccca 
aaatgaaggaaaaaatgtgagagatggagtccctccaacttcaaagaaaaaacgccaggc 
aggggagaagatcatgtatactttggtttcagtcccgcatggcaatgacattgcatccct 
ttttgaactagatgccacaactcttcagagagctgactgcctggttccaaggaactcata 
tgttcggttaaggcatttatgcaccaacacatgggtaaccagtactagtatccccataga 
cacagatgaagagaggcctgttatgttaaagattggaacctgccaaaccaaagaagataa 
agaagcgttcgcaatcgtgtctgttccactgtctgaagttcgagacttagactttgccaa 
tgatgccaataaagtactagcgaccacagttaaaaagctagaaaacggcacaataactca 
gaatgaaaggaggtttgtaaccaaattattggaagatctcatattctttgttgctgatgt 
gcctaataatggacaagaagttctggatgtggttatcactaagccaaaccgagagcgtca 
aaaattgatgagggaacaaaacatactggcacaggtatttggaattcttaaagcaccctt 
taaagagaaagcaggagaaggctcgatgctgagacttgaagatctgggggatcaaagata 
tgcaccctacaagtacatgctgcggctctgttaccgcgtcctgagacactcgcagcagga 
ttaccggaaaaatcaggaatatattgctaagaatttctgtgtcatgcagtcccagattgg 
ctatgatattttggcagaagatactatcacagctttgttgcacaacaacagaaaactact 
agagaaacatatcacagcaaaagaaatagaaacatttgtcagtttactcaggagaaatcg 
ggagccaaggtttttggattatttgtcagatctgtgtgtgtctaataccactgctatccc 
tgtaactcaagaactcatctgtaaatttatgttgagtccaggcaatgcagacattctcat 
tcaaactaaggtggtctcaatgcaagcagacaaccccatggagagctccatcctttcaga 
tgacattgatgatgaagaagtttggctctattggattgacagcaacaaggaacctcatgg 
caaagctatcaggcaccttgctcaagaggcaaaagaaggcaccaaagctgacttagaagt 
tcttacctattacaggtaccagctaaacctctttgcaaggatgtgcttggatcgccagta 
tctggccataaaccagatttctacacagctgtctgtagacctgatcctgcggtgtgtgtc 
ggatgagagcctgccgttcgacctccgagcgtccttctgtcgcctcatgctccacatgca 
cgttgaccgggatccccaggagtccgtggtgcctgttcgctatgccaggctctggacaga 
aatccccacaaagatcacaattcatgaatatgattctataacagactcttccagaaatga 
tatgaagaggaaatttgccctgacaatggaatttgttgaagaatatttgaaagaagttgt 
aaaccagccctttccttttggggataaagaaaaaaataaactgacatttgaggtggtcca 
cttggctcggaatcttatatactttggattttatagtttcagtgagttattaaggctaac 
aagaacacttctggctattttagacattgtacaggcccccatgtcatcatactttgaaag 
attaagcaaatttcaagatggaggaaacaatgtgatgagaaccattcatggggtgggaga 
gatgatgacccagatggtactcagtagaggctccatcttccccatgagcgtgccggatgt 
gccacccagcatccacccgagcaagcaagggagccccaccgagcatgaggatgtgactgt 
gatggacaccaagctgaagatcattgagattttgcagtttatcctgagtgtcagactgga 
ttataggatctcatatatgctgtcaatatataagaaggagtttggagaggacaatgacaa 
tgcggagacatctgccagtggatctccagacactttactaccatcagctattgttcctga 
tatagatgaaattgcagctcaggcagaaactatgtttgcgggaagaaaagaaaaaaatcc 
agttcaacttgacgatgaaggaggcaggacgtttttacgggtcctcattcatctgatcat 
gcacgactacccgcctttgctgtctggagccctgcagctgttgtttaagcacttcagcca 
gagggcagaggttttacaggcatttaagcaggtgcaattactggtgtctaatcaagacgt 
agataactacaagcaaatcaaggcagatctagaccagcttcgactgacagtagaaaagtc 
tgagctatgggtggagaagagcagcaactatgagaatggagaaataggggaaagtcaagt 
gaaaggtggtgaagagccaattgaggaatcaaacattttaagtccagtgcaggatggaac 
aaagaaacctcagattgacagcaacaagagcaataactaccggattgtaaaggagatttt 
gatcaggctaagtaaactctgtgtgcagaataaaaagtgtcggaatcaacatcaacgatt 
actgaaaaatatgggggcgcattcggtggtgttggatcttctgcagataccctatgaaaa 
gaatgatgaaaagatgaatgaagtaatgaatctagcccatacatttctgcagaatttctg 
tcgaggaaatccacagaatcaagttcttcttcataaacatctgaatttgtttttaactcc 
aggtctccttgaagcagaaaccatgcggcacatcttcatgaacaattaccatctgtgcaa 
cgaaattagcgagagagttgtacaacactttgtgcactgcattgagacacatggccgcca 
cgtggagtacctgaggtttttgcaaacaattgtaaaagcagatggtaaatatgtgaagaa 
atgccaggatatggtaatgacagagttgataaatgggggtgaagacgtgctgatatttta 
caatgatagagcatcatttccaatccttctccatatgatgtgttcagagagagaccgagg 
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ggatgagagtggccccttagcctaccacatcaccctggtggagttgctggcagcatgcac 
agaggggaaaaatgtctacactgaaatcaagtgtaattcccttctcccgctggacgacat 
agtgagggtggtgacccatgacgactgcatccctgaggttaaaattgcttatgtgaactt 
tgttaatcactgttatgttgacactgaagtggaaatgaaagaaatctatacaagtaacca 
catttggaaattatttgagaacttcttggtggatatggcaagggtttgcaacacaactac 
agacaggaaacatgcagacatctttttggaaaagtgtgttactgagtcaataatgaatat 
tgtgagcggcttctttaattctcccttttcagacaatagtaccagcctccagacacatca 
gccagtttttattcagctactgcaatctgccttcagaatttacaattgcacctggccaaa 
cccagcgcagaaagcctcagtggaatcctgtatcagaactttggctgaagtggcaaaaaa 
tcgtggaattgccattccagtggatttggacagccaagttaatactcttttcatgaagag 
ccattcaaatatggtgcagagagcagcaatgggttggagactatcagctcgctctgggcc 
acgctttaaggaagctcttggagggcctgcttgggattacagaaatattattgaaaagtt 
acaggatgtagtggcctccttggagcaccagttcagcccaatgatgcaggctgaattctc 
agtgttggttgatgtattgtacagtccagaactgctgttccctgagggaagcgatgcaag 
aataagatgtggcgctttcatgtcgaagttgattaatcatacaaagaaactaatggagaa 
agaagaaaaactgtgcattaaaattcttcagacattacgagaaatgttagagaagaaaga 
cagctttgtggaagaggtatttttttaaaaaactttattatttaaaaataaagtaggatt 
tccccaatactgtaacctatgaattacaataggctattacgtgattacttgcattatatc 
t 

SEQ ID NO: 434. >Z38709_T12 # TY Transcript # LN 6191 # Source Gene: Z38709 
# Encoded protein: Z38709_P6 

gcggccgcacgcgcccgctcccccgccggggtcccccgccgcccctccagcgccgcccag 
cgtcgcgtccccgcgcgggccggtgcggcggcagcagaggcttcccgggtggcccggcct 
cccgagggtcgccccgccccggcccagctccgcccccggccgctgctccctgattggccg 
cccccccacctctccagggctcctctcctcctcctcgcccaatgggcaccggcacctggt 
ccaaaaattcccctgaggggaccgagggggctcggccacttccagctcccgccggcggct 
gtgacggccgcagcgggtcgcagagcagggagtggacacgagagtggggagcggagaggg 
aaggaggaggaggaggagcaaaggtgttggagagaaaacttcagaaaggaacaggaaacc 
tgccggggagcgcggcggcggctgcggcttctctgggcgcctgggctgcgctcttcgcgg 
ggtgtcccgagctggcgcttggccccggtctggcttccccggcgcgcaccacggctgagc 
gcgacgctcagtggctcgggccgtgcctccgccgtggctcttggccgctgtagtcccgcg 
atccgatccgcttctgccgcggcggctccctggagagagggcggcgagggcgcagggaag 
aagagggacgtccactgtggaacatgaagcagcatgaagcagcatgactgagaaaatgtc 
cagcttcctctacataggggacatcgtgtccctgtacgcggagggctcggtcaacggctt 
catcagcaccttggggttagtggatgacagatgtgtggtgcacccagaggccggggacct 
tgccaaccctcccaagaagttcagagactgccttttcaaggtgtgccctatgaacagata 
ttctgcccagaagcaatattggaaagcaaagcaagccaaacaagggaaccacaccgaggc 
agccttgctgaagaaactacagcacgctgcagaactggaacaaaaacaaaatgaatcgga 
gaataagaaactgttgggagaaattgtaaaatacagtaatgttatacaactactgcatat 
aaaaagcaacaaatatcttactgtcaacaagagattacctgctttactggagaagaatgc 
catgcgtgtgtccttggatgctgcaggaaatgaagggtcttggttttatattcatccatt 
ctggaaactgagaagcgagggtgacaatattgttgtaggagataaagttgttttgatgcc 
tgtgaatgcagggcagccactacatgccagcaacatagagcttcttgataacccagggtg 
taaagaggtgaatgctgtcaattgcaacaccagctggaaaatcactttattcatgaaata 
tagttcctatcgagaggatgtattaaaaggaggggacgttgttagattatttcatgcgga 
acaagagaagtttttgacttgtgatgaatatgagaaaaaacagcacattttccttcgtac 
gaccttgcgccaatcagctacttctgctactagttctaaagcactctgggaaatagaggt 
ggttcatcatgacccatgccgtgggggtgcaggacagtggaacagcttgttcagatttaa 
gcatcttgcaactggaaactatttagctgcagagcttaatcctgattatcgagatgccca 
aaatgaaggaaaaaatgtgagagatggagtccctccaacttcaaagaaaaaacgccaggc 
aggggagaagatcatgtatactttggtttcagtcccgcatggcaatgacattgcatccct 
ttttgaactagatgccacaactcttcagagagctgactgcctggttccaaggaactcata 
tgttcggttaaggcatttatgcaccaacacatgggtaaccagtactagtatccccataga 
cacagatgaagagaggcctgttatgttaaagattggaacctgccaaaccaaagaagataa 
agaagcgttcgcaatcgtgtctgttccactgtctgaagttcgagacttagactttgccaa 
tgatgccaataaagtactagcgaccacagttaaaaagctagaaaacggcacaataactca 
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gaatgaaaggaggtttgtaaccaaattattggaagatctcatattctttgttgctgatgt 
gcctaataatggacaagaagttctggatgtggttatcactaagccaaaccgagagcgtca 
aaaattgatgagggaacaaaacatactggcacaggtatttggaattcttaaagcaccctt 
taaagagaaagcaggagaaggctcgatgctgagacttgaagatctgggggatcaaagata 
tgcaccctacaagtacatgctgcggctctgttaccgcgtcctgagacactcgcagcagga 
ttaccggaaaaatcaggaatatattgctaagaatttctgtgtcatgcagtcccagattgg 
ctatgatattttggcagaagatactatcacagctttgttgcacaacaacagaaaactact 
agagaaacatatcacagcaaaagaaatagaaacatttgtcagtttactcaggagaaatcg 
ggagccaaggtttttggattatttgtcagatctgtgtgtgtctaataccactgctatccc 
tgtaactcaagaactcatctgtaaatttatgttgagtccaggcaatgcagacattctcat 
tcaaactaaggtggtctcaatgcaagcagacaaccccatggagagctccatcctttcaga 
tgacattgatgatgaagaagtttggctctattggattgacagcaacaaggaacctcatgg 
caaagctatcaggcaccttgctcaagaggcaaaagaaggcaccaaagctgacttagaagt 
tcttacctattacaggtaccagctaaacctctttgcaaggatgtgcttggatcgccagta 
tctggccataaaccagatttctacacagctgtctgtagacctgatcctgcggtgtgtgtc 
ggatgagagcctgccgttcgacctccgagcgtccttctgtcgcctcatgctccacatgca 
cgttgaccgggatccccaggagtccgtggtgcctgttcgctatgccaggctctggacaga 
aatccccacaaagatcacaattcatgaatatgattctataacagactcttccagaaatga 
tatgaagaggaaatttgccctgacaatggaatttgttgaagaatatttgaaagaagttgt 
aaaccagccctttccttttggggataaagaaaaaaataaactgacatttgaggtggtcca 
cttggctcggaatcttatatactttggattttatagtttcagtgagttattaaggctaac 
aagaacacttctggctattttagacattgtacaggcccccatgtcatcatactttgaaag 
attaagcaaatttcaagatggaggaaacaatgtgatgagaaccattcatggggtgggaga 
gatgatgacccagatggtactcagtagaggctccatcttccccatgagcgtgccggatgt 
gccacccagcatccacccgagcaagcaagggagccccaccgagcatgaggatgtgactgt 
gatggacaccaagctgaagatcattgagattttgcagtttatcctgagtgtcagactgga 
ttataggatctcatatatgctgtcaatatataagaaggagtttggagaggacaatgacaa 
tgcggagacatctgccagtggatctccagacactttactaccatcagctattgttcctga 
tatagatgaaattgcagctcaggcagaaactatgtttgcgggaagaaaagaaaaaaatcc 
agttcaacttgacgatgaaggaggcaggacgtttttacgggtcctcattcatctgatcat 
gcacgactacccgcctttgctgtctggagccctgcagctgttgtttaagcacttcagcca 
gagggcagaggttttacaggcatttaagcaggtgcaattactggtgtctaatcaagacgt 
agataactacaagcaaatcaaggcagatctagaccagcttcgactgacagtagaaaagtc 
tgagctatgggtggagaagagcagcaactatgagaatggagaaataggggaaagtcaagt 
gaaaggtggtgaagagccaattgaggaatcaaacattttaagtccagtgcaggatggaac 
aaagaaacctcagattgacagcaacaagagcaataactaccggattgtaaaggagatttt 
gatcaggctaagtaaactctgtgtgcagaataaaaagtgtcggaatcaacatcaacgatt 
actgaaaaatatgggggcgcattcggtggtgttggatcttctgcagataccctatgaaaa 
gaatgatgaaaagatgaatgaagtaatgaatctagcccatacatttctgcagaatttctg 
tcgaggaaatccacagaatcaagttcttcttcataaacatctgaatttgtttttaactcc 
aggtctccttgaagcagaaaccatgcggcacatcttcatgaacaattaccatctgtgcaa 
cgaaattagcgagagagttgtacaacactttgtgcactgcattgagacacatggccgcca 
cgtggagtacctgaggtttttgcaaacaattgtaaaagcagatggtaaatatgtgaagaa 
atgccaggatatggtaatgacagagttgataaatgggggtgaagacgtgctgatatttta 
caatgatagagcatcatttccaatccttctccatatgatgtgttcagagagagaccgagg 
ggatgagagtggccccttagcctaccacatcaccctggtggagttgctggcagcatgcac 
agaggggaaaaatgtctacactgaaatcaagtgtaattcccttctcccgctggacgacat 
agtgagggtggtgacccatgacgactgcatccctgaggttaaaattgcttatgtgaactt 
tgttaatcactgttatgttgacactgaagtggaaatgaaagaaatctatacaagtaacca 
catttggaaattatttgagaacttcttggtggatatggcaagggtttgcaacacaactac 
agacaggaaacatgcagacatctttttggaaaagtgtgttactgagtcaataatgaatat 
tgtgagcggcttctttaattctcccttttcagacaatagtaccagcctccagacacatca 
gccagtttttattcagctactgcaatctgccttcagaatttacaattgcacctggccaaa 
cccagcgcagaaagcctcagtggaatcctgtatcagaactttggctgaagtggcaaaaaa 
tcgtggaattgccattccagtggatttggacagccaagttaatactcttttcatgaagag 
ccattcaaatatggtgcagagagcagcaatgggttggagactatcagctcgctctgggcc 
acgctttaaggaagctcttggagggcctgcttgggattacagaaatattattgaaaagtt 
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acaggatgtagtggcctccttggagcaccagttcagcccaatgatgcaggctgaattctc 
agtgttggttgatgtattgtacagtccagaactgctgttccctgagggaagcgatgcaag 
aataagatgtggcgctttcatgtcgaagttgattaatcatacaaagaaactaatggagaa 
agaagaaaaactgtgcattaaaattcttcagacattacgagaaatgttagagaagaaaga 
cagctttgtggaagagggtaacacattaagaaagatacttctgaatcgatactttaaagg 
tgattatagtattggtgtgaatggacacctatcaggagcctactccaaaactgcacaggt 
gggaggaagcttttctggacaagattcagataagatggggatatcaatgtcagacattca 
gtgtctgctggataaagaaggtgcatcagaacttgtcatcgatgttatagtgaacaccaa 
aaatgacagaattttttcagaaggcattttcctcggcattgccttgcttgaaggaggaaa 
tacacaaacacaggtacttgttcatcaatcatggggattacatttattattggtatttct 
tgtttacttgtcacagaaaccaaggaaaagtaatgtgtttcttttagctcttattactat 
ctaaaaaaaaa 

SEQ ID NO:435. >Z38709_T13 8 TY Transcript # LN 4406 # Source Gene: Z38709 
# Encoded protein: Z38709_P7 

gcggccgcacgcgcccgctcccccgccggggtcccccgccgcccctccagcgccgcccag 
cgtcgcgtccccgcgcgggccggtgcggcggcagcagaggcttcccgggtggcccggcct 
cccgagggtcgccccgccccggcccagctccgcccccggccgctgctccctgattggccg 
cccccccacctctccagggctcctctcctcctcctcgcccaatgggcaccggcacctggt 
ccaaaaattcccctgaggggaccgagggggctcggccacttccagctcccgccggcggct 
gtgacggccgcagcgggtcgcagagcagggagtggacacgagagtggggagcggagaggg 
aaggaggaggaggaggagcaaaggtgttggagagaaaacttcagaaaggaacaggaaacc 
tgccggggagcgcggcggcggctgcggcttctctgggcgcctgggctgcgctcttcgcgg 
ggtgtcccgagctggcgcttggccccggtctggcttccccggcgcgcaccacggctgagc 
gcgacgctcagtggctcgggccgtgcctccgccgtggctcttggccgctgtagtcccgcg 
atccgatccgcttctgccgcggcggctccctggagagagggcggcgagggcgcagggaag 
aagagggacgtccactgtggaacatgaagcagcatgaagcagcatgactgagaaaatgtc 
cagcttcctctacataggggacatcgtgtccctgtacgcggagggctcggtcaacggctt 
catcagcaccttggggttagtggatgacagatgtgtggtgcacccagaggccggggacct 
tgccaaccctcccaagaagttcagagactgccttttcaaggtgtgccctatgaacagata 
ttctgcccagaagcaatattggaaagcaaagcaagccaaacaagggaaccacaccgaggc 
agccttgctgaagaaactacagcacgctgcagaactggaacaaaaacaaaatgaatcgga 
gaataagaaactgttgggagaaattgtaaaatacagtaatgttatacaactactgcatat 
aaaaagcaacaaatatcttactgtcaacaagagattacctgctttactggagaagaatgc 
catgcgtgtgtccttggatgctgcaggaaatgaagggtcttggttttatattcatccatt 
ctggaaactgagaagcgagggtgacaatattgttgtaggagataaagttgttttgatgcc 
tgtgaatgcagggcagccactacatgccagcaacatagagcttcttgataacccagggtg 
taaagaggtgaatgctgtcaattgcaacaccagctggaaaatcactttattcatgaaata 
tagttcctatcgagaggatgtattaaaaggaggggacgttgttagattatttcatgcgga 
acaagagaagtttttgacttgtgatgaatatgagaaaaaacagcacattttccttcgtac 
gaccttgcgccaatcagctacttctgctactagttctaaagcactctgggaaatagaggt 
ggttcatcatgacccatgccgtgggggtgcaggacagtggaacagcttgttcagatttaa 
gcatcttgcaactggaaactatttagctgcagagcttaatcctgattatcgagatgccca 
aaatgaaggaaaaaatgtgagagatggagtccctccaacttcaaagaaaaaacgccaggc 
aggggagaagatcatgtatactttggtttcagtcccgcatggcaatgacattgcatccct 
ttttgaactagatgccacaactcttcagagagctgactgcctggttccaaggaactcata 
tgttcggttaaggcatttatgcaccaacacatgggtaaccagtactagtatccccataga 
cacagatgaagagaggcctgttatgttaaagattggaacctgccaaaccaaagaagataa 
agaagcgttcgcaatcgtgtctgttccactgtctgaagttcgagacttagactttgccaa 
tgatgccaataaagtactagcgaccacagttaaaaagctagaaaacggcacaataactca 
gaatgaaaggaggtttgtaaccaaattattggaagatctcatattctttgttgctgatgt 
gcctaataatggacaagaagttctggatgtggttatcactaagccaaaccgagagcgtca 
aaaattgatgagggaacaaaacatactggcacaggtatttggaattcttaaagcaccctt 
taaagagaaagcaggagaaggctcgatgctgagacttgaagatctgggggatcaaagata 
tgcaccctacaagtacatgctgcggctctgttaccgcgtcctgagacactcgcagcagga 
ttaccggaaaaatcaggaatatattgctaagaatttctgtgtcatgcagtcccagattgg 
ctatgatattttggcagaagatactatcacagctttgttgcacaacaacagaaaactact 
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agagaaacatatcacagcaaaagaaatagaaacatttgtcagtttactcaggagaaatcg 
ggagccaaggtttttggattatttgtcagatctgtgtgtgtctaataccactgctatccc 
tgtaactcaagaactcatctgtaaatttatgttgagtccaggcaatgcagacattctcat 
tcaaactaaggtggtctcaatgcaagcagacaaccccatggagagctccatcctttcaga 
tgacattgatgatgaagaagtttggctctattggattgacagcaacaaggaacctcatgg 
caaagctatcaggcaccttgctcaagaggcaaaagaaggcaccaaagctgacttagaagt 
tcttacctattacaggtaccagctaaacctctttgcaaggatgtgcttggatcgccagta 
tctggccataaaccagatttctacacagctgtctgtagacctgatcctgcggtgtgtgtc 
ggatgagagcctgccgttcgacctccgagcgtccttctgtcgcctcatgctccacatgca 
cgttgaccgggatccccaggagtccgtggtgcctgttcgctatgccaggctctggacaga 
aatccccacaaagatcacaattcatgaatatgattctataacagactcttccagaaatga 
tatgaagaggaaatttgccctgacaatggaatttgttgaagaatatttgaaagaagttgt 
aaaccagccctttccttttggggataaagaaaaaaataaactgacatttgaggtggtcca 
cttggctcggaatcttatatactttggattttatagtttcagtgagttattaaggctaac 
aagaacacttctggctattttagacattgtacaggcccccatgtcatcatactttgaaag 
attaagcaaatttcaagatggaggaaacaatgtgatgagaaccattcatggggtgggaga 
gatgatgacccagatggtactcagtagaggctccatcttccccatgagcgtgccggatgt 
gccacccagcatccacccgagcaagcaagggagccccaccgagcatgaggatgtgactgt 
gatggacaccaagctgaagatcattgagattttgcagtttatcctgagtgtcagactgga 
ttataggatctcatatatgctgtcaatatataagaaggagtttggagaggacaatgacaa 
tgcggagacatctgccagtggatctccagacactttactaccatcagctattgttcctga 
tatagatgaaattgcagctcaggcagaaactatgtttgcgggaaggtatagtaacattta 
tatatctttatcttacttgtgaataacattttacttagaattctattgatactttgatat 
tgttcactctatatctatttatttttagaccctctattgttgctaacataatttctgcca 
gaaatgaagctgctaatttactctggaggagaaagcatagcttttctttaggtcaggtaa 
aaaaaaggacaagagagatgagatcagtgtattgaactttcctctgaattctttgctttt 
ctttcctcaaataatgacagcccaagaatgccctttaactcccctggtataaatatcact 
gagtcttgtaaaacattgcttccctgaatcttagatatttgaggcaagaaagtaacttct 
ttaatgttatcattaactttatatgtaagaaataaagccattttctgccaattggaaata 
cccatgattaaataaaacaagacacctttgacatgactagcctcctttgccatcaggaat 
cctgccgtttattgttttaatattgtttgttcttcagaaatgaagttaatagtgatcaaa 
aagaaaaattaaaatgtttaagctac 

SEQ ID NO: 436. >Z38709JT14 # TY Transcript # LN 1364 # Source Gene: Z38709 
ft Encoded protein: Z38709_P12 

gcggccgcacgcgcccgctcccccgccggggtcccccgccgcccctccagcgccgcccag 
cgtcgcgtccccgcgcgggccggtgcggcggcagcagaggcttcccgggtggcccggcct 
cccgagggtcgccccgccccggcccagctccgcccccggccgctgctccctgattggccg 
cccccccacctctccagggctcctctcctcctcctcgcccaatgggcaccggcacctggt 
ccaaaaattcccctgaggggaccgagggggctcggccacttccagctcccgccggcggct 
gtgacggccgcagcgggtcgcagagcagggagtggacacgagagtggggagcggagaggg 
aaggaggaggaggaggagcaaaggtgttggagagaaaacttcagaaaggaacaggaaacc 
tgccggggagcgcggcggcggctgcggcttctctgggcgcctgggctgcgctcttcgcgg 
ggtgtcccgagctggcgcttggccccggtctggcttccccggcgcgcaccacggctgagc 
gcgacgctcagtggctcgggccgtgcctccgccgtggctcttggccgctgtagtcccgcg 
atccgatccgcttctgccgcggcggctccctggagagagggcggcgagggcgcagggaag 
aagagggacgtccactgtggaacatgaagcagcatgaagcagcatgactgagaaaatgtc 
cagcttcctctacataggggacatcgtgtccctgtacgcggagggctcggtcaacggctt 
catcagcaccttggggttagtggatgacagatgtgtggtgcacccagaggccggggacct 
tgccaaccctcccaagaagttcagagactgccttttcaaggtgtgccctatgaacagata 
ttctgcccagaagcaatattggaaagcaaagcaagccaaacaagggaaccacaccgaggc 
agccttgctgaagaaactacagcacgctgcagaactggaacaaaaacaaaatgaatcgga 
gaataagaaactgttgggagaaattgtaaaatacagtaatgttatacaactactgcatat 
aaaaagcaacaaatatcttactgtcaacaagagattacctgctttactggagaagaatgc 
catgcgtgtgtccttggatgctgcaggaaatgaagggtcttggttttatattcatccatt 
ctggaaactgagaagcgagggtgacaatgatgcaagcttctggatttaatttggttaata 
aattatctggtgatatcttcctttggtaaggtttggtagaagcatggcatgcaggaatgt 
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gatgcactttaagtcttgtgttggttggatttcttaaaaaaaaa 

SEQ ID NO: 437. >Z38709_T15 # TY Transcript « LN 2160 # Source Gene: Z38709 

# Encoded protein: Z38709_P9 

gcggccgcacgcgcccgctcccccgccggggtcccccgccgcccctccagcgccgcccag 
cgtcgcgtccccgcgcgggccggtgcggcggcagcagaggcttcccgggtggcccggcct 
cccgagggtcgccccgccccggcccagctccgcccccggccgctgctccctgattggccg 
cccccccacctctccagggctcctctcctcctcctcgcccaatgggcaccggcacctggt 
ccaaaaattcccctgaggggaccgagggggctcggccacttccagctcccgccggcggct 
gtgacggccgcagcgggtcgcagagcagggagtggacacgagagtggggagcggagaggg 
aaggaggaggaggaggagcaaaggtgttggagagaaaacttcagaaaggaacaggaaacc 
tgccggggagcgcggcggcggctgcggcttctctgggcgcctgggctgcgctcttcgcgg 
ggtgtcccgagctggcgcttggccccggtctggcttccccggcgcgcaccacggctgagc 
gcgacgctcagtggctcgggccgtgcctccgccgtggctcttggccgctgtagtcccgcg 
atccgatccgcttctgccgcggcggctccctggagagagggcggcgagggcgcagggaag 
aagagggacgtccactgtggaacatgaagcagcatgaagcagcatgactgagaaaatgtc 
cagcttcctctacataggggacatcgtgtccctgtacgcggagggctcggtcaacggctt 
catcagcaccttggggttagtggatgacagatgtgtggtgcacccagaggccggggacct 
tgccaaccctcccaagaagttcagagactgccttttcaaggtgtgccctatgaacagata 
ttctgcccagaagcaatattggaaagcaaagcaagccaaacaagggaaccacaccgaggc 
agccttgctgaagaaactacagcacgctgcagaactggaacaaaaacaaaatgaatcgga 
gaataagaaactgttgggagaaattgtaaaatacagtaatgttatacaactactgcatat 
aaaaagcaacaaatatcttactgtcaacaagagattacctgctttactggagaagaatgc 
catgcgtgtgtccttggatgctgcaggaaatgaagggtcttggttttatattcatccatt 
ctggaaactgagaagcgagggtgacaatattgttgtaggagataaagttgttttgatgcc 
tgtgaatgcagggcagccactacatgccagcaacatagagcttcttgataacccagggtg 
taaagaggtgaatgctgtcaattgcaacaccagctggaaaatcactttattcatgaaata 
tagttcctatcgagaggatgtattaaaaggaggggacgttgttagattatttcatgcgga 
acaagagaagtttttgacttgtgatgaatatgagaaaaaacagcacattttccttcgtac 
gaccttgcgccaatcagctacttctgctactagttctaaagcactctgggaaatagaggt 
ggttcatcatgacccatgccgtgggggtgcaggacagtggaacagcttgttcagatttaa 
gcatcttgcaactggaaactatttagctgcagaggcttatcaatcacaataatagatttg 
tcctgaaatgccagtcgttggaaagagaaacatataaatataatcatttaaaaatacaca 
catgatgtttatgatttcagtgttatatataataaaaattgatcaaaacacatttcattg 
taaaaaaaagtaaagcttctcttttaagaaaataagtaggtattatggacttaaaaactt 
acggtctgcttcttgcttttgtgttagatggaggggtgcagttgttttttaattgattgc 
ggtacaagcatgactttcactgcccaggtatttatgatgtttctctctctctatcatttt 
gcaccatacctgacttctttatattcttctttggccagttaattgaagactgtcttcaaa 
gattagtaatgggtctctgtattctctctctgttactcactgccatggctgcagtggacc 
ctgcctttgatcttggatatgccattgccatacaaaatcaataaatttatctaaaattta 

SEQ ID NO:438. >Z38709_T16 # TY Transcript # LN 1977 # Source Gene: Z38709 

# Encoded protein: Z38709_P10 

gcggccgcacgcgcccgctcccccgccggggtcccccgccgcccctccagcgccgcccag 
cgtcgcgtccccgcgcgggccggtgcggcggcagcagaggcttcccgggtggcccggcct 
cccgagggtcgccccgccccggcccagctccgcccccggccgctgctccctgattggccg 
cccccccacctctccagggctcctctcctcctcctcgcccaatgggcaccggcacctggt 
ccaaaaattcccctgaggggaccgagggggctcggccacttccagctcccgccggcggct 
gtgacggccgcagcgggtcgcagagcagggagtggacacgagagtggggagcggagaggg 
aaggaggaggaggaggagcaaaggtgttggagagaaaacttcagaaaggaacaggaaacc 
tgccggggagcgcggcggcggctgcggcttctctgggcgcctgggctgcgctcttcgcgg 
ggtgtcccgagctggcgcttggccccggtctggcttccccggcgcgcaccacggctgagc 
gcgacgctcagtggctcgggccgtgcctccgccgtggctcttggccgctgtagtcccgcg 
atccgatccgcttctgccgcggcggctccctggagagagggcggcgagggcgcagggaag 
aagagggacgtccactgtggaacatgaagcagcatgaagcagcatgactgagaaaatgtc 
cagcttcctctacataggggacatcgtgtccctgtacgcggagggctcggtcaacggctt 
catcagcaccttggggttagtggatgacagatgtgtggtgcacccagaggccggggacct 
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tgccaaccctcccaagaagttcagagactgccttttcaaggtgtgccctatgaacagata 
ttctgcccagaagcaatattggaaagcaaagcaagccaaacaagggaaccacaccgaggc 
agccttgctgaagaaactacagcacgctgcagaactggaacaaaaacaaaatgaatcgga 
gaataagaaactgttgggagaaattgtaaaatacagtaatgttatacaactactgcatat 
aaaaagcaacaaatatcttactgtcaacaagagattacctgctttactggagaagaatgc 
catgcgtgtgtccttggatgctgcaggaaatgaagggtcttggttttatattcatccatt 
ctggaaactgagaagcgagggtgacaatattgttgtaggagataaagttgttttgatgcc 
tgtgaatgcagggcagccactacatgccagcaacatagagcttcttgataacccagggtg 
taaagaggtgaatgctgtcaattgcaacaccagctggaaaatcactttattcatgaaata 
tagttcctatcgagaggatgtattaaaaggaggggacgttgttagattatttcatgcgga 
acaagagaagtttttgacttgtgatgaatatgagaaaaaacagcacattttccttcgtac 
gaccttgcgccaatcagctacttctgctactagttctaaagcactctgggaaatagaggt 
ggttcatcatgacccatgccgtgggggtgcaggacagtggaacagcttgttcagatttaa 
gcatcttgcaactggaaactatttagctgcagagcttaatcctgattatcgagatgccca 
aaatgaaggaaaaaatgtgagagatggagtccctccaacttcaaagaaaaaacgccaggc 
aggggagaagatcatgtatactttggtttcagtcccgcatggcaatgacattgcatccct 
ttttgaactagatgccacaactcttcagagagctgactgcctggttccaaggaactcata 
tgttcggttaaggcatttatgcaccaacacatgggtaaccagtactagtatccccataga 
cacagatgaagagaggcctgttatgttaaaggtaaatgtcactgaagatggtttaat 

SEQ ID NO: 439. >Z38709_T17 # TY Transcript # LN 416 # Source Gene: Z38709 
# Encoded protein: 

ttagcacatggacgatgctcaggaaatgctgctagacactgagcgctgaataagtttcat 
aaatacctgcaagtattgcatcaggggatcaatagttaagatttaaaaaaattaacatga 
attcgaagcaatataaaagtataattattctgagagagttttttcagttgaaactttgat 
tacattttaaatgtgttttggatatttcttgtcttaggaatgtctttatcgtgaatctaa 
tttttccagtgaattttgaaatacaaattcaattccacaggatgcaagcttctggattta 
atttggttaataaattatctggtgatatcttcctttggtaaggtttggtagaagcatggc 
atgcaggaatgtgatgcactttaagtcttgtgttggttggatttcttaaaaaaaaa 

SEQ ID NO: 440. >Z38709_P11 # TY Protein # CC f LN 2701 # Source Gene: 

Z38709 # Encoding Transcript: 1 

MTEKMSSFLYIGDIVSLYAEGSVNGFISTLGLVDDRCWHPEAGDLANPPKKFRDCLFKV 
CPMNRYSAQKQYWKAKQAKQGNHTEAALLKKLQHAAELEQKQNESENKKLLGEIVKYSNV 
IQLLHIKSNKYLTVNKRLPALLEKNAMRVSLDAAGNEGSWFYIHPFWKLRSEGDNIWGD 
PCVVLMPVNAGQPLHASNIELLDNPGCKEVNAVNCNTSWKITLFMKYSSYREDVLKGGDW 
RLFHAEQEKFLTCDEYEKKQHIFLRTTLRQSATSATSSKALWEIEWHHDPCRGGAGQWN 
SLFRFKHLATGNYLAAELNPDYRDAQNEGKNVRDGVPPTSKKKRQAGEKIMYTLVSVPHG 
NDIASLFELDATTLQRADCLVPRNSYVRLRHLCTNTWVTSTSIPIDTDEERPVMLKIGTC 
QTKEDKEAFAIVSVPLSEVRDLDFANDANKVLATTVKKLENGTITQNERRFVTKLLEDLI 
FFVADVPNNGQEVLDWITKPNRERQKLMREQNILAQVFGILKAPFKEKAGEGSMLRLED 
LGDQRYAPYKYMLRLCYRVLRHSQQDYRKNQEYIAKNFCVMQSQIGYDILAEDTITPLLH 
NNRKLLEKHITAKEIETFVSLLRRNREPRFLDYLSDLCVSNTTAIPVTQELICKFMLSPG 
NADILIQTKWSMQADNPMESSILSDDIDDEEVWLYWIDSNKEPHGKAIRHLAQEAKEGT 
KADLEVLTYYRYQLNLFARMCLDRQYLAINQISTQLSVDLILRCVSDESLPFDLRASFCR 
LMLHMHVDRDPQESVVPVRYARLWTEIPTKITIHEYDSITDSSRNDMKRKFALTMEFVEE 
YLKEWNQPFPFGDKEKNKLTFEWHLARNLIYFGFYS FSELLRLTRTLLAILDIVQAPM 
SSYFERLSKFQDGGNNVMRTIHGVGEMMTQMVLSRGSIFPMSVPDVPPSIHPSKQGSPTE 
HEDVTVMDTKLKIIEILQFILSVRLDYRISYMLSIYKKEFGEDNDNAETSASGSPDTLLP 
SAIVPDIDEIAAQAETMFAGRKEKNPVQLDDEGGRTFLRVLIHLIMHDYAPLLSGALQLL 
FKHFSQRAEVLQAFKQVQLLVSNQDVDNYKQIECADLDQLRLTVEKSELWVEKSSNYENGE 
IGESQVKGGEEPIEESNILSPVQDGTKKPQIDSNKSNKYRIVKEILIRLSKLCVQNKKCR 
NQHQRLLKNMGAHSWLDLLQIPYEKNDEKMNEVMNLAHTFLQNFCRGNPQNQVLLHKHL 
NLFLTPGLLEAETMRHIFMNNYHLCNEISERWQHFVHCIETHGRHVEYLRFLQTIVKAD 
GKYVKKCQDMVMTELINGGEDVLIFYNDRASFPILLHMMCSERDRGDESGPLAYHITLVE 
LLAACTEGKNVYTEIKCNSLLPLDDIVRWTHDDCIPEVKIAYVNFVNHCYVDTEVEMKE 
IYTSNHIWKLFENFLVDMARVCNTTTDRKHADIFLEKCVTESIMNIVSGFFNSPFSDNST 
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SLQTHQPVFIQLLQSAFRIYNCTWPNPAQKASVESCIRTLAEVAKNRGIAIPVDLDSQVN 
TLFMKSHSNMVQRAAMGWRLSARSGPRFKEALGGPAWDYRNIIEKLQDVVASLEHQFSPM 
MQAEFSVLVDVLYSPELLFPEGSDARIRCGAFMSKLINHTKKLMEKEEKLCIKILQTLRE 
MLEKKDSFVEEGNTLRKILLNRYFKGDYSIGVNGHLSGAYSKTAQVGGSFSGQDSDKMGI 
SMSDIQCLLDKEGASELVIDVIVNTKNDRIFSEGIFLGIALLEGGNTQTQYSFYQQLHEQ 
KKSEKFFKVLYDRMKAAQKEIRSTVTVNTIDLGNKKRDDDNELMTSGPRMRVRDSTLHLK 
EGMKGQLTEASSATSKAYCVYRREMDPEIDIMCTGPEAGNTEEKSAEEVTMSPAIAIMQP 
ILRFLQLLCENHNRELQNFLRNQNNKTNYNLVCETLQFLDCICGSTTGGLGLLGLYINEK 
NVALVNQNLESLTEYCQGPCHENQTCIATHESNGIDIIIALILNDINPLGKYRMDLVLQL 
ECNNASKLLLAIMESRHDSENAERILFNMRPRELVDVMKNAYNQGLECDHGDDEGGDDGVS 
PKDVGHNIYILAHQLARHNKLLQQMLKPGSDPDEGDEALKYYANHTAQIEIVRHDRTMEQ 
IVFPVPNICEYLTRESKCRVFNTTERDEQGSKVNDFFQQTEDLYNEMKWQKKIRNNPALF 
WFSRHISLWGSISFNLAVFINLAVALFYPFGDDGDEGTLSPLFSVLLWIAVAICTSMLFF 
FS KPVG I RPFLVS I MLRS I Y T I GLGPTL I LLGAANLCNKI VFLVS FVGNRGT FTRG YRAV 
ILDMAFLYHVAYVLVCMLGLFVHEFFYSFLLFDLVYREETLLNVIKSVTRNGRSIILTAV 
LALILVYLFSIIGFLFLKDDFTMEVDRLKNRTPVTGSHQVPTMTLTTMMEACAKENCSPT 
IPASNTADEEYEDGIERTCDTLLMCIVTVLNQGLRNGGGVGDVLRRPSKDEPLFAARWY 
DLLFYFI VI I I VLNLI FGVI IDTFADLRSEKQKKEEI LKTTCFICGLERDKFDNKTVSFE 
EHIKSEHNMWHYLYFIVLVKVKDPTEYTGPESYVAQMIVEKNLDWFPRMRAMSLVSNEGD 

SEQNEIRSLQEKLESTMSLVKQLSGQLAELKEQMTEQRKNKQRLGFLGSNTPHVNHHMPP 
H 

SEQ ID NO:441. >Z38709_P2 # TY Protein # CC #LN 2740 # Source Gene: 

Z38709 # Encoding Transcript: 8 

MTEKMSSFLYIGDIVSLYAEGSVNGFISTLGLVDDRCWHPEAGDLANPPKKFRDCLFKV 
C PMNRYSAQKQYWKAKQAKQGN HT EAALLKKLQHAAELEQKQNES ENKKLLGE I VK YSN V 
IQLLHIKSNKYLTVNKRLPALLEKNAMRVSLDAAGNEGSWFYIHPFWKLRSEGDNIWGD 
KWLMPVNAGQPLHASNIELLDNPGCKEVNAVNCNTSWKITLFMKYSSYREDVLKGGDW 
RLFHAEQEKFLTCDEYEKKQHIFLRTTLRQSATSATSSKALWEIEWHHDPCRGGAGQWN 
SLFRFKHLATGNYLAAELNPDYRDAQNEGKNVRDGVPPTSKKKRQAGEKIMYTLVSVPHG 
NDIASLFELDATTLQRADCLVPRNSYVRLRHLCTNTWVTSTSI PI DTDEERPVMLKIGTC 
QTKEDKEAFAIVSVPLSEVRDLDFANDANKVLATTVKKLENGTITQNERRFVTKLLEDLI 
FFVADVPNNGQEVLDWITKPNRERQKLMREQNILAQVFGILKAPFKEKAGEGSMLRLED 
LGDQRYAPYKYMLRLCYRVLRHSQQDYRKNQEYIAKNFCVMQSQIGYDILAEDTITALLH 
NNRKLLEKHITAKEIETFVSLLRRNREPRFLDYLSDLCVSNTTAIPVTQELICKFMLSPG 
NADILIQTKWSMQADNPMESSILSDDIDDEEVWLYWIDSNKEPHGKAIRHLAQEAKEGT 
KADLEVLTYYRYQLNLFARMCLDRQYLAINQISTQLSVDLILRCVSDESLPFDLRASFCR 
LMLHMHVDRDPQESWPVRYARLWTEIPTKITIHEYDSITDSSRNDMKRKFALTMEFVEE 
YLKEVVNQPFPFGDKEKNKLTFEWHLARNLIYFGFYSFSELLRLTRTLLAILDIVQAPM 
SSYFERLSKFQDGGNNVMRTIHGVGEMMTQMVLSRGSIFPMSVPDVPPSIHPSKQGSPTE 
HEDVTVMDTKLKIIEILQFILSVRLDYRISYMLSIYKKEFGEDNDNAETSASGSPDTLLP 
SAIVPDIDEIAAQAETMFAGRKEKNPVQLDDEGGRTFLRVLIHLIMHDYPPLLSGALQLL 
FKHFSQRAEVLQAFKQVQLLVSNQDVDNYKQIKADLDQLRLTVEKSELWVEKSSNYENGE 
IGESQVKGGEEPIEESNILSPVQDGTKKPQIDSNKSNNYRIVKEILIRLSKLCVQNKKCR 
NQHQRLLKNMGAHSWLDLLQIPYEKNDEKMNEVMNLAHTFLQNFCRGNPQNQVLLHKHL 
NLFLTPGLLEAETMRHIFMNNYHLCNEISERWQHFVHCIETHGRHVEYLRFLQTIVKAD 
GKYVKKCQDMVMTELINGGEDVLIFYNDRASFPILLHMMCSERDRGDESGPLAYHITLVE 
LLAACTEGKNVYTEIKCNSLLPLDDIVRWTHDDCIPEVKIAYVNFVNHCYVDTEVEMKE 
IYTSNHIWKLFENFLVDMARVCNTTTDRKHADIFLEKCVTESIMNIVSGFFNSPFSDNST 
SLQTHQPVFIQLLQSAFRIYNCTWPNPAQKASVESCIRTLAEVAKNRGIAIPVDLDSQVN 
TLFMKSHSNMVQRAAMGWRLSARSGPRFKEALGGPAWDYRNIIEKLQDWASLEHQFSPM 
MQAEFSVLVDVLYSPELLFPEGSDARIRCGAFMSKLINHTKKLMEKEEKLCIKILQTLRE 
MLEKKDSFVEEGNTLRKILLNRYFKGDYSIGVNGHLSGAYSKTAQVGGSFSGQDSDKMGI 
SMS DIQCLLDKEGASELV I DVI VNTKNDRI FSEGI FLGI ALLEGGNTQTQ YS FYQQLHEQ 
KKSEKFFKVLYDRMKAAQKEIRSTVTVNTIDLGNKKRDDDNELMTSGPRMREFWSWGDPE 
NGSAGLNLNENNQCKATNGMKDVGFHCGMHVRDSTLHLKEGMKGQLTEASSATSKAYCVY 
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RREMDPEI DIMCTGPEAGNTEEKSAEEVTMS PAI AIMQPILRFLQLLCENHNRELQNFLR 

NQNNKTNYNLVCETLQFLDCICGSTTGGLGLLGLYINEKNVALVNQNLESLTEYCQGPCH 

ENQTCIATHESNGIDI I IALI LNDINPLGKYRMDLVLQLKNNASKLLLAIMESRHDSENA 

ERILFNMRPRELVDVMKNAYNQGLECDHGDDEGGDDGVSPKDVGHNIYILAHQLARHNKL 

LQQMLKPGSDPDEGDEALKYYANHTAQIEIVRHDRTMEQIVFPVPNICEYLTRESKCRVF 

NTTERDEQGSKVNDFFQQTEDLYNEMKWQKKIRNNPALFWFSRHISLWGSISFNLAVFIN 

LAVALFYPFGDDGDEGTLSPLFSVLLWIAVAICTSMLFFFSKPVGIRPFLVSIMLRSIYT 

IGLGPTLI LLGAANLCNKI VFLVS FVGNRGT FTRG YRAV I LDMAFL YH VAYVLVCMLGLF 

VHEFFYSFLLFDLVYREETLLNVIKSVTRNGRSIILTAVLALILVYLFSIIGFLFLKDDF 

TMEVDRLKNRTPVTGSHQVPTMTLTTMMEACAKENCSPTIPASNTADEEYEDGIERTCDT 

LLMCI VTVLNQGLRNGGGVGDVLRRPSKDEPLFAARVVYDLLFYFI VI I I VLNLI FGVI I 

DTFADLRSEKQKKEEILKTTCFICGLERDKFDNKTVS FEEHIKSEHNMWHYLYFIVLVKV 

KDPTEYTGPESYVAQMIVEKNLDWFPRMRAMSLVSNEGD 

SEQNEIRSLQEKLESTMSLVK 

QLSGQLAELKEQMTEQRKNKQRLGFLGSNTPHVNHHMPPH 

SEQ ID NO: 442. >Z38709_P3 # TY Protein # CC #LN 2351 # Source Gene: 

Z38709 # Encoding Transcript: 9 

MYTLVSVPHGNDIASLFELDATTLQRADCLVPRNSYVRLRHLCTNTWVTSTSIPIDTDEE 
RPVMLKIGTCQTKEDKEAFAIVSVPLSEVRDLDFANDANECVLATTVKKLENGTITQNERR 
FVTKLLEDLIFFVADVPNNGQEVLDVVITKPNRERQKLMREQNILAQVFGILECAPFKEKA 
GEGSMLRLEDLGDQRYAPYKYMLRLCYRVLRHSQQDYRKNQEYIAKNFCVMQSQIGYDIL 
AEDTITALLHNNRKLLEKHITAKEIETFVSLLRRNREPRFLDYLSDLCVSNTTAIPVTQE 
LICKFMLSPGNADILIQTKWSMQADNPMESSILSDDIDDEEVWLYWIDSNKEPHGKAIR 
HLAQEAKEGTKADLEVLTYYRYQLNLFARMCLDRQYLAINQISTQLSVDLILRCVSDESL 
PFDLRASFCRLMLHMHVDRDPQESWPVRYARLWTEIPTKITIHEYDSITDSSRNDMKRK 
FALTMEFVEEYLKEWNQPFPFGDKEKNKLTFEVVHLARNLIYFGFYSFSELLRLTRTLL 
AILDIVQAPMSSYFERLSKFQDGGNNVMRTIHGVGEMMTQMVLSRGSIFPMSVPDVPPSI 
HPSKQGSPTEHEDVTVMDTKLKIIEILQFILSVRLDYRISYMLSIYKKEFGEDNDNAETS 
ASGSPDTLLPSAIVPDIDEIAAQAETMFAGRKEKNPVQLDDEGGRTFLRVLIHLIMHDYP 
PLLSGALQLLFKHFSQRAEVLQAFKQVQLLVSNQDVDNYKQIKADLDQLRLTVEKSELWV 
EKSSNYENGEIGESQVKGGEEPIEESNILSPVQDGTKKPQIDSNKSNNYRIVKEILIRLS 
KLCVQNKKCRNQHQRLLKNMGAHSWLDLLQIPYEKNDEKMNEVMNLAHTFLQNFCRGNP 
QNQVLLHKHLNLFLTPGLLEAETMRHIFMNNYHLCNEISERWQHFVHCIETHGRHVEYL 
RFLQTIVKADGKYVKKCQDMVMTELINGGEDVLIFYNDRASFPILLHMMCSERDRGDESG 
PLAYHITLVELLAACTEGKNVYTEIKCNSLLPLDDIVRVVTHDDCIPEVKIAYVNFVNHC 
YVDTEVEMKEIYTSNHIWKLFENFLVDMARVCNTTTDRKHADIFLEKCVTESIMNIVSGF 
FNSPFSDNSTSJjQTHQPVFIQLLQSAFRIYNCTWPNPAQKASVESCIRTLAEVAKNRGIA 
I PVDLDSQVNTLFMKSHSNMVQRAAMGWRLSARSGPRFKEALGGPAWDYRNI IEKLQDW 
ASLEHQFSPMMQAEFSVLVDVLYSPELLFPEGSDARIRCGAFMSKLINHTKKLMEKEEKL 
CIKILQTLREMLEKKDSFVEEGNTLRKILLNRYFKGDYSIGVNGHLSGAYSKTAQVGGSF 
SGQDSDKMGISMSDIQCLLDKEGASELVIDVIVNTKNDRIFSEGIFLGIALLEGGNTQTQ 
YSFYQQLHEQKKSEKFFKVLYDRMKAAQKEIRSTVTVNTIDLGNKKRDDDNELMTSGPRM 
RVRDSTLHLKEGMKGQLTEASSATSKAYCVYRREMDPEIDIMCTGPEAGNTEEKSAEEVT 
MSPAIAIMQPILRFLQLLCENHNRELQNFLRNQNNKTNYNLVCETLQFLDCICGSTTGGL 
GLLGLYINEKNVALVNQNLESLTEYCQGPCHENQTCIATHESNGIDI I IALILNDINPLG 
KYRMDLVLQLKNNASKLLLAIMESRHDSENAERILFNMRPRELVDVMKNAYNQGLECDHG 
DDEGGDDGVSPKDVGHNIYILAHQLARHNKLLQQMLKPGSDPDEGDEALKYYANHTAQIE 
IVRHDRTMEQIVFPVPNICEYLTRESKCRVFNTTERDEQGSKVNDFFQQTEDLYNEMKWQ 
KKIRNNPALFWFSRHISLWGSISFNLAVFINLAVALFYPFGDDGDEGTLSPLFSVLLWIA 
VAICTSMLFFFSKPVGIRPFLVSIMLRS I YTIGLGPTLI LLGAANLCNKI VFLVSFVGNR 
GTFTRGYRAVILDMAFLYHVAYVLVCMLGLFVHEFFYSFLLFDLVYREETLLNVIKSVTR 
NGRSIILTAVLALILVYLFSIIGFLFLKDDFTMEVDRLKNRTPVTGSHQVPTMTLTTMME 
ACAKENCSPTIPASNTADEEYEDGIERTCDTLLMCIVTVLNQGLRNGGGVGDVLRRPSKD 
EPLFAARWYDLLFYFIVIIIVLNLIFGVIIDTFADLRSEKQKKEEILKTTCFICGLERD 
KFDNKTVSFEEHIKSEHNMWHYLYFIVLVKVKDPTEYTGPESYVAQMIVEKNLDWFPRMR 
AMSLVSNEGD 
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SEQNEIRSLQEKLESTMSLVKQLSGQLAELKEQMTEQRKNKQRLGFLGSN 
TPHVNHHMPPH 

SEQ ID NO:443. >Z38709_P4 # TY Protein # CC #LN 2583 # Source Gene: 

Z38709 # Encoding Transcript: 10 

MSSFLYIGDIVSLYAEGSVNGFISTLGLVDDRCVVHPEAGDLANPPKKFRDCLFKVCPMN 
RYSAQKQYWKAKQAKQGNHTEAALLKKLQHAAELEQKQNESENKKLLGEIVKYSNVIQLL 
H IKSNKYLTVNKRLPALLEKNAMRVSLDAAGNEGSWFY I H PFWKLRS EGDN I WGDKWL 
MPVNAGQPLHASNIELLDNPGCKEVNAVNCNTSWKITLFMKYSSYREDVLKGGDVVRLFH 
AEQEKFLTCDEYEKKQHIFLRTTLRQSATSATSSKALWEIEWHHDPCRGGAGQWNSLFR 
FKHLATGNYLAAELNPDYRDAQNEGKNVRDGVPPTSKKKRQAGEKIMYTLVSVPHGNDIA 
SLFELDATTLQRADCLVPRNSYVRLRHLCTNTWVTSTSIPIDTDEERPVMLKIGTCQTKE 
DKEAFAIVSVPLSEVRDLDFANDANKVLATTVKKLENGTITQNERRFVTKLLEDLIFFVA 
DVPNNGQEVLDWITKPNRERQKLMREQNILAQVFGILKAPFKEKAGEGSMLRLEDLGDQ 
RYAPYKYMLRLCYRVLRHSQQDYRKNQEYIAKNFCVMQSQIGYDILAEDISGTLLKRQKK 
APKLTSEVLTYYRYQLNLFARMCLDRQYLAINQISTQLSVDLILRCVSDESLPFDLRASF 
CRLMLHMHVDRDPQESVVPVRYARLWTEIPTKITIHEYDSITDSSRNDMKRKFALTMEFV 
EEYLKEWNQPFPFGDKEKNKLTFEWHLARNLIYFGFYSFSELLRLTRTLLAILDIVQA 
PMSSYFERLSKFQDGGNNVMRTIHGVGEMMTQMVLSRGSIFPMSVPDVPPSIHPSKQGSP 
TEHEDVTVMDTKLKIIEILQFILSVRLDYRISYMLSIYECKEFGEDNDNAETSASGSPDTL 
LPSAIVPDIDEIAAQAETMFAGRKEKNPVQLDDEGGRTFLRVLIHLIMHDYPPLLSGALQ 
LLFKHFSQRAEVLQAFKQVQLLVSNQDVDNYKQIKADLDQLRLTVEKSELWVEKSSNYEN 
GEIGESQVKGGEEPIEESNILSPVQDGTKKPQIDSNKSNNYRIVKEILIRLSKLCVQNKK 
CRNQHQRLLKNMGAHSVVLDLLQIPYEKNDEKMNEVMNLAHTFLQNFCRGNPQNQVLLHK 
HLNLFLT PGLLEAETMRHIFMNNYHLCNEISERWQHFVHCIETHGRHVEYLRFLQTIVK 
ADGKYVKKCQDMVMTELINGGEDVLIFYNDRASFPILLHMMCSERDRGDESGPLAYHITL 
VELLAACTEGKNVYTEIKCNSLLPLDDIVRVVTHDDCIPEVKIAYVNFVNHCYVDTEVEM 
KEIYTSNHIWKLFENFLVDMARVCNTTTDRKHADIFLEKCVTESIMNIVSGFFNSPFSDN 
STSLQTHQPVFIQLLQSAFRIYNCTWPNPAQKASVESCIRTLAEVAKNRGIAIPVDLDSQ 
VNTLFMKSHSNMVQRAAMGWRLSARSGPRFKEALGGPAWDYRNIIEKLQDWASLEHQFS 
PMMQAEFSVLVDVLYSPELLFPEGSDARIRCGAFMSKLINHTKKLMEKEEKLCIKILQTL 
REMLEKKDSFVEEGNTLRKILLNRYFKGDYSIGVNGHLSGAYSKTAQVGGSFSGQDSDKM 
GISMSDIQCLLDKEGASELVIDVIVNTKNDRIFSEGIFLGIALLEGGNTQTQYSFYQQLH 
EQKKSEKFFECVLYDRMKAAQKEIRSTVTVNTIDLGNKKRDDDNELMTSGPRMRVRDSTLH 
LKEGMKGQLTEASSATSKAYCVYRREMDPEIDIMCTGPEAGNTEEKSAEEVTMSPAIAIM 
QPILRFLQLLCENHNRELQNFLRNQNNJCTNYNLVCETLQFLDCICGSTTGGLGLLGLYIN 
EKNVALVNQNLESLTEYCQGPCHENQTCIATHESNGIDIIIALILNDINPLGKYRMDLVL 
QLKNNASKLLLAIMESRHDSENAERILFNMRPRELVDVMKNAYNQGLECDHGDDEGGDDG 
VSPKDVGHNIYILAHQLARHNKLLQQMLKPGSDPDEGDEALKYYANHTAQIEIVRHDRTM 
EQIVFPVPNICEYLTRESKCRVFNTTERDEQGSKVNDFFQQTEDLYNEMKWQKKIRNNPA 
LFWFSRHISLWGSISFNLAVFINLAVALFYPFGDDGDEGTLSPLFSVLLWIAVAICTSML 
FFFSKPVGI RP FLVS IMLRSI YT I GLG PTLI LLGAANLCNKI VFLVS FVGNRGTFTRGYR 
AVILDMAFLYHVAYVLVCMLGLFVHEFFYSFLLFDLVYREETLLNVIKSVTRNGRSIILT 
AVLALILVYLFSIIGFLFLKDDFTMEVDRLKNRTPVTGSHQVPTMTLTTMMEACAKENCS 
PTIPASNTADEEYEDGIERTCDTLLMCIVTVLNQGLRNGGGVGDVLRRPSKDEPLFAARV 
VYDLLFYFIVIIIVLNLIFGVIIDTFADLRSEKQKKEEILKTTCFICGLERDKFDNKTVS 
FEEHIKSEHNMWHYLYFIVLVKVKDPTEYTGPESYVAQMIVEKNLDWFPRMRAMSLVSNE 
GD 

S EQNE I RSLQEKLESTMSLVKQLSGQLAELKEQMTEQRKNKQRLGFLGSNT PHVNHHM 
PPH 

SEQ ID NO: 44 4 . >Z38709_P5 # TY Protein # CC #LN 1694 # Source Gene: 

Z38709 # Encoding Transcript: 11 

MTEKMSSFLYIGDIVSLYAEGSVNGFISTLGLVDDRCWHPEAGDLANPPKKFRDCLFKV 
CPMNRYSAQKQYWKAKQAKQGNHTEAALLKKLQHAAELEQKQNESENKKLLGEIVKYSNV 
IQLLHIKSNKYLTVNKRLPALLEKNAMRVSLDAAGNEGSWFYIHPFWKLRSEGDNIVVGD 
KWLMPVNAGQPLHASNIELLDNPGCKEVNAVNCNTSWKITLFMKYSSYREDVLKGGDW 
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RLFHAEQEKFLTCDEYEKKQHIFLRTTLRQSATSATSSKALWEIEWHHDPCRGGAGQWN 
SLFRFKHLATGNYLAAELNPDYRDAQNEGKNVRDGVPPTSKKKRQAGEKIMYTLVSVPHG 
NDIASLFELDATTLQRADCLVPRNSYVRLRHLCTNTWVTSTSIPIDTDEERPVMLKIGTC 
QTKEDKEAFAIVSVPLSEVRDLDFANDANKVLATTVKKLENGTITQNERRFVTKLLEDLI 
FFVADVPNNGQEVLDVVITKPNRERQKLMREQNILAQVFGILKAPFKEKAGEGSMLRLED 
LGDQRYAP YKYMLRLC YRVLRHSQQD YRKNQE Y I AKN FCVMQSQI GYDI LAEDT I TALLH 
NNRKLLEKHITAKEIETFVSLLRRNREPRFLDYLSDLCVSNTTAIPVTQELICKFMLSPG 
NADILIQTBCWSMQADNPMESSILSDDIDDEEVWLYWIDSNKEPHGKAIRHLAQEAKEGT 
KADLEVLTYYRYQLNLFARMCLDRQYLAINQISTQLSVDLILRCVSDESLPFDLRASFCR 
LMLHMHVDRDPQESVVPVRYARLWTEIPTKITIHEYDSITDSSRNDMKRKFALTMEFVEE 
YLKEWNQPFPFGDKEKNKLTFEVVHLARNLIYFGFYSFSELLRLTRTLLAILDIVQAPM 
SSYFERLSKFQDGGNNVMRTIHGVGEMMTQMVLSRGSIFPMSVPDVPPSIHPSKQGSPTE 
HEDVTVMDTKLKIIEILQFILSVRLDYRISYMLSIYKKEFGEDNDNAETSASGSPDTLLP 
SAIVPDIDEIAAQAETMFAGRKEKNPVQLDDEGGRTFLRVLIHLIMHDYPPLLSGALQLL 
FKHFSQRAEVLQAFKQVQLLVSNQDVDNYKQIKADLDQLRLTVEKSELWVEKSSNYENGE 
IGESQVKGGEEPIEESNILSPVQDGTKKPQIDSNKSNNYRIVKEILIRLSKLCVQNKKCR 
NQHQRLLKNMGAHSWLDLLQIPYEKNDEKMNEVMNLAHTFLQNFCRGNPQNQVLLHKHL 
NLFLTPGLLEAETMRHIFMNNYHLCNEISERWQHFVHCIETHGRHVEYLRFLQTIVKAD 
GKYVKKCQDMVMTELINGGEDVLIFYNDRASFPILLHMMCSERDRGDESGPLAYHITLVE 
LLAACTEGKNVYTEIKCNSLLPLDDIVRWTHDDCIPEVKIAYVNFVNHCYVDTEVEMKE 
IYTSNHIWKLFENFLVDMARVCNTTTDRKHADIFLEKCVTESIMNIVSGFFNSPFSDNST 
SLQTHQPVFIQLLQSAFRIYNCTWPNPAQKASVESCIRTLAEVAKNRGIAIPVDLDSQVN 
TLFMKS HS NMVQRAAMGWRLS ARSG PR FKEALGGP AWD YRN 1 1 EKLQD WAS LEHQ FS PM 
MQAEFSVLVDVLYSPELLFPEGSDARIRCGAFMSKLINHTKKLMEKEEKLCIKILQTLRE 
MLEKKDSFVEEVFF 

SEQ ID NO: 445. >Z38709_P6 # TY Protein # CC #LN 1826 # Source Gene: 

Z38709 # Encoding Transcript: 12 

MTEKMSSFLYIGDIVSLYAEGSVNGFISTLGLVDDRCVVHPEAGDLANPPKKFRDCLFKV 
CPMNRYSAQKQYWKAKQAKQGNHTEAALLKKLQHAAELEQKQNESENKKLLGEIVKYSNV 
IQLLHIKSNKYLTVNKRLPALLEKNAMRVSLDAAGNEGSWFYIHPFWKLRSEGDNIVVGD 
ICWLMPVNAGQPLHASNIELLDNPGCKEVNAVNCNTSWKITLFMKYSSYREDVLKGGDVV 
RLFHAEQEKFLTCDEYEKKQHIFLRTTLRQSATSATSSKALWEIEWHHDPCRGGAGQWN 
SLFRFKHLATGNYLAAELNPDYRDAQNEGKNVRDGVPPTSKKKRQAGEKIMYTLVSVPHG 
NDIASLFELDATTLQRADCLVPRNSYVRLRHLCTNTWVTSTSIPIDTDEERPVMLKIGTC 
QTKEDKEAFAIVSVPLSEVRDLDFANDANKVLATTVKKLENGTITQNERRFVTKLLEDLI 
FFVADVPNNGQEVLDWITKPNRERQKLMREQNILAQVFGILKAPFKEKAGEGSMLRLED 
LGDQRYAPYKYMLRLCYRVLRHSQQDYRKNQEYIAKNFCVMQSQIGYDILAEDTITALLH 
NNRKLLEKHITAKEIETFVSLLRRNREPRFLDYLSDLCVSNTTAIPVTQELICKFMLSPG 
NADILIQTKVVSMQADNPMESSILSDDIDDEEVWLYWIDSNKEPHGKAIRHLAQEAKEGT 
ECADLEVLTYYRYQLNLFARMCLDRQYLAINQISTQLSVDLILRCVSDESLPFDLRASFCR 
LMLHMHVDRDPQESWPVRYARLWTEIPTKITTHEYDSITDSSRNDMKRKFALTMEFVEE 
YLKEWNQPFPFGDKEKNKLTFEWHLARNLIYFGFYSFSELLRLTRTLLAILDIVQAPM 
SSYFERLSKFQDGGNNVMRTIHGVGEMMTQMVLSRGSIFPMSVPDVPPSIHPSKQGSPTE 
HEDVTVMDTKLKIIEILQFILSVRLDYRISYMLSIYKKEFGEDNDNAETSASGSPDTLLP 
SAIVPDIDEIAAQAETMFAGRKEKNPVQLDDEGGRTFLRVLIHLIMHDYPPLLSGALQLL 
FKHFSQRAEVLQAFKQVQLLVSNQDVDNYKQIKADLDQLRLTVEKSELWVEKSSNYENGE 
IGESQVKGGEEPIEESNILSPVQDGTKKPQIDSNKSNNYRIVKEILIRLSKLCVQNKKCR 
NQHQRLLKNMGAHSWLDLLQIPYEKNDEKMNEVMNLAHTFLQNFCRGNPQNQVLLHKHL 
NL FLT PGLLEAETMRH I FMNN YHLCNEI SERWQH FVHCI ETHGRH VE YLRFLQT I VKAD 
GKYVKKCQDMVMTELINGGEDVLIFYNDRASFPILLHMMCSERDRGDESGPLAYHITLVE 
LLAACTEGKNVYTEIKCNSLLPLDDIVRWTHDDCIPEVKIAYVNFVNHCYVDTEVEMKE 
IYTSNHIWKLFENFLVDMARVCNTTTDRKHADIFLEKCVTESIMNIVSGFFNSPFSDNST 
SLQTHQPVFIQLLQSAFRIYNCTWPNPAQKASVESCIRTLAEVAKNRGIAIPVDLDSQVN 
TLFMKSHSNMVQRAAMGWRLSARSGPRFKEALGGPAWDYRNIIEKLQDWASLEHQFSPM 
MQAEFSVLVDVLYSPELLFPEGSDARIRCGAFMSKLINHTKKLMEKEEKLCIKILQTLRE 
MLEKKDSFVEEGNTLRKILLNRYFKGDYSIGVNGHLSGAYSKTAQVGGSFSGQDSDKMGI 
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SMSDIQCLLDKEGASELVIDVIVNTKNDRIFSEGIFLGIALLEGGNTQTQVLVHQSWGLH 
LLLVFLVYLSQKPRKSNVFLLALITI 

SEQ ID NO: 446. >238709_P7 # TY Protein # CC #LN 1052 # Source Gene: 

Z38709 # Encoding Transcript: 13 

MTEKMSSFLYIGDIVSLYAEGSVNGFISTLGLVDDRCWHPEAGDLANPPKKFRDCLFKV 
CPMNRYSAQKQYWKAKQAKQGNHTEAALLKKLQHAAELEQKQNESENKKLLGEIVKYSNV 
IQLLHIKSNKYLTVNKRLPALLEKNAMRVSLDAAGNEGSWFYIHPFWKLRSEGDNIWGD 
KVVLMPVNAGQPLHASNIELLDNPGCKEVNAVNCNTSWKITLFMKYSSYREDVLKGGDVV 
RLFHAEQEKFLTCDEYEKKQHIFLRTTLRQSATSATSSKALWEIEVVHHDPCRGGAGQWN 
SLFRFKHLATGNYLAAELNPDYRDAQNEGKNVRDGVPPTSKKKRQAGEKIMYTLVSVPHG 
NDIASLFELDATTLQRADCLVPRNSYVRLRHLCTNTWVTSTSIPIDTDEERPVMLKIGTC 
QTKEDKEAFAIVSVPLSEVRDLDFANDANKVLATTVKKLENGTITQNERRFVTKLLEDLI 
FFVADVPNNGQEVLDVVITKPNRERQKLMREQNILAQVFGILKAPFKEKAGEGSMLRLED 
LGDQRYAPYKYMLRLCYRVLRHSQQDYRKNQEYIAKNFCVMQSQIGYDILAEDTITALLH 
NNRKLLEKHITAKEIETFVSLLRRNREPRFLDYLSDLCVSNTTAIPVTQELICKFMLSPG 
NADILIQTKWSMQADNPMESSILSDDIDDEEVWLYWIDSNKEPHGKAIRHLAQEAKEGT 
KADLEVLTYYRYQLNLFARMCLDRQYLAINQISTQLSVDLILRCVSDESLPFDLRASFCR 
LMLHMHVDRDPQESVVPVRYARLWTEIPTKITIHEYDSITDSSRNDMKRKFALTMEFVEE 
YLKEWNQPFPFGDKEKNKLTFEWHLARNLIYFGFYSFSELLRLTRTLLAILDIVQAPM 
SSYFERLSKFQDGGNNVMRTIHGVGEMMTQMVLSRGSIFPMSVPDVPPSIHPSKQGSPTE 
HEDVTVMDTKLKIIEILQFILSVRLDYRISYMLSIYKKEFGEDNDNAETSASGSPDTLLP 
SAIVPDIDEIAAQAETMFAGRYSNIYISLSYL 

SEQ ID NO: 447. >Z38709_P12 # TY Protein # CC #LN 181 # Source Gene: 

Z38709 # Encoding Transcript: 14 

MTEKMSSFLYIGDIVSLYAEGSVNGFISTLGLVDDRCWHPEAGDLANPPKKFRDCLFKV 
CPMNRYSAQKQYWKAKQAKQGNHTEAALLKKLQHAiAELEQKQNESENKKLLGEIVKYSNV 
I QLLH I KSNKYLTVNKRLPALLEKNAMRVSLDAAGNEGSWFY IH PFWKLRSEGDNDAS FW 
I 

SEQ ID NO:448. >Z38709_P9 ft TY Protein # CC #LN 322 # Source Gene: Z38709 
# Encoding Transcript: 15 

MTEKMSSFLYIGDIVSLYAEGSVNGFISTLGLVDDRCWHPEAGDLANPPKKFRDCLFKV 
CPMNRYSAQKQYWKAKQAKQGNHTEAALLKKLQHAAELEQKQNESENKKLLGEIVKYSNV 
IQLLHIKSNKYLTVNKRLPALLEKNAMRVSLDAAGNEGSWFYIHPFWKLRSEGDNIVVGD 
KVVLMPVNAGQPLHASNI ELLDNPGCKEVNAVNCNTSWKI TLFMKYSS YREDVLKGGDW 
RLFHAEQEKFLTCDEYEKKQHIFLRTTLRQSATSATSSKALWEIEWHHDPCRGGAGQWN 
SLFRFKHLATGNYLAAEAYQSQ 

SEQ ID NO:449. >Z38709_P10 # TY Protein # CC #LN 424 # Source Gene: 

Z38709 # Encoding Transcript: 16 

MTEKMSSFLYIGDIVSLYAEGSVNGFISTLGLVDDRCVVHPEAGDLANPPKKFRDCLFKV 
C PMNR YS AQKQ YWKAKQAKQGNHTE AALLKKLQHAAELEQKQNE SENKKLLGE I VKYSN V 
IQLLHIKSNKYLTVNKRLPALLEKNAMRVSLDAAGNEGSWFYIHPFWKLRSEGDNIWGD 
KVVLMPVNAGQPLHASNI ELLDNPGCKEVNAVNCNTSWKI TLFMKYSS YREDVLKGGDW 
RLFHAEQEKFLTCDEYEKKQHI FLRTTLRQSATSATSSKALWEI EWHHDPCRGGAGQWN 
SLFRFKHLATGNYLAAELNPDYRDAQNEGKNVRDGVPPTSKKKRQAGEKIMYTLVSVPHG 
NDIASLFELDATTLQRADCLVPRNS YVRLRHLCTNTWVTSTS I PI DTDEERPVMLKVNVT 
EDGL 



SEQ ID NO:450. >Z39663 # TY Consensus # Length 4094 # Number of exons 16 
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ttcgaccaataagacaaagaaggaaagtacttcagcgcttgctgttttgtaatactggct 
ttgattgggagaaaataagtgggaaacaagagtagcttggacagggggcttggacggcca 
gcgcattgaccagaagcggcccacactacacgcgcgcgaacacacacacacacacacgca 
cacgccgcgcgcggtcagctagagtttggctactggacccaggaagggagggagcgggag 
gagctggtggctgtcgggtcggggtctgggcgccaccgagcgcccgccccacctctggag 
tgcgcagaaggcggtgcgtcccgggccccggcagcccagcccaggcgctctggccaagtt 
tgcgcgcggtctccgctgactctcgggttacctgagccggcaaccacgtcagcgccacat 
catctgggctttttatattgcaaggaaacaggaaaagaaggaagaaaaacatcgcccgag 
agagccaatcgcaggaagacggcgctgcccggaggagcggggcgggcgggcgcgcggggg 
agcgggcggcgggcgggagccaggcccgggcgggggcgggggcggcggggccagaagagg 
cggcgggccgcgctccggccggtctgcggcgttggccttggctttggctttggcggcggc 
ggtggagaagatgctgcagtccctggccggcagctcgtgcgtgcgcctggtggagcggca 
ccgctcggcctggtgcttcggcttcctggtgctgggctacttgctctacctggtcttcgg 
cgcagtggtcttctcctcggtggagctgccctatgaggacctgctgcgccaggagctgcg 
caagctgaagcgacgcttcttggaggagcacgagtgcctgtctgagcagcagctggagca 
gttcctgggccgggtgctggaggccagcaactacggcgtgtcggtgctcagcaacgcctc 
gggcaactggaactgggacttcacctccgcgctcttcttcgccagcaccgtgctctccac 
cacagacttgagtcagtagaaaatggggagcctctgtgggctcctggtcctgcaccgtga 
taaatagcctgtcctgcaacctggcttaaaacagggaaagccggagttaggcaacagagt 
gcctagctttctcagtcattcaaacacaagcacggaagaagagattgcactgaccagagg 
accgccagggtgtgatgctccgtagtcgcagggatgaaggaaaaccatgggcggagcttg 
tcatggacagctcctaggtggacataaatggaaactcattggttctggggcacggcggga 
agagggagacggccctggacggccgcaccaagaatttcacaacccaaacccgactagaaa 
gcctggtgactgagaagagctaaaccccagtagagccaagttgggaggtggtgaaaaaaa 
tccaggtgaatggggctaagaccccatacccaggaagcgagctggcatcagaagcagaag 
actgaggtcatttttatggatgggagatgtgctggtgggtttttagaagagctcatgtca 
gttggcatccatttcttcagtaaggtgagaaggtcctgagggttttatctcctgatttgg 
ttagaagattgttgaatctggctgtgagagcctttcaagtgcctaaaggagagagcttga 
acttccctgagagtacaccagagggactccttcctccttctggccttgtgagattagagt 
gagaagtctatgaggaagcagccctcaccagacactgaatctgctggtgccttgatcaga 
gacttcccagccactaaaactggacagtcttcattgatgaccttactatttaccacagaa 
gttcatctatgtggctccaaactctatttcaagaatatgaaaggcatgtaggttatgcct 
acgttctcggacccttgatagatgttttgctgacccaccttctgatatctgtagagttac 
cctaggctccaccttccaccaaccagttctgtgtcaaagtgcaaaatgaaaatcacagaa 
aaagtcaacctgacctgaagcttattcatcaccgtgcaatgagtttcaccatacagtggg 
tagcctcggatgagtttttcctgatggtctaagatgggtgtgggccttagcttcttagta 
cacagaacttgtgtttgagcctgatagtgaattcaagaccagagaccctgatgtgttgga 
attgctggttggattttacaaagaatggtggactttcaaacttatggtgaccactgttga 
attttactgggacatgacaattatactagagcccctaaagtggcagacctgaagtctcgc 
tgttctctttggtttgaggggaataagggcagatgataggcatatattgtttaaaaacgt 
gtttgccactttgtctcttcctcttcgcctcagtgaccttgttctccttgcagggttatg 
gccacaccgtgcccttgtcagatggaggtaaggccttctgcatcatctactccgtcattg 
gcattcccttcaccctcctgttcctgacggctgtggtccagcgcatcaccgtgcacgtca 
cccgcaggccggtcctctacttccacatccgctggggcttctccaagcaggtggtggcca 
tcgtccatgccgtgctccttgggtttgtcactgtgtcctgcttcttcttcatcccggccg 
ctgtcttctcagtcctggaggatgactggaacttcctggaatccttttatttttgtttta 
tttccctgagcaccattggcctgggggattatgtgcctggggaaggctacaatcaaaaat 
tcagagagctctataagattgggatcacgtgttacctgctacttggccttattgccatgt 
tggtagttctggaaaccttctgtgaactccatgagctgaaaaaattcagaaaaatgttct 
atgtgaagaaggacaaggacgaggatcaggtgcacatcatagagcatgaccaactgtcct 
tctcctcgatcacagaccaggcagctggcatgaaagaggaccagaagcaaaatgagcctt 
ttgtggccacccagtcatctgcctgcgtggatggccctgcaaaccattgagcgtaggatt 
tgttgcattatgctagagcaccagggtcagggtgcaaggaagaggcttaagtatgttcat 
ttttatcagaatgcaaaagcgaaaattatgtcactttaagaaatagctactgtttgcaat 
gtcttattaaaaaacaacaaaaaaagacaaatggaacaaagaagctgtgaccccagcagg 
atgtctaatatgtgaggaaatgagatgtccacctaaaattcatatgtgacaaaattatct 
cgaccttacataggaggagaatacttgaagcagtatgctgctgtggttagaagcagattt 
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tatacttttaactggaaactttggggtttgcatttagatcatttagctgatggctaaata 
gcaaaatttatatttagaagcaaaaaaaaaaagcatagagatgtgttttataaataggtt 
tatgtgtactggtttgcatgtacccacccaaaatgattatttttgtagaatctaagttaa 
acttactatttataatgcataggtaaccattaactatgtacatataaagtataaatatgt 
ttatattctgtacatatggtttaggtcaccagatcctagtgtagttctgaaactaagact 
atagatattttgtttcttttgatttctctttatactaaagaatccagagttgctacaata 
aaataaggggaataataaacttgagagtgaataaccatagtattctgctgcaataaatgt 
ttctaccctcttgcgatagaatggtgtcagatgtctaacatgtttataaacatttttttg 
aattgaacaggttatgaaaaggatttattaaaaggttaagatactttgttttgaaagtac 
attgtgatttgcagccacctagcactaaagcatagaacttaaacacgaatccctaaatga 
acagattatgttacattaatctgagaattttaaatttagaagctaaaataaataaaaatc 
acagctggttgtaa 

SEQ ID NO: 451. >Z39663_T1 # TY Transcript # LN 2390 # Source Gene: Z39663 
# Encoded protein: Z39663_P1 

ttcgaccaataagacaaagaaggaaagtacttcagcgcttgctgttttgtaatactggct 
ttgattgggagaaaataagtgggaaacaagagtagcttggacagggggcttggacggcca 
gcgcattgaccagaagcggcccacactacacgcgcgcgaacacacacacacacacacgca 
cacgccgcgcgcggtcagctagagtttggctactggacccaggaagggagggagcgggag 
gagctggtggctgtcgggtcggggtctgggcgccaccgagcgcccgccccacctctggag 
tgcgcagaaggcggtgcgtcccgggccccggcagcccagcccaggcgctctggccaagtt 
tgcgcgcggtctccgctgactctcgggttacctgagccggcaaccacgtcagcgccacat 
catctgggctttttatattgcaaggaaacaggaaaagaaggaagaaaaacatcgcccgag 
agagccaatcgcaggaagacggcgctgcccggaggagcggggcgggcgggcgcgcggggg 
agcgggcggcgggcgggagccaggcccgggcgggggcgggggcggcggggccagaagagg 
cggcgggccgcgctccggccggtctgcggcgttggccttggctttggctttggcggcggc 
ggtggagaagatgctgcagtccctggccggcagctcgtgcgtgcgcctggtggagcggca 
ccgctcggcctggtgcttcggcttcctggtgctgggctacttgctctacctggtcttcgg 
cgcagtggtcttctcctcggtggagctgccctatgaggacctgctgcgccaggagctgcg 
caagctgaagcgacgcttcttggaggagcacgagtgcctgtctgagcagcagctggagca 
gttcctgggccgggtgctggaggccagcaactacggcgtgtcggtgctcagcaacgcctc 
gggcaactggaactgggacttcacctccgcgctcttcttcgccagcaccgtgctctccac 
cacaggttatggccacaccgtgcccttgtcagatggaggtaaggccttctgcatcatcta 
ctccgtcattggcattcccttcaccctcctgttcctgacggctgtggtccagcgcatcac 
cgtgcacgtcacccgcaggccggtcctctacttccacatccgctggggcttctccaagca 
ggtggtggccatcgtccatgccgtgctccttgggtttgtcactgtgtcctgcttcttctt 
catcccggccgctgtcttctcagtcctggaggatgactggaacttcctggaatcctttta 
tttttgttttatttccctgagcaccattggcctgggggattatgtgcctggggaaggcta 
caatcaaaaattcagagagctctataagattgggatcacgtgttacctgctacttggcct 
tattgccatgttggtagttctggaaaccttctgtgaactccatgagctgaaaaaattcag 
aaaaatgttctatgtgaagaaggacaaggacgaggatcaggtgcacatcatagagcatga 
ccaactgtccttctcctcgatcacagaccaggcagctggcatgaaagaggaccagaagca 
aaatgagccttttgtggccacccagtcatctgcctgcgtggatggccctgcaaaccattg 
agcgtaggatttgttgcattatgctagagcaccagggtcagggtgcaaggaagaggctta 
agtatgttcatttttatcagaatgcaaaagcgaaaattatgtcactttaagaaatagcta 
ctgtttgcaatgtcttattaaaaaacaacaaaaaaagacaaatggaacaaagaagctgtg 
accccagcaggatgtctaatatgtgaggaaatgagatgtccacctaaaattcatatgtga 
caaaattatctcgacct'tacataggaggagaatacttgaagcagtatgctgctgtggtta 
gaagcagattttatacttttaactggaaactttggggtttgcatttagatcatttagctg 
atggctaaatagcaaaatttatatttagaagcaaaaaaaaaaagcatagagatgtgtttt 
ataaataggtttatgtgtactggtttgcatgtacccacccaaaatgattatttttgtaga 
atctaagttaaacttactatttataatgcataggtaaccattaactatgtacatataaag 
tataaatatgtttatattctgtacatatggtttaggtcaccagatcctagtgtagttctg 
aaactaagactatagatattttgtttcttttgatttctctttatactaaagaatccagag 
ttgctacaataaaataaggggaataataaacttgagagtgaataaccata 
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SEQ ID NO: 452 ♦ >Z39663_T2 # TY Transcript # LN 2428 # Source Gene: 239663 

# Encoded protein: Z39663_P1 

ttcgaccaataagacaaagaaggaaagtacttcagcgcttgctgttttgtaatactggct 
ttgattgggagaaaataagtgggaaacaagagtagcttggacagggggcttggacggcca 
gcgcattgaccagaagcggcccacactacacgcgcgcgaacacacacacacacacacgca 
cacgccgcgcgcggtcagctagagtttggctactggacccaggaagggagggagcgggag 
gagctggtggctgtcgggtcggggtctgggcgccaccgagcgcccgccccacctctggag 
tgcgcagaaggcggtgcgtcccgggccccggcagcccagcccaggcgctctggccaagtt 
tgcgcgcggtctccgctgactctcgggttacctgagccggcaaccacgtcagcgccacat 
catctgggctttttatattgcaaggaaacaggaaaagaaggaagaaaaacatcgcccgag 
agagccaatcgcaggaagacggcgctgcccggaggagcggggcgggcgggcgcgcggggg 
agcgggcggcgggcgggagccaggcccgggcgggggcgggggcggcggggccagaagagg 
cggcgggccgcgctccggccggtctgcggcgttggccttggctttggctttggcggcggc 
ggtggagaagatgctgcagtccctggccggcagctcgtgcgtgcgcctggtggagcggca 
ccgctcggcctggtgcttcggcttcctggtgctgggctacttgctctacctggtcttcgg 
cgcagtggtcttctcctcggtggagctgccctatgaggacctgctgcgccaggagctgcg 
caagctgaagcgacgcttcttggaggagcacgagtgcctgtctgagcagcagctggagca 
gttcctgggccgggtgctggaggccagcaactacggcgtgtcggtgctcagcaacgcctc 
gggcaactggaactgggacttcacctccgcgctcttcttcgccagcaccgtgctctccac 
cacaggttatggccacaccgtgcccttgtcagatggaggtaaggccttctgcatcatcta 
ctccgtcattggcattcccttcaccctcctgttcctgacggctgtggtccagcgcatcac 
cgtgcacgtcacccgcaggccggtcctctacttccacatccgctggggcttctccaagca 
ggtggtggccatcgtccatgccgtgctccttgggtttgtcactgtgtcctgcttcttctt 
catcccggccgctgtcttctcagtcctggaggatgactggaacttcctggaatcctttta 
tttttgttttatttccctgagcaccattggcctgggggattatgtgcctggggaaggcta 
caatcaaaaattcagagagctctataagattgggatcacgtgttacctgctacttggcct 
tattgccatgttggtagttctggaaaccttctgtgaactccatgagctgaaaaaattcag 
aaaaatgttctatgtgaagaaggacaaggacgaggatcaggtgcacatcatagagcatga 
ccaactgtccttctcctcgatcacagaccaggcagctggcatgaaagaggaccagaagca 
aaatgagccttttgtggccacccagtcatctgcctgcgtggatggccctgcaaaccattg 
agcgtaggatttgttgcattatgctagagcaccagggtcagggtgcaaggaagaggctta 
agtatgttcatttttatcagaatgcaaaagcgaaaattatgtcactttaagaaatagcta 
ctgtttgcaatgtcttattaaaaaacaacaaaaaaagacaaatggaacaaagaagctgtg 
accccagcaggatgtctaatatgtgaggaaatgagatgtccacctaaaattcatatgtga 
caaaattatctcgaccttacataggaggagaatacttgaagcagtatgctgctgtggtta 
gaagcagattttatacttttaactggaaactttggggtttgcatttagatcatttagctg 
atggctaaatagcaaaatttatatttagaagcaaaaaaaaaaagcatagagatgtgtttt 
ataaataggtttatgtgtactggtttgcatgtacccacccaaaatgattatttttgtaga 
atctaagttaaacttactatttataatgcataggtaaccattaactatgtacatataaag 
tataaatatgtttatattctgtacatatggtttaggtcaccagatcctagtgtagttctg 
aaactaagactatagatattttgtttcttttgatttctctttatactaaagaatccagag 
ttgctacaataaaataaggggaataataaacttgagagtgaataaccatagtattctgct 
gcaataaatgtttctaccctcttgcgat 

SEQ ID NO:453. >Z39663_T3 < TY Transcript # LN 2665 # Source Gene: Z39663 

# Encoded protein: Z39663_P1 

ttcgaccaataagacaaagaaggaaagtacttcagcgcttgctgttttgtaatactggct 
ttgattgggagaaaataagtgggaaacaagagtagcttggacagggggcttggacggcca 
gcgcattgaccagaagcggcccacactacacgcgcgcgaacacacacacacacacacgca 
cacgccgcgcgcggtcagctagagtttggctactggacccaggaagggagggagcgggag 
gagctggtggctgtcgggtcggggtctgggcgccaccgagcgcccgccccacctctggag 
tgcgcagaaggcggtgcgtcccgggccccggcagcccagcccaggcgctctggccaagtt 
tgcgcgcggtctccgctgactctcgggttacctgagccggcaaccacgtcagcgccacat 
catctgggctttttatattgcaaggaaacaggaaaagaaggaagaaaaacatcgcccgag 
agagccaatcgcaggaagacggcgctgcccggaggagcggggcgggcgggcgcgcggggg 
agcgggcggcgggcgggagccaggcccgggcgggggcgggggcggcggggccagaagagg 
cggcgggccgcgctccggccggtctgcggcgttggccttggctttggctttggcggcggc 
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ggtggagaagatgctgcagtccctggccggcagctcgtgcgtgcgcctggtggagcggca 
ccgctcggcctggtgcttcggcttcctggtgctgggctacttgctctacctggtcttcgg 
cgcagtggtcttctcctcggtggagctgccctatgaggacctgctgcgccaggagctgcg 
caagctgaagcgacgcttcttggaggagcacgagtgcctgtctgagcagcagctggagca 
gttcctgggccgggtgctggaggccagcaactacggcgtgtcggtgctcagcaacgcctc 
gggcaactggaactgggacttcacctccgcgctcttcttcgccagcaccgtgctctccac 
cacaggttatggccacaccgtgcccttgtcagatggaggtaaggccttctgca teat eta 
ctccgtcattggcattcccttcaccctcctgttcctgacggctgtggtccagcgcatcac 
cgtgcacgtcacccgcaggccggtcctctacttccacatccgctggggcttctccaagca 
ggtggtggccatcgtccatgccgtgctccttgggtttgtcactgtgtcctgcttcttctt 
catcccggccgctgtcttctcagtcctggaggatgactggaacttcctggaatcctttta 
tttttgttttatttccctgagcaccattggcctgggggattatgtgcctggggaaggcta 
caatcaaaaattcagagagctctataagattgggatcacgtgttacctgctacttggcct 
tattgccatgttggtagttctggaaaccttctgtgaactccatgagctgaaaaaattcag 
aaaaatgttctatgtgaagaaggacaaggacgaggatcaggtgcacatcatagagcatga* 
ccaactgtccttctcctcgatcacagaccaggcagctggcatgaaagaggaccagaagca 
aaatgagccttttgtggccacccagtcatctgcctgcgtggatggccctgcaaaccattg 
agcgtaggatttgttgcattatgctagagcaccagggtcagggtgcaaggaagaggctta 
agtatgttcatttttatcagaatgcaaaagcgaaaattatgtcactttaagaaatagcta 
ctgtttgcaatgtcttattaaaaaacaacaaaaaaagacaaatggaacaaagaagctgtg 
accccagcaggatgtctaatatgtgaggaaatgagatgtccacctaaaattcatatgtga 
caaaattatctcgaccttacataggaggagaatacttgaagcagtatgctgctgtggtta 
gaagcagattttatacttttaactggaaactttggggtttgcatttagatcatttagctg 
atggctaaatagcaaaatttatatttagaagcaaaaaaaaaaagcatagagatgtgtttt 
ataaataggtttatgtgtactggtttgcatgtacccacccaaaatgattatttttgtaga 
atctaagttaaacttactatttataatgcataggtaaccattaactatgtacatataaag 
tataaatatgtttatattctgtacatatggtttaggtcaccagatcctagtgtagttctg 
aaactaagactatagatattttgtttcttttgatttctctttatactaaagaatccagag 
ttgctacaataaaataaggggaataataaacttgagagtgaataaccatagtattctgct 
gcaataaatgtttctaccctcttgcgatagaatggtgtcagatgtctaacatgtttataa 
acatttttttgaattgaacaggttatgaaaaggatttattaaaaggttaagatactttgt 
tttgaaagtacattgtgatttgcagccacctagcactaaagcatagaacttaaacacgaa 
tccctaaatgaacagattatgttacattaatctgagaattttaaatttagaagctaaaat 
aaataaaaatcacagctggttgtaa 

SEQ ID NO: 454. >Z39663JT4 # TY Transcript # LN 1726 # Source Gene: 239663 
# Encoded protein: Z39663_P2 

acttgagtcagtagaaaatggggagcctctgtgggctcctggtcct'gcaccgtgataaat 
agcctgtcctgcaacctggcttaaaacagggaaagccggagttaggcaacagagtgccta 
gctttctcagtcattcaaacacaagcacggaagaagagattgcactgaccagaggaccgc 
cagggtgtgatgctccgtagtcgcagggatgaaggaaaaccatgggcggagcttgtcatg 
gacagctcctaggtggacataaatggaaactcattggttctggggcacggcgggaagagg 
gagacggccctggacggccgcaccaagaatttcacaacccaaacccgactagaaagcctg 
ggttatggccacaccgtgcccttgtcagatggaggtaaggccttctgcatcatctactcc 
gtcattggcattcccttcaccctcctgttcctgacggctgtggtccagcgcatcaccgtg 
cacgtcacccgcaggccggtcctctacttccacatccgctggggcttctccaagcaggtg 
gtggccatcgtccatgccgtgctccttgggtttgtcactgtgtcctgcttcttcttcatc 
ccggccgctgtcttctcagtcctggaggatgactggaacttcctggaatccttttatttt 
tgttttatttccctgagcaccattggcctgggggattatgtgcctggggaaggctacaat 
caaaaattcagagagctctataagattgggatcacgtgttacctgctacttggccttatt 
gccatgttggtagttctggaaaccttctgtgaactccatgagctgaaaaaattcagaaaa 
atgttctatgtgaagaaggacaaggacgaggatcaggtgcacatcatagagcatgaccaa 
ctgtccttctcctcgatcacagaccaggcagctggcatgaaagaggaccagaagcaaaat 
gagccttttgtggccacccagtcatctgcctgcgtggatggccctgcaaaccattgagcg 
taggatttgttgcattatgctagagcaccagggtcagggtgcaaggaagaggcttaagta 
tgttcatttttatcagaatgcaaaagcgaaaattatgtcactttaagaaatagctactgt 
ttgcaatgtcttattaaaaaacaacaaaaaaagacaaatggaacaaagaagctgtgaccc 
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cagcaggatgtctaatatgtgaggaaatgagatgtccacctaaaattcatatgtgacaaa 
attatctcgaccttacataggaggagaatacttgaagcagtatgctgctgtggttagaag 
cagattttatacttttaactggaaactttggggtttgcatttagatcatttagctgatgg 
ctaaatagcaaaatttatatttagaagcaaaaaaaaaaagcatagagatgtgttttataa 
ataggtttatgtgtactggtttgcatgtacccacccaaaatgattatttttgtagaatct 
aagttaaacttactatttataatgcataggtaaccattaactatgtacatataaagtata 
aatatgtttatattctgtacatatggtttaggtcaccagatcctagtgtagttctgaaac 
taagactatagatattttgtttcttttgatttctctttatactaaagaatccagagttgc 
tacaataaaataaggggaataataaacttgagagtgaataaccata 

SEQ ID NO: 455. >Z39663JT5 # TY Transcript # LN 2537 < Source Gene: Z39663 
# Encoded protein: Z39663_P3 

ttcgaccaataagacaaagaaggaaagtacttcagcgcttgctgttttgtaatactggct 
ttgattgggagaaaataagtgggaaacaagagtagcttggacagggggcttggacggcca 
gcgcattgaccagaagcggcccacactacacgcgcgcgaacacacacacacacacacgca 
cacgccgcgcgcggtcagctagagtttggctactggacccaggaagggagggagcgggag 
gagctggtggctgtcgggtcggggtctgggcgccaccgagcgcccgccccacctctggag 
tgcgcagaaggcggtgcgtcccgggccccggcagcccagcccaggcgctctggccaagtt 
tgcgcgcggtctccgctgactctcgggttacctgagccggcaaccacgtcagcgccacat 
catctgggctttttatattgcaaggaaacaggaaaagaaggaagaaaaacatcgcccgag 
agagccaatcgcaggaagacggcgctgcccggaggagcggggcgggcgggcgcgcggggg 
agcgggcggcgggcgggagccaggcccgggcgggggcgggggcggcggggccagaagagg 
cggcgggccgcgctccggccggtctgcggcgttggccttggctttggctttggcggcggc 
ggtggagaagatgctgcagtccctggccggcagctcgtgcgtgcgcctggtggagcggca 
ccgctcggcctggtgcttcggcttcctggtgctgggctacttgctctacctggtcttcgg 
cgcagtggtcttctcctcggtggagctgccctatgaggacctgctgcgccaggagctgcg 
caagctgaagcgacgcttcttggaggagcacgagtgcctgtctgagcagcagctggagca 
gttcctgggccgggtgctggaggccagcaactacggcgtgtcggtgctcagcaacgcctc 
gggcaactggaactgggacttcacctccgcgctcttcttcgccagcaccgtgctctccac 
cacaggtgaatggggctaagaccccatacccaggaagcgagctggcatcagaagcagaag 
actgaggtcatttttatggatgggagatgtgctggtgggtttttagaagagctcatgtca 
gttggcatccatttcttcagtaaggtgagaaggttatggccacaccgtgcccttgtcaga 
tggaggtaaggccttctgcatcatctactccgtcattggcattcccttcaccctcctgtt 
cctgacggctgtggtccagcgcatcaccgtgcacgtcacccgcaggccggtcctctactt 
ccacatccgctggggcttctccaagcaggtggtggccatcgtccatgccgtgctccttgg 
gtttgtcactgtgtcctgcttcttcttcatcccggccgctgtcttctcagtcctggagga 
tgactggaacttcctggaatccttttatttttgttttatttccctgagcaccattggcct 
gggggattatgtgcctggggaaggctacaatcaaaaattcagagagctctataagattgg 
gatcacgtgttacctgctacttggccttattgccatgttggtagttctggaaaccttctg 
tgaactccatgagctgaaaaaattcagaaaaatgttctatgtgaagaaggacaaggacga 
ggatcaggtgcacatcatagagcatgaccaactgtccttctcctcgatcacagaccaggc 
agctggcatgaaagaggaccagaagcaaaatgagccttttgtggccacccagtcatctgc 
ctgcgtggatggccctgcaaaccattgagcgtaggatttgttgcattatgctagagcacc 
agggtcagggtgcaaggaagaggcttaagtatgttcatttttatcagaatgcaaaagcga 
aaattatgtcactttaagaaatagctactgtttgcaatgtcttattaaaaaacaacaaaa 
aaagacaaatggaacaaagaagctgtgaccccagcaggatgtctaatatgtgaggaaatg 
agatgtccacctaaaattcatatgtgacaaaattatctcgaccttacataggaggagaat 
acttgaagcagtatgctgctgtggttagaagcagattttatacttttaactggaaacttt 
ggggtttgcatttagatcatttagctgatggctaaatagcaaaatttatatttagaagca 
aaaaaaaaaagcatagagatgtgttttataaataggtttatgtgtactggtttgcatgta 
cccacccaaaatgattatttttgtagaatctaagttaaacttactatttataatgcatag 
gtaaccattaactatgtacatataaagtataaatatgtttatattctgtacatatggttt 
aggtcaccagatcctagtgtagttctgaaactaagactatagatattttgtttcttttga 
tttctctttatactaaagaatccagagttgctacaataaaataaggggaataataaactt 
gagagtgaataaccata 
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SEQ ID NO: 456. >Z39663JT6 # TY Transcript # LN 1485 # Source Gene: Z39663 

# Encoded protein: Z39663_P4 

agggactccttcctccttctggccttgtgagattagagtgagaagtctatgaggaagcag 
ccctcaccagacactgaatctgctggtgccttgatcagagacttcccagccactaaaact 
gttatggccacaccgtgcccttgtcagatggaggtaaggccttctgcatcatctactccg 
tcattggcattcccttcaccctcctgttcctgacggctgtggtccagcgcatcaccgtgc 
acgtcacccgcaggccggtcctctacttccacatccgctggggcttctccaagcaggtgg 
tggccatcgtccatgccgtgctccttgggtttgtcactgtgtcctgcttcttcttcatcc 
cggccgctgtcttctcagtcctggaggatgactggaacttcctggaatccttttattttt 
gttttatttccctgagcaccattggcctgggggattatgtgcctggggaaggctacaatc 
aaaaattcagagagctctataagattgggatcacgtgttacctgctacttggccttattg 
ccatgttggtagttctggaaaccttctgtgaactccatgagctgaaaaaattcagaaaaa 
tgttctatgtgaagaaggacaaggacgaggatcaggtgcacatcatagagcatgaccaac 
tgtccttctcctcgatcacagaccaggcagctggcatgaaagaggaccagaagcaaaatg 
agccttttgtggccacccagtcatctgcctgcgtggatggccctgcaaaccattgagcgt 
aggatttgttgcattatgctagagcaccagggtcagggtgcaaggaagaggcttaagtat 
gttcatttttatcagaatgcaaaagcgaaaattatgtcactttaagaaatagctactgtt 
tgcaatgtcttattaaaaaacaacaaaaaaagacaaatggaacaaagaagctgtgacccc 
agcaggatgtctaatatgtgaggaaatgagatgtccacctaaaattcatatgtgacaaaa 
ttatctcgaccttacataggaggagaatacttgaagcagtatgctgctgtggttagaagc 
agattttatacttttaactggaaactttggggtttgcatttagatcatttagctgatggc 
taaatagcaaaatttatatttagaagcaaaaaaaaaaagcatagagatgtgttttataaa 
taggtttatgtgtactggtttgcatgtacccacccaaaatgattatttttgtagaatcta 
agttaaacttactatttataatgcataggtaaccattaactatgtacatataaagtataa 
atatgtttatattctgtacatatggtttaggtcaccagatcctagtgtagttctgaaact 
aagactatagatattttgtttcttttgatttctctttatactaaagaatccagagttgct 
acaataaaataaggggaataataaacttgagagtgaataaccata 

SEQ ID NO: 4 57 . >Z39663JT7 # TY Transcript # LN 1474 # Source Gene: Z39663 

# Encoded protein: Z39663_P5 

gtcctgagggttttatctcctgatttggttagaagattgttgaatctggctgtgagagcc 
tttcaagtgcctaaaggagagagcttgaacttccctgagagtacaccaggttatggccac 
accgtgcccttgtcagatggaggtaaggccttctgcatcatctactccgtcattggcatt 
cccttcaccctcctgttcctgacggctgtggtccagcgcatcaccgtgcacgtcacccgc 
aggccggtcctctacttccacatccgctggggcttctccaagcaggtggtggccatcgtc 
catgccgtgctccttgggtttgtcactgtgtcctgcttcttcttcatcccggccgctgtc 
ttctcagtcctggaggatgactggaacttcctggaatccttttatttttgttttatttcc 
ctgagcaccattggcctgggggattatgtgcctggggaaggctacaatcaaaaattcaga 
gagctctataagattgggatcacgtgttacctgctacttggccttattgccatgttggta 
gttctggaaaccttctgtgaactccatgagctgaaaaaattcagaaaaatgttctatgtg 
aagaaggacaaggacgaggatcaggtgcacatcatagagcatgaccaactgtccttctcc 
tcgatcacagaccaggcagctggcatgaaagaggaccagaagcaaaatgagccttttgtg 
gccacccagtcatctgcctgcgtggatggccctgcaaaccattgagcgtaggatttgttg 
cattatgctagagcaccagggtcagggtgcaaggaagaggcttaagtatgttcattttta 
tcagaatgcaaaagcgaaaattatgtcactttaagaaatagctactgtttgcaatgtctt 
attaaaaaacaacaaaaaaagacaaatggaacaaagaagctgtgaccccagcaggatgtc 
taatatgtgaggaaatgagatgtccacctaaaattcatatgtgacaaaattatctcgacc 
ttacataggaggagaatacttgaagcagtatgctgctgtggttagaagcagattttatac 
ttttaactggaaactttggggtttgcatttagatcatttagctgatggctaaatagcaaa 
atttatatttagaagcaaaaaaaaaaagcatagagatgtgttttataaataggtttatgt 
gtactggtttgcatgtacccacccaaaatgattatttttgtagaatctaagttaaactta 
ctatttataatgcataggtaaccattaactatgtacatataaagtataaatatgtttata 
ttctgtacatatggtttaggtcaccagatcctagtgtagttctgaaactaagactataga 
tattttgtttcttttgatttctctttatactaaagaatccagagttgctacaataaaata 
aggggaataataaacttgagagtgaataaccata 
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SEQ ID NO: 458 . >Z39663_T8 # TY Transcript # LN 1724 # Source Gene: Z39663 
ft Encoded protein: Z39663_P6 

gtgggtagcctcggatgagtttttcctgatggtctaagatgggtgtgggccttagcttct 
tagtacacagaacttgtgtttgagcctgatagtgaattcaagaccagagaccctgatgtg 
ttggaattgctggttggattttacaaagaatggtggactttcaaacttatggtgaccact 
gttgaattttactgggacatgacaattatactagagcccctaaagtggcagacctgaagt 
ctcgctgttctctttggtttgaggggaataagggcagatgataggcatatattgtttaaa 
aacgtgtttgccactttgtctcttcctcttcgcctcagtgaccttgttctccttgcaggg 
ttatggccacaccgtgcccttgtcagatggaggtaaggccttctgcatcatctactccgt 
cattggcattcccttcaccctcctgttcctgacggctgtggtccagcgcatcaccgtgca 
cgtcacccgcaggccggtcctctacttccacatccgctggggcttctccaagcaggtggt 
ggccatcgtccatgccgtgctccttgggtttgtcactgtgtcctgcttcttcttcatccc 
ggccgctgtcttctcagtcctggaggatgactggaacttcctggaatccttttatttttg 
ttttatttccctgagcaccattggcctgggggattatgtgcctggggaaggctacaatca 
aaaattcagagagctctataagattgggatcacgtgttacctgctacttggccttattgc 
catgttggtagttctggaaaccttctgtgaactccatgagctgaaaaaattcagaaaaat 
gttctatgtgaagaaggacaaggacgaggatcaggtgcacatcatagagcatgaccaact 
gtccttctcctcgatcacagaccaggcagctggcatgaaagaggaccagaagcaaaatga 
gccttttgtggccacccagtcatctgcctgcgtggatggccctgcaaaccattgagcgta 
ggatttgttgcattatgctagagcaccagggtcagggtgcaaggaagaggcttaagtatg 
ttcatttttatcagaatgcaaaagcgaaaattatgtcactttaagaaatagctactgttt 
gcaatgtcttattaaaaaacaacaaaaaaagacaaatggaacaaagaagctgtgacccca 
gcaggatgtctaatatgtgaggaaatgagatgtccacctaaaattcatatgtgacaaaat 
tatctcgaccttacataggaggagaatacttgaagcagtatgctgctgtggttagaagca 
gattttatacttttaactggaaactttggggtttgcatttagatcatttagctgatggct 
aaatagcaaaatttatatttagaagcaaaaaaaaaaagcatagagatgtgttttataaat 
aggtttatgtgtactggtttgcatgtacccacccaaaatgattatttttgtagaatctaa 
gttaaacttactatttataatgcataggtaaccattaactatgtacatataaagtataaa 
tatgtttatattctgtacatatggtttaggtcaccagatcctagtgtagttctgaaacta 
agactatagatattttgtttcttttgatttctctttatactaaagaatccagagttgcta 
caataaaataaggggaataataaacttgagagtgaataaccata 

SEQ ID NO: 459. >Z39663_T9 # TY Transcript # LN 1571 # Source Gene: Z39663 
# Encoded protein: Z39663_P7 

tgactgagaagagctaaaccccagtagagccaagttgggaggtggtgaaaaaaatccagg 
tgaatggggctaagaccccatacccaggaagcgagctggcatcagaagcagaagactgag 
gtcatttttatggatgggagatgtgctggtgggtttttagaagagctcatgtcagttggc 
atccatttcttcagtaaggtgagaaggttatggccacaccgtgcccttgtcagatggagg 
taaggccttctgcatcatctactccgtcattggcattcccttcaccctcctgttcctgac 
ggctgtggtccagcgcatcaccgtgcacgtcacccgcaggccggtcctctacttccacat 
ccgctggggcttctccaagcaggtggtggccatcgtccatgccgtgctccttgggtttgt 
cactgtgtcctgcttcttcttcatcccggccgctgtcttctcagtcctggaggatgactg 
gaacttcctggaatccttttatttttgttttatttccctgagcaccattggcctggggga 
ttatgtgcctggggaaggctacaatcaaaaattcagagagctctataagattgggatcac 
gtgttacctgctacttggccttattgccatgttggtagttctggaaaccttctgtgaact 
ccatgagctgaaaaaattcagaaaaatgttctatgtgaagaaggacaaggacgaggatca 
ggtgcacatcatagagcatgaccaactgtccttctcctcgatcacagaccaggcagctgg 
catgaaagaggaccagaagcaaaatgagccttttgtggccacccagtcatctgcctgcgt 
ggatggccctgcaaaccattgagcgtaggatttgttgcattatgctagagcaccagggtc 
agggtgcaaggaagaggcttaagtatgttcatttttatcagaatgcaaaagcgaaaatta 
tgtcactttaagaaatagctactgtttgcaatgtcttattaaaaaacaacaaaaaaagac 
aaatggaacaaagaagctgtgaccccagcaggatgtctaatatgtgaggaaatgagatgt 
ccacctaaaattcatatgtgacaaaattatctcgaccttacataggaggagaatacttga 
agcagtatgctgctgtggttagaagcagattttatacttttaactggaaactttggggtt 
tgcatttagatcatttagctgatggctaaatagcaaaatttatatttagaagcaaaaaaa 
aaaagcatagagatgtgttttataaataggtttatgtgtactggtttgcatgtacccacc 
caaaatgattatttttgtagaatctaagttaaacttactatttataatgcataggtaacc 
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attaactatgtacatataaagtataaatatgtttatattctgtacatatggtttaggtca 
ccagatcctagtgtagttctgaaactaagactatagatattttgtttcttttgatttctc 
tttatactaaagaatccagagttgctacaataaaataaggggaataataaacttgagagt 
gaataaccata 

SEQ ID NO: 460. >Z39663_T10 # TY Transcript < LN 1759 # Source Gene: Z39663 

# Encoded protein: Z39663P8 

agggactccttcctccttctggccttgtgagattagagtgagaagtctatgaggaagcag 
ccctcaccagacactgaatctgctggtgccttgatcagagacttcccagccactaaaact 
ggacagtcttcattgatgaccttactatttaccacagaagttcatctatgtggctccaaa 
ctctatttcaagaatatgaaaggcatgtaggttatgcctacgttctcggacccttgatag 
atgttttgctgacccaccttctgatatctgtagagttaccctaggctccaccttccacca 
accagttctgtgtcaaagtgcaaaatgaaaatcacagaaaaagtcaacctgacctgaagc 
ttattcatcaccgtgcaatgagtttcaccatacagttatggccacaccgtgcccttgtca 
gatggaggtaaggccttctgcatcatctactccgtcattggcattcccttcaccctcctg 
ttcctgacggctgtggtccagcgcatcaccgtgcacgtcacccgcaggccggtcctctac 
ttccacatccgctggggcttctccaagcaggtggtggccatcgtccatgccgtgctcctt 
gggtttgtcactgtgtcctgcttcttcttcatcccggccgctgtcttctcagtcctggag 
gatgactggaacttcctggaatccttttatttttgttttatttccctgagcaccattggc 
ctgggggattatgtgcctggggaaggctacaatcaaaaattcagagagctctataagatt 
gggatcacgtgttacctgctacttggccttattgccatgttggtagttctggaaaccttc 
tgtgaactccatgagctgaaaaaattcagaaaaatgttctatgtgaagaaggacaaggac 
gaggatcaggtgcacatcatagagcatgaccaactgtccttctcctcgatcacagaccag 
gcagctggcatgaaagaggaccagaagcaaaatgagccttttgtggccacccagtcatct 
gcctgcgtggatggccctgcaaaccattgagcgtaggatttgttgcattatgctagagca 
ccagggtcagggtgcaaggaagaggcttaagtatgttcatttttatcagaatgcaaaagc 
gaaaattatgtcactttaagaaatagctactgtttgcaatgtcttattaaaaaacaacaa 
aaaaagacaaatggaacaaagaagctgtgaccccagcaggatgtctaatatgtgaggaaa 
tgagatgtccacctaaaattcatatgtgacaaaattatctcgaccttacataggaggaga 
atacttgaagcagtatgctgctgtggttagaagcagattttatacttttaactggaaact 
ttggggtttgcatttagatcatttagctgatggctaaatagcaaaatttatatttagaag 
caaaaaaaaaaagcatagagatgtgttttataaataggtttatgtgtactggtttgcatg 
tacccacccaaaatgattatttttgtagaatctaagttaaacttactatttataatgcat 
aggtaaccattaactatgtacatataaagtataaatatgtttatattctgtacatatggt 
ttaggtcaccagatcctagtgtagttctgaaactaagactatagatattttgtttctttt 
gatttctctttatactaaagaatccagagttgctacaataaaataaggggaataataaac 
ttgagagtgaataaccata 

SEQ ID NO:461. >Z39663_T11 # TY Transcript « LN 1438 # Source Gene: Z39663 

# Encoded protein: Z39663_P9 

ttcgaccaataagacaaagaaggaaagtacttcagcgcttgctgttttgtaatactggct 
ttgattgggagaaaataagtgggaaacaagagtagcttggacagggggcttggacggcca 
gcgcattgaccagaagcggcccacactacacgcgcgcgaacacacacacacacacgttat 
ggccacaccgtgcccttgtcagatggaggtaaggccttctgcatcatctactccgtcatt 
ggcattcccttcaccctcctgttcctgacggctgtggtccagcgcatcactgtgtcctgc 
ttcttcttcatcccggccgctgtcttctcagtcctggaggatgactggaacttcctggaa 
tccttttatttttgttttatttccctgagcaccattggcctgggggattatgtgcctggg 
gaaggctacaatcaaaaattcagagagctctataagattgggatcacgtgttacctgcta 
cttggccttattgccatgttggtagttctggaaaccttctgtgaactccatgagctgaaa 
aaattcagaaaaatgttctatgtgaagaaggacaaggacgaggatcaggtgcacatcata 
gagcatgaccaactgtccttctcctcgatcacagaccaggcagctggcatgaaagaggac 
cagaagcaaaatgagccttttgtggccacccagtcatctgcctgcgtggatggccctgca 
aaccattgagcgtaggatttgttgcattatgctagagcaccagggtcagggtgcaaggaa 
gaggcttaagtatgttcatttttatcagaatgcaaaagcgaaaattatgtcactttaaga 
aatagctactgtttgcaatgtcttattaaaaaacaacaaaaaaagacaaatggaacaaag 
aagctgtgaccccagcaggatgtctaatatgtgaggaaatgagatgtccacctaaaattc 
atatgtgacaaaattatctcgaccttacataggaggagaatacttgaagcagtatgctgc 
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tgtggttagaagcagattttatacttttaactggaaactttggggtttgcatttagatca 
tttagctgatggctaaatagcaaaatttatatttagaagcaaaaaaaaaaagcatagaga 
tgtgttttataaataggtttatgtgtactggtttgcatgtacccacccaaaatgattatt 
tttgtagaatctaagttaaacttactatttataatgcataggtaaccattaactatgtac 
atataaagtataaatatgtttatattctgtacatatggtttaggtcaccagatcctagtg 
tagttctgaaactaagactatagatattttgtttcttttgatttctctttatactaaaga 
atccagagttgctacaataaaataaggggaataataaacttgagagtgaataaccata 

SEQ ID NO: 462. >Z39663_P1 # TY Protein # CC #LN 336 # Source Gene: Z39663 

# Encoding Transcript: 1 

MLQSLAGSSCVRLVERHRSAWCFGFLVLGYLLYLVFGAVVFSSVELPYEDLLRQELRKLK 
RRFLEEHECL 

SEQQLEQFLGRVLEASNYGVSVLSNASGNWNWDFTSALFFASTVLSTTGY 

GHTVPLSDGGKAFCIIYSVIGIPFTLLFLTAWQRITVHVTRRPVLYFHIRWGFSKQWA 

IVHAVLLGFVTVSCFFFIPAAVFSVLEDDWNFLESFYFCFISLSTIGLGDYVPGEGYNQK 

FRELYKIGITCYLLLGLIAMLWLETFCELHELKKFRKMFYVKKDKDEDQVHIIEHDQLS 

FSSITDQAAGMKEDQKQNEPFVATQSSACVDGPANH 

SEQ ID NO: 463. >Z39663_P2 # TY Protein # CC #LN 269 # Source Gene: Z39663 

# Encoding Transcript: 4 

MKENHGRSLSWTAPEVDINGNSLVLGHGGKRETALDGRTKNFTTQTRLESLGYGHTVPLS 
DGGKAFCIIYSVIGIPFTLLFLTAWQRITVHVTRRPVLYFHIRWGFSKQWAIVHAVLL 
GFVTVSCFFFIPAAVFSVLEDDWNFLESFYFCFISLSTIGLGDYVPGEGYNQKFRELYKI 
GITCYLLLGLIAMLWLETFCELHELKKFRKMFYVKKDKDEDQVHIIEHDQLSFSSITDQ 
AAGMKEDQKQNE P FVATQS S AC VDG PANH 

SEQ ID NO: 4 64 . >Z39663_P3 # TY Protein # CC #LN 385 # Source Gene: Z39663 

# Encoding Transcript: 5 

MLQSLAGSSCVRLVERHRSAWCFGFLVLGYLLYLVFGAWFSSVELPYEDLLRQELRKLK 
RRFLEEHECL 

SEQQLEQFLGRVLEASNYGVSVLSNASGNWNWDFTSALFFASTVLSTTGE 
WGYRPHTQEASWHQKQKTEVIFMDGRCAGGFLEELMSVGIHFFSKVEGYGHTVPLSDGGK 
AFCIIYSVIGIPFTLLFLTAVVQRITVHVTRRPVLYFHIRWGFSKQWAIVHAVLLGFVT 
VSCFFFIPAAVFSVLEDDWNFLESFYFCFISLSTIGLGDYVPGEGYNQKFRELYKIGITC 
YLLLGLI AMLWLETFCELHELKKFRKMFYVKKDKDEDQVH 1 1 EHDQLSFS S I TDQAAGM 
KEDQKQN EP FVATQS SAC VDG PANH 

SEQ ID NO: 465. >Z39663_P4 # TY Protein # CC #LN 242 # Source Gene: Z39663 

# Encoding Transcript: 6 

MRKQPSPDTESAGALIRDFPATENCYGHTVPLSDGGKAFCIIYSVIGIPFTLLFLTAWQ 
RITVHVTRRPVLYFHIRWGFSKQWAIVHAVLLGFVTVSCFFFIPAAVFSVLEDDWNFLE 
SFYFCFISLSTIGLGDYVPGEGYNQKFRELYKIGITCYLLLGLIAMLVVLETFCELHELK 
KFRKMFYVKKDKDEDQVHIIEHDQLSFSSITDQAAGMKEDQKQNEPFVATQSSACVDGPA 
NH 

SEQ ID NO: 4 66. >Z39663_P5 # TY Protein # CC #LN 254 # Source Gene: Z39663 

# Encoding Transcript: 7 

VLRVLSPDLVRRLLNLAVRAFQVPKGESLNFPESTPGYGHTVPLSDGGKAFCIIYSVIGI 
PFTLLFLTAWQRITVHVTRRPVLYFHIRWGFSKQWAIVHAVLLGFVTVSCFFFIPAAV 
FSVLEDDWNFLESFYFCFISLSTIGLGDYVPGEGYNQKFRELYKIGITCYLLLGLIAMLV 
VLETFCELHELKKFRKMFYVKKDKDEDQVHIIEHDQLSFSSITDQAAGMKEDQKQNEPFV 
ATQS SAC VDG PANH 

SEQ ID NO: 467. >Z39663_P6 # TY Protein # CC #LN 245 # Source Gene: Z39663 

# Encoding Transcript: 8 

MIGIYCLKTCLPLCLFLFASVTLFSLQGYGHTVPLSDGGKAFCIIYSVIGIPFTLLFLTA 
WQRITVHVTRRPVLYFHIRWGFSKQWAIVHAVLLGFVTVSCFFFIPAAVFSVLEDDWN 
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FLESFYFCFISLSTIGIX5DYVPGEGYNQKFRELYKIGITCYLLLGLIAMLVVLETFCELH 
ELKKFRKMFYVKKDKDEDQVHIIEHDQLSFSSITDQAAGMKEDQKQNEPFVATQSSACVD 
GPANH 

SEQ ID NO:468. >Z39663_P7 # TY Protein « CC #LN 243 # Source Gene: Z39663 

# Encoding Transcript: 9 

MDGRCAGGFLEELMSVGIHFFSKVEGYGHTVPLSDGGKAFCIIYSVIGIPFTLLFLTAW 
QRITVHVTRRPVLYFHIRWGFSKQWAIVHAVLLGFVTVSCFFFIPAAVFSVLEDDWNFL 
ESFYFCFISLSTIGLGDYVPGEGYNQKFRELYKIGITCYLLLGLIAMLWLETFCELHEL 
KKFRKMFYVKKDKDEDQVHIIEHDQLSFSSITDQAAGMKEDQKQNEPFVATQSSACVDGP 
ANH 

SEQ ID NO: 469. >Z39663_P8 # TY Protein « CC #LN 223 # Source Gene: Z39663 

# Encoding Transcript: 10 

MSFTYSYGHTVPLSDGGKAFCI IYSVIGI PFTLLFLTAVVQRITVHVTRRPVLYFHIRWG 
FSKQWAIVHAVLLGFVTVSCFFFIPAAVFSVLEDDWNFLESFYFCFISLSTIGLGDYVP 
GEGYNQKFRELYKIGITCYLLLGLIAMLWLETFCELHELKKFRKMFYVKKDKDEDQVHI 
IEHDQLS FS S I TDQAAGMKEDQKQNE P FVATQS SAC VDGPANH 

SEQ ID NO: 470 . >Z39663_P9 # TY Protein # CC #LN 77 # Source Gene: Z39663 

# Encoding Transcript: 11 

MLWLETFCELHELKKFRKMFYVKKDKDEDQVHI IEHDQLS FSSI TDQAAGMKEDQKQNE 
P FVATQS SAC VDGPANH 



SEQ ID NO: 471. >Z44462 # TY Consensus # Length 9434 # Number of exons 54 

gaggtggcgaaggggctagcgtcaggggctggcggccgtgatgcgctttgggtggaaagc 

cgcgtgaagggagcacggcgcagccagctcctgtaagtgactggaaccaggaacagagga 

cagtgagtcgctctcgtccctccaggtctgctgagaacagacccagtccctgaggaagga 

gaagatgttgctgccacgatcagtgccacagagaccctctcggaagaggagcaggaagag 

ctaagaagagaacttgcaaaggtcaaaacctgtcaaaggccaagtgctttatttccctcc 

agtcaaggactgggcattttgcatttgaatcctggatggcgggaagcagcaagcaggtgt 

caacaggggtggggaggagacgccggaccccaccagccaatcagagcgcgcagaggcctc 

ccttcccccgccgccgtcccgcgccaaaattccaaacgggctacgcaccgctcccaccgt 

ctccttccgccgagctcagcgagccggaactatgtgaagaaaaagaagtcagcgttttcc 

ccaggactctccgctcaggtcctctatactgtcggcctgtgttcctctccacgttggccc 

ggcctccttcgctctacaaatgggaggagagtttcgagcctgctgttctatggtggtggc 

gtgggcgggacagacggcggagggattctgggcgcgcgcgctgcggccgctggcgcggag 

cttgtggcgccagaattcggagcgcggaagagccagagctgcgagcgcctggagctggat 

ctctctccggtcgcgcacgccgaggccagtagggagagaagatggtggttctccgcagca 

gcttggagctgcacaaccactccgcggcctcggccacgggctccttggacctgtccagtg 

acttcctcagtctggagcacatcggccggaggcggctccgctcggccggcgcggcgcaga 

agaaacccgcggcgaccacagccaaagcgggcgtgagtacgggcctgaggcgggctccca 

gggcggttcccggggggactgccggcagcccggcgctcggaggggcctgggggtgggcgg 

cgccggtcagggtgacagttgtcgggggagaagtgtggtcgaggacgagtaaaatggctt 

gttatgaactataaaccagtaaatcttatattagctgcttcatagacattgctgatattt 

gtttatcatgaaatggtttttctagatactaacttgtagtttcttggggagatacaaaat 

gattctgcacaaaaagcagtttttactaacttgtaatcagagaatactctttggaggaat 

ttaaattggagttttgtaaagaaaaacttagatggctttcttaatttttcttaaaggatg 

ggtcatcagttaaggaagttgaaacctaccaccggacacgtgctttaagatctttgagaa 

aagatgcacagaattcttcagattctagttttgagaagaatgtggaaataacggagcaac 

ttgctaatggcaggcattttacaaggcagttggccagacagcaggctgataaaaaaaaag 

aagagcacagagaagacaaagtgattccagttactcggtcattgagggctagaaacatcg 
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ttcaaagtacagaacacttacatgaagataatggtgatgttgaagtgcgtcgaagttgta 
ggattagaagtcgttatagtggtgtaaaccagtccatgctgtttgacaaacttataacta 
acactgctgaagctgtacttcaaaaaatggatgacatgaagaagatgcgtagacagcgaa 
tgagagaacttgaagacttgggagtgtttaatgaaacagaagaagtaaatatattcttcc 
attaagggaatgatttaatatccagtttaatttttttatagtacctaatccatttttcat 
taaatatctactttttccagcatttatttgatcagtcattagttactgctggaaatgtaa 
gagtttatggctgtaaatatcttaattttaaacaagattggaggatccaacttaattttt 
ttaataaatagattccaaatgttaatggagcaatcttaatatgtacacaagaggaaaaca 
gaaagatattcaaagaactgatgaagaaacaactgataatcaagaaggcagtgtggagtc 
atctgaagagggtgaagaccaagaacatgaagatgatggtgaagatgaagatgatgaaga 
tgatgatgatgatgacgatgatgatgatgatgatgatgatgaagatgatgaagatgaaga 
agatggagaagaagagaatcagaagcgatattatcttagacagagaaaagctactgttta 
ctatcaggctccattggaaaaacctcgtcaccagagaaagcccaacatattttatagtgg 
cccagcttctcctgcaagaccaagataccgattatcttccgcaggaccaagaagtcctta 
ctgtaaacgaatgaacaggcgaaggcatgcaatccacagtagtgactcgacttcatcttc 
ctcctctgaagatgaacagcactttgagaggcggaggaaaaggagtcgtaatagggctat 
caataggtgcctcccactaaattttcggaaagatgaattaaaaggcatttataaagatcg 
aatgaaaattggagcaagccttgccgatgttgatccaatgcaactagattcttcagtacg 
atttgatagtgttggtggcctgtctaatcatatagcagctctaaaagagatggtggtgtt 
tccattactttatccagaagtctttgaaaaatttaaaattcaacccccaagaggttgttt 
gttttatgggccacctggaactggaaagactctggttgccagagcacttgccaatgagtg 
cagtcaaggggataaaagagtagcatttttcatgaggaaaggtgctgattgtctaagtaa 
atgggtaggagaatctgaaagacagctacgattgctgtttgatcaggcctatcagatgcg 
cccatcaattattttttttgacgaaattgatggtctggctccagtacggtcaagcaggca 
agatcagattcacaggtaaaacttgcttgatggcacttcaacgtttcttcctatttcctg 
agtggtattgatgtgtgagagatagaatgttaaatggtgaaagaaactttataagctagt 
agttcctttttttggtagtagaaccccagtatatagtacaaaaataaaaaatgtggatgg 
ttgcagtggctcatacctgtattcctggcactttgggaggccaaggcaggcagatcactt 
gaggtcaagagtttgagaccagcctggccaacatggtgaaatcctgtctctactaaaaaa 
aaaatttgatgttatatattgtttgagaactcttaagttcctttgtgaaattatgcttta 
taaaatgtcttctaggattcatttgtggggggtctaacaaccgtctgttttccattgatt 
tcaatttctgtgggccatttgggatgttattttatagatttttttaatccattggaaata 
gtgaggtgccctgagaaaatttattcacatgtaatgcgttaaatcttacaaaagaagaaa 
tattactgaatgatatagactgtgttcatagtattaatctcttctctcttatttttaaaa 
tttagttctattgtttccaccctgctagctcttatggatggattggacagcagaggggaa 
attgtggtcattggtgctacgaacaggctagattctatagatcctgctttacgaaggcct 
ggtcgctttgatagagaattcctctttagcctgcctgataaagaggctcgaaaagagatt 
ctaaagattcacaccagggattggaatcccaaaccactggacacatttttagaagagcta 
gcagaaaactgtgttggatactgtggagcagatattaaatcaatatgtgctgaagctgct 
ttatgtgctttacgacgacgctacccacagatctataccactagtgagaaactgcagttg 
gatctctcttcaattaatatctcagctaaggatttcgaggtagctatgcaaaagatgata 
ccagcctcccaaagagctgtgacatcacctgggcaggcactgtccaccgttgtgaaacca 
ctcctgcaaaacactgttgacaagattttagaagccctgcagagagtatttccacatgca 
gaattcagaacaaataaaacattagactcaggtataaaacttgattgaccatttaaagtt 
agtttttataatcagtaaagtgaaatgtttcatacataccagaaatatttatttgtaaaa 
gaatctcttgaaaagaatttggtaatgctttacctgggatatttgaaggaaatttactcc 
atggattgcccttaatttaacttattgtaagtttggggtgcatttttacatttacttatt 
aattcatatctacacaaatgactagaaaatatttcttgtcctctgctagaaagtgacttg 
gcttacagtgatgatgatgttccatcagtttatgaaaatggactttctcagaaatcttct 
cataaggcaaaagacaattttaattttcttcatttgaatagaaatgcttgttaccaacct 
atgtcttttcgaccaagaatattgatagtaggagaaccaggatttgggcaaggttctcac 
ttggcaccagctgtcattcatgctttggaaaagtttactgtatatacattagacattcct 
gttctttttggagttagtactacatcccctgaagaaacatgtgcccaggtgattcgtgaa 
gctaagagaacagcaccaagtatagtgtatgttcctcatatccacgtgtggtgggaaata 
gttggaccgacacttaaagccacatttaccacattattacagaatattccttcatttgct 
ccagttttactacttgcaacttctgacaaaccccattccgctttgccagaagaggtaatt 
tatgtggggcggtatcattatactttcaaaaagagttgtttctaccacaataatcataac 
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ttattgaaaaaattcaaatgctgtagaagtgaataaaaggataagttttcttatcaaata 
taccattggtaactatttggtatatgtgctcattgggcaagatggtatatgtatatgtgt 
atgtatatattaaggatacataaattttttctcttgaaaaatggtattatatgccgggtg 
cagtggctcacgcctgtaatcccagcactttgggaggccgaggtgggtggatcatgaggt 
caggagattgagaccatcctggctaacagggtgaaaccctgtccctactaaaaatacaaa 
aaaaattagctggacgtggtggctggcgcctgcagtcccagctactcgggagactgaagc 
aggagaatggcatgaaccctggaggcagagcttgcagtgagccgagtgagccactgcact 
ccagcctgggcgacagagcgagactctgtctcaaaagtgcaagaattgtttatccgtgat 
tatggagagatttttaatgtccagttaccggataaagaagaacggacaaaattttttgaa 
gatttaattctaaaacaagctgctaagcctcctatatcaaaaaagaaagcagttttgcag 
gctttggaggtactcccagtagcaccaccacctgagccaagatcactgacagcagaagaa 
gtgaaacgactagaagaacaagaagaagatacatttagagaactgaggattttcttaaga 
aatgttacacataggcttgctattgacaagcgattccgagtgtttactaagcctgttgac 
cctgatgagaggaaagaatataggctgtactggagaatacagctgcccaagaataagaaa 
caattactggaggatacagctgcccaaggaatagaaacaaatacaaaacaaaaacagcct 
taaatcaagtgaaaggttttatgttaaatatattatcataatctcttacgtatttattct 
ctatctcctatattgcctggcatcgaaacttgactcacatgttatgtttagagttaattt 
tgtgatttgatcatttgcttgaattttaggttcctgattatgtcactgtaataaagcaac 
caatggacctttcatctgtaatcagtaaaattgatctacacaagtatctgactgtgaaag 
actatttgagagatattgatctaatctgtagtaatgccttagaatacaatccagatagag 
atcctggagatcgtcttattaggcatagagcctgtgctttaagagatactgcctatgcca 
taattaaagaagaacttgatgaagactttgagcagctctgtgaagaaattcaggaatcta 
gaaagaaaagaggttgtagctcctccaaatatgccccgtcttactaccatgtgatgccaa 
agcaaaattccactcttgttggtgataaaagatcagacccagagcagaatgaaaagctaa 
agacaccgagtactcctgtggcttgcagcactcctggtaagtactcaagtagttttcatt 
tatgaaattataactttattcttttctaatgccattaactttctcccttattttccttta 
tttgtgttaatgcagtgaattacattgattttgttactattaagacaactttgcattctt 
gaaataaatccaagtttataatgtaccacttgcaacccagaagtcaccatctaattgtgg 
actgtcttgatttgaaactgatgtctgtaaattctggtctatttcttactcccccttatt 
cttagaaggtaactccacgttacctattcaaagcctgtagggttttctaaggcctgcctt 
ccttggcagacttctaattccaatttttgttctttaagcccctttgtcagactgcttagc 
ttctcacagtatgctctattcttgtttctgtattttaatcatacaaaatactgtggtttc 
tcactatttatcctttcatatatcagggtttttcactctttgaatttcagaaaatggttg 
aatgttctttcatcatttttctttaaatttcttgaagctcagttgaagaggaaaattcgc 
aaaaagtcaaactggtacttaggcaccataaaaaagcgaaggaagatttcacaggcaaag 
gatgatagccagaatgccatagatcacaaaattgagagtgatacagaggaaactcaagac 
acaagtgtagatcataatgagaccggaaacacaggagagtcttcggtggaagaaaatgaa 
aaacagcaaaatgcctctgaaagcaaactggaattgagaaataattcaaatacttgtaat 
atagagaatgagcttgaagactctaggaagactacagcatgtacagaattgagagacaag 
attgcttgtaatggagatgcttctagctctcagataatacatatttctgatgaaaatgaa 
ggaaaagaaatgtgtgttctgcgaatgactcgagctagacgttcccaggtagaacagcag 
cagctcatcactgttgaaaaggctttggcaattctttctcagcctacaccctcacttgtt 
gtggatcatgagcgattaaaaaatcttttgaagactgttgttaaaaaaagtcaaaactac 
aacatatttcagttggaaaatttgtatgcagtaatcagccaatgtatttatcggcatcgc 
aaggaccatgataaaacatcacttattcaggtaaactcttgagtggaaaaaaatggagca 
agaggtagaaaacttcagttgttccagatgatgatgtcatggtatcgagtattctttata 
ttcagttcctatttaagtcatttttgtcatgtccgcctaattgatgtagtatgaaaccct 
gcatctttaaggaaaagattaaaatagtaaaataaaagtatttaaactttcctgatattt 
atgtacatattaagataaatgtcatgtgtaagataactgataaatattggaactttgcta 
gaacaagaccctgtgantaatttaatggtagcattatttaaggattaaaattgtccattg 
gtgcttaaagaataaagcttgacttaaaagtaagtttggcggaaaacctttttggggcct 
taaggaaaaaaccaaaaaggaaaacccccttgggtaagtagtaatagtaataatagttga 
agtttggccaactcttaataaagttattttggtaactaatgttttatggcacttaagaat 
aattagcagcgttaaattttgtttgtattaagcacttttaattttatccttcctaaaaat 
agtttattgtatctgacaagaaacttacttaaccattgtgtccttcccatcttttttgtc 
atctttgttttcttcaaatgccctcctcccatctgccttgagattcccttgtcttcactt 
aaaagccagagtgcaagtcatgatttgcgggagggctcttgaaccacttctggctgcacc 
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acaattctgtacttgagtatcacagtcattgtttttgagacaaacatttttataattcta 
atttgggttaataaagattttaaatatttcttggtttacttttgtaattatatacacaac 
aaatgtattaataactaccttgttaaacaccttttaatagcacaaggtttttatatttgc 
aagctgttgatatctttctaaaactgtttaggttatagtctattgatactttttatatac 
aattttataaatataaatattataattttatattaatggtaccaaaaatacatttcttaa 
ggttaaaagcatgcacttccatgcatacttgcttttggggagagtggggagaagacattc 
taataatcagtttgtgaaatagcttctgttggaaaccttttgaggggaataaggaatggt 
catctaaaatgagagattctggattttaatgcagttcaaagttgagctgtatttttgttg 
ttgatttatctggattttttttaaagccttctaaaacccagtgaattcaataccttaatt 
agtacatactatcttatgtaatgcataaagcaatgccagtcactgagaacatttaaatat 
atttatattcctggagatacacattctcatttttgttggtttattataaattattcttct 
agatgcatcttttataactaggatttcattttgtgtgtatagcttatgtaataaatttta 
aaggtgaaaactctcttaaaaaaacaggtyccactgsagaaaccattttaaaatataatt 
taattatttccctgggargaatacccratttccctcatttttttkgkytggggytttayt 
tattaamaattaatttcyttyctaaaratggcaattcttttttataaaccttagggattt 
tcccttttttggggtggtattagcctttatgtaaattaaattttttaaagggggaaaaaa 
cttctcttttaaaa 

SEQ ID NO:472. >Z44462JU # TY Transcript # LN 5361 # Source Gene: 244462 
# Encoded protein: Z4 4462_P1 

gtcaaaacctgtcaaaggccaagtgctttatttccctccagtcaaggactgggcattttg 
catttgaatcctggatggcgggaagcagcaagcaggtgtcaacaggggtggggaggagac 
gccggaccccaccagccaatcagagcgcgcagaggcctcccttcccccgccgccgtcccg 
cgccaaaattccaaacgggctacgcaccgctcccaccgtctccttccgccgagctcagcg 
agccggaactatgtgaagaaaaagaagtcagcgttttccccaggactctccgctcaggtc 
ctctatactgtcggcctgtgttcctctccacgttggcccggcctccttcgctctacaaat 
gggaggagagtttcgagcctgctgttctatggtggtggcgtgggcgggacagacggcgga 
gggattctgggcgcgcgcgctgcggccgctggcgcggagcttgtggcgccagaattcgga 
gcgcggaagagccagagctgcgagcgcctggagctggatctctctccggtcgcgcacgcc 
gaggccagtagggagagaagatggtggttctccgcagcagcttggagctgcacaaccact 
ccgcggcctcggccacgggctccttggacctgtccagtgacttcctcagtctggagcaca 
tcggccggaggcggctccgctcggccggcgcggcgcagaagaaacccgcggcgaccacag 
ccaaagcgggcgatgggtcatcagttaaggaagttgaaacctaccaccggacacgtgctt 
taagatctttgagaaaagatgcacagaattcttcagattctagttttgagaagaatgtgg 
aaataacggagcaacttgctaatggcaggcattttacaaggcagttggccagacagcagg 
ctgataaaaaaaaagaagagcacagagaagacaaagtgattccagttactcggtcattga 
gggctagaaacatcgttcaaagtacagaacacttacatgaagataatggtgatgttgaag 
tgcgtcgaagttgtaggattagaagtcgttatagtggtgtaaaccagtccatgctgtttg 
acaaacttataactaacactgctgaagctgtacttcaaaaaatggatgacatgaagaaga 
tgcgtagacagcgaatgagagaacttgaagacttgggagtgtttaatgaaacagaagaaa 
gcaatcttaatatgtacacaagaggaaaacagaaagatattcaaagaactgatgaagaaa 
caactgataatcaagaaggcagtgtggagtcatctgaagagggtgaagaccaagaacatg 
aagatgatggtgaagatgaagatgatgaagatgatgatgatgatgacgatgatgatgatg 
atgatgatgatgaagatgatgaagatgaagaagatggagaagaagagaatcagaagcgat 
attatcttagacagagaaaagctactgtttactatcaggctccattggaaaaacctcgtc 
accagagaaagcccaacatattttatagtggcccagcttctcctgcaagaccaagatacc 
gattatcttccgcaggaccaagaagtccttactgtaaacgaatgaacaggcgaaggcatg 
caatccacagtagtgactcgacttcatcttcctcctctgaagatgaacagcactttgaga 
ggcggaggaaaaggagtcgtaatagggctatcaataggtgcctcccactaaattttcgga 
aagatgaattaaaaggcatttataaagatcgaatgaaaattggagcaagccttgccgatg 
ttgatccaatgcaactagattcttcagtacgatttgatagtgttggtggcctgtctaatc 
atatagcagctctaaaagagatggtggtgtttccattactttatccagaagtctttgaaa 
aatttaaaattcaacccccaagaggttgtttgttttatgggccacctggaactggaaaga 
ctctggttgccagagcacttgccaatgagtgcagtcaaggggataaaagagtagcatttt 
tcatgaggaaaggtgctgattgtctaagtaaatgggtaggagaatctgaaagacagctac 
gattgctgtttgatcaggcctatcagatgcgcccatcaattattttttttgacgaaattg 
atggtctggctccagtacggtcaagcaggcaagatcagattcacagttctattgtttcca 
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ccctgctagctcttatggatggattggacagcagaggggaaattgtggtcattggtgcta 
cgaacaggctagattctatagatcctgctttacgaaggcctggtcgctttgatagagaat 
tcctctttagcctgcctgataaagaggctcgaaaagagattctaaagattcacaccaggg 
attggaatcccaaaccactggacacatttttagaagagctagcagaaaactgtgttggat 
actgtggagcagatattaaatcaatatgtgctgaagctgctttatgtgctttacgacgac 
gctacccacagatctataccactagtgagaaactgcagttggatctctcttcaattaata 
tctcagctaaggatttcgaggtagctatgcaaaagatgataccagcctcccaaagagctg 
tgacatcacctgggcaggcactgtccaccgttgtgaaaccactcctgcaaaacactgttg 
acaagattttagaagccctgcagagagtatttccacatgcagaattcagaacaaataaaa 
cattagactcagatatttcttgtcctctgctagaaagtgacttggcttacagtgatgatg 
atgttccatcagtttatgaaaatggactttctcagaaatcttctcataaggcaaaagaca 
attttaattttcttcatttgaatagaaatgcttgttaccaacctatgtcttttcgaccaa 
gaatattgatagtaggagaaccaggatttgggcaaggttctcacttggcaccagctgtca 
ttcatgctttggaaaagtttactgtatatacattagacattcctgttctttttggagtta 
gtactacatcccctgaagaaacatgtgcccaggtgattcgtgaagctaagagaacagcac 
caagtatagtgtatgttcctcatatccacgtgtggtgggaaatagttggaccgacactta 
aagccacatttaccacattattacagaatattccttcatttgctccagttttactacttg 
caacttctgacaaaccccattccgctttgccagaagaggtgcaagaattgtttatccgtg 
attatggagagatttttaatgtccagttaccggataaagaagaacggacaaaattttttg 
aagatttaattctaaaacaagctgctaagcctcctatatcaaaaaagaaagcagttttgc 
aggctttggaggtactcccagtagcaccaccacctgagccaagatcactgacagcagaag 
aagtgaaacgactagaagaacaagaagaagatacatttagagaactgaggattttcttaa 
gaaatgttacacataggcttgctattgacaagcgattccgagtgtttactaagcctgttg 
accctgatgaggttcctgattatgtcactgtaataaagcaaccaatggacctttcatctg 
taatcagtaaaattgatctacacaagtatctgactgtgaaagactatttgagagatattg 
atctaatctgtagtaatgccttagaatacaatccagatagagatcctggagatcgtctta 
ttaggcatagagcctgtgctttaagagatactgcctatgccataattaaagaagaacttg 
atgaagactttgagcagctctgtgaagaaattcaggaatctagaaagaaaagaggttgta 
gctcctccaaatatgccccgtcttactaccatgtgatgccaaagcaaaattccactcttg 
ttggtgataaaagatcagacccagagcagaatgaaaagctaaagacaccgagtactcctg 
tggcttgcagcactcctgctcagttgaagaggaaaattcgcaaaaagtcaaactggtact 
taggcaccataaaaaagcgaaggaagatttcacaggcaaaggatgatagccagaatgcca 
tagatcacaaaattgagagtgatacagaggaaactcaagacacaagtgtagatcataatg 
agaccggaaacacaggagagtcttcggtggaagaaaatgaaaaacagcaaaatgcctctg 
aaagcaaactggaattgagaaataattcaaatacttgtaatatagagaatgagcttgaag 
actctaggaagactacagcatgtacagaattgagagacaagattgcttgtaatggagatg 
cttctagctctcagataatacatatttctgatgaaaatgaaggaaaagaaatgtgtgttc 
tgcgaatgactcgagctagacgttcccaggtagaacagcagcagctcatcactgttgaaa 
aggctttggcaattctttctcagcctacaccctcacttgttgtggatcatgagcgattaa 
aaaatcttttgaagactgttgttaaaaaaagtcaaaactacaacatatttcagttggaaa 
atttgtatgcagtaatcagccaatgtatttatcggcatcgcaaggaccatgataaaacat 
cacttattcagaaaatggagcaagaggtagaaaacttcagttgttccagatgatgatgtc 
atggtatcgagtattctttatattcagttcctatttaagtcatttttgtcatgtccgcct 
aattgatgtagtatgaaaccctgcatctttaaggaaaagattaaaatagtaaaataaaag 
tatttaaactttcctgatatttatgtacatattaagataaatgtcatgtgtaagataact 
gataaatattggaactttgctagaacaagaccctgtagtaatagtaataatagttgaagt 
ttggccaactcttaataaagttattttggtaactaatgttttatggcacttaagaataat 
tagcagcgttaaattttgtttgtattaagcacttttaattttatccttcctaaaaatagt 
ttattgtatctgacaagaaacttacttaaccattgtgtccttcccatcttttttgtcatc 
tttgttttcttcaaatgccctcctcccatctgccttgagattcccttgtcttcacttaaa 
agccagagtgcaagtcatgatttgcgggagggctcttgaaccacttctggctgcaccaca 
attctgtacttgagtatcacagtcattgtttttgagacaaacatttttataattctaatt 
tgggttaataaagattttaaa 

SEQ ID NO: 473. >Z44462JT2 # TY Transcript # LN 6021 # Source Gene: Z44462 
# Encoded protein: Z44 4 62_P1 

gtcaaaacctgtcaaaggccaagtgctttatttccctccagtcaaggactgggcattttg 
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Figure 1 (Cont'd) 



catttgaatcctggatggcgggaagcagcaagcaggtgtcaacaggggtggggaggagac 
gccggaccccaccagccaatcagagcgcgcagaggcctcccttcccccgccgccgtcccg 
cgccaaaattccaaacgggctacgcaccgctcccaccgtctccttccgccgagctcagcg 
agccggaactatgtgaagaaaaagaagtcagcgttttccccaggactctccgctcaggtc 
ctctatactgtcggcctgtgttcctctccacgttggcccggcctccttcgctctacaaat 
gggaggagagtttcgagcctgctgttctatggtggtggcgtgggcgggacagacggcgga 
gggattctgggcgcgcgcgctgcggccgctggcgcggagcttgtggcgccagaattcgga 
gcgcggaagagccagagctgcgagcgcctggagctggatctctctccggtcgcgcacgcc 
gaggccagtagggagagaagatggtggttctccgcagcagcttggagctgcacaaccact 
ccgcggcctcggccacgggctccttggacctgtccagtgacttcctcagtctggagcaca 
tcggccggaggcggctccgctcggccggcgcggcgcagaagaaacccgcggcgaccacag 
ccaaagcgggcgatgggtcatcagttaaggaagttgaaacctaccaccggacacgtgctt 
taagatctttgagaaaagatgcacagaattcttcagattctagttttgagaagaatgtgg 
aaataacggagcaacttgctaatggcaggcattttacaaggcagttggccagacagcagg 
ctgataaaaaaaaagaagagcacagagaagacaaagtgattccagttactcggtcattga 
gggctagaaacatcgttcaaagtacagaacacttacatgaagataatggtgatgttgaag 
tgcgtcgaagttgtaggattagaagtcgttatagtggtgtaaaccagtccatgctgtttg 
acaaacttataactaacactgctgaagctgtacttcaaaaaatggatgacatgaagaaga 
tgcgtagacagcgaatgagagaacttgaagacttgggagtgtttaatgaaacagaagaaa 
gcaatcttaatatgtacacaagaggaaaacagaaagatattcaaagaactgatgaagaaa 
caactgataatcaagaaggcagtgtggagtcatctgaagagggtgaagaccaagaacatg 
aagatgatggtgaagatgaagatgatgaagatgatgatgatgatgacgatgatgatgatg 
atgatgatgatgaagatgatgaagatgaagaagatggagaagaagagaatcagaagcgat 
attatcttagacagagaaaagctactgtttactatcaggctccattggaaaaacctcgtc 
accagagaaagcccaacatattttatagtggcccagcttctcctgcaagaccaagatacc 
gattatcttccgcaggaccaagaagtccttactgtaaacgaatgaacaggcgaaggcatg 
caatccacagtagtgactcgacttcatcttcctcctctgaagatgaacagcactttgaga 
ggcggaggaaaaggagtcgtaatagggctatcaataggtgcctcccactaaattttcgga 
aagatgaattaaaaggcatttataaagatcgaatgaaaattggagcaagccttgccgatg 
ttgatccaatgcaactagattcttcagtacgatttgatagtgttggtggcctgtctaatc 
atatagcagctctaaaagagatggtggtgtttccattactttatccagaagtctttgaaa 
aatttaaaattcaacccccaagaggttgtttgttttatgggccacctggaactggaaaga 
ctctggttgccagagcacttgccaatgagtgcagtcaaggggataaaagagtagcatttt 
tcatgaggaaaggtgctgattgtctaagtaaatgggtaggagaatctgaaagacagctac 
gattgctgtttgatcaggcctatcagatgcgcccatcaattattttttttgacgaaattg 
atggtctggctccagtacggtcaagcaggcaagatcagattcacagttctattgtttcca 
ccctgctagctcttatggatggattggacagcagaggggaaattgtggtcattggtgcta 
cgaacaggctagattctatagatcctgctttacgaaggcctggtcgctttgatagagaat 
tcctctttagcctgcctgataaagaggctcgaaaagagattctaaagattcacaccaggg 
attggaatcccaaaccactggacacatttttagaagagctagcagaaaactgtgttggat 
actgtggagcagatattaaatcaatatgtgctgaagctgctttatgtgctttacgacgac 
gctacccacagatctataccactagtgagaaactgcagttggatctctcttcaattaata 
tctcagctaaggatttcgaggtagctatgcaaaagatgataccagcctcccaaagagctg 
tgacatcacctgggcaggcactgtccaccgttgtgaaaccactcctgcaaaacactgttg 
acaagattttagaagccctgcagagagtatttccacatgcagaattcagaacaaataaaa 
cattagactcagatatttcttgtcctctgctagaaagtgacttggcttacagtgatgatg 
atgttccatcagtttatgaaaatggactttctcagaaatcttctcataaggcaaaagaca 
attttaattttcttcatttgaatagaaatgcttgttaccaacctatgtcttttcgaccaa 
gaatattgatagtaggagaaccaggatttgggcaaggttctcacttggcaccagctgtca 
ttcatgctttggaaaagtttactgtatatacattagacattcctgttctttttggagtta 
gtactacatcccctgaagaaacatgtgcccaggtgattcgtgaagctaagagaacagcac 
caagtatagtgtatgttcctcatatccacgtgtggtgggaaatagttggaccgacactta 
aagccacatttaccacattattacagaatattccttcatttgctccagttttactacttg 
caacttctgacaaaccccattccgctttgccagaagaggtgcaagaattgtttatccgtg 
attatggagagatttttaatgtccagttaccggataaagaagaacggacaaaattttttg 
aagatttaattctaaaacaagctgctaagcctcctatatcaaaaaagaaagcagttttgc 
aggctttggaggtactcccagtagcaccaccacctgagccaagatcactgacagcagaag 



339/365 



Figur 1 (Cont'd) 



aagtgaaacgactagaagaacaagaagaagatacatttagagaactgaggattttcttaa 
gaaatgttacacataggcttgctattgacaagcgattccgagtgtttactaagcctgttg 
accctgatgaggttcctgat-tatgtcactgtaataaagcaaccaatggacctttcatctg 
taatcagtaaaattgatctacacaagtatctgactgtgaaagactatttgagagatattg 
atctaatctgtagtaatgccttagaatacaatccagatagagatcctggagatcgtctta 
ttaggcatagagcctgtgctttaagagatactgcctatgccataattaaagaagaacttg 
atgaagactttgagcagctctgtgaagaaattcaggaatctagaaagaaaagaggttgta 
gctcctccaaatatgccccgtcttactaccatgtgatgccaaagcaaaattccactcttg 
ttggtgataaaagatcagacccagagcagaatgaaaagctaaagacaccgagtactcctg 
tggcttgcagcactcctgctcagttgaagaggaaaattcgcaaaaagtcaaactggtact 
taggcaccataaaaaagcgaaggaagatttcacaggcaaaggatgatagccagaatgcca 
tagatcacaaaattgagagtgatacagaggaaactcaagacacaagtgtagatcataatg 
agaccggaaacacaggagagtcttcggtggaagaaaatgaaaaacagcaaaatgcctctg 
aaagcaaactggaattgagaaataattcaaatacttgtaatatagagaatgagcttgaag 
actctaggaagactacagcatgtacagaattgagagacaagattgcttgtaatggagatg 
cttctagctctcagataatacatatttctgatgaaaatgaaggaaaagaaatgtgtgttc 
tgcgaatgactcgagctagacgttcccaggtagaacagcagcagctcatcactgttgaaa 
aggctttggcaattctttctcagcctacaccctcacttgttgtggatcatgagcgattaa 
aaaatcttttgaagactgttgttaaaaaaagtcaaaactacaacatatttcagttggaaa 
atttgtatgcagtaatcagccaatgtatttatcggcatcgcaaggaccatgataaaacat 
cacttattcagaaaatggagcaagaggtagaaaacttcagttgttccagatgatgatgtc 
atggtatcgagtattctttatattcagttcctatttaagtcatttttgtcatgtccgcct 
aattgatgtagtatgaaaccctgcatctttaaggaaaagattaaaatagtaaaataaaag 
tatttaaactttcctgatatttatgtacatattaagataaatgtcatgtgtaagataact 
gataaatattggaactttgctagaacaagaccctgtagtaatagtaataatagttgaagt 
ttggccaactcttaataaagttattttggtaactaatgttttatggcacttaagaataat 
tagcagcgttaaattttgtttgtattaagcacttttaattttatccttcctaaaaatagt 
ttattgtatctgacaagaaacttacttaaccattgtgtccttcccatcttttttgtcatc 
tttgttttcttcaaatgccctcctcccatctgccttgagattcccttgtcttcacttaaa 
agccagagtgcaagtcatgatttgcgggagggctcttgaaccacttctggctgcaccaca 
attctgtacttgagtatcacagtcattgtttttgagacaaacatttttataattctaatt 
tgggttaataaagattttaaatatttcttggtttacttttgtaattatatacacaacaaa 
tgtattaataactaccttgttaaacaccttttaatagcacaaggtttttatatttgcaag 
ctgttgatatctttctaaaactgtttaggttatagtctattgatactttttatatacaat 
tttataaatataaatattataattttatattaatggtaccaaaaatacatttcttaaggt 
taaaagcatgcacttccatgcatacttgcttttggggagagtggggagaagacattctaa 
taatcagtttgtgaaatagcttctgttggaaaccttttgaggggaataaggaatggtcat 
ctaaaatgagagattctggattttaatgcagttcaaagttgagctgtatttttgttgttg 
atttatctggattttttttaaagccttctaaaacccagtgaattcaataccttaattagt 
acatactatcttatgtaatgcataaagcaatgccagtcactgagaacatttaaatatatt 
tatattcctggagatacacattctcatttttgttggtttattataaattattcttctaga 
tgcatcttttataactaggatttcattttgtgtgtatagcttatgtaataaattttaaag 
gtgaaaactctcttaaaaaaa 

SEQ ID NO:474. >Z44462_T3 # TY Transcript # LN 5873 # Source Gene: Z44462 
& Encoded protein: Z4 44 62_P1 

gtcaaaacctgtcaaaggccaagtgctttatttccctccagtcaaggactgggcattttg 
catttgaatcctggatggcgggaagcagcaagcaggtgtcaacaggggtggggaggagac 
gccggaccccaccagccaatcagagcgcgcagaggcctcccttcccccgccgccgtcccg 
cgccaaaattccaaacgggctacgcaccgctcccaccgtctccttccgccgagctcagcg 
agccggaactatgtgaagaaaaagaagtcagcgttttccccaggactctccgctcaggtc 
ctctatactgtcggcctgtgttcctctccacgttggcccggcctccttcgctctacaaat 
gggaggagagtttcgagcctgctgttctatggtggtggcgtgggcgggacagacggcgga 
gggattctgggcgcgcgcgctgcggccgctggcgcggagcttgtggcgccagaattcgga 
gcgcggaagagccagagctgcgagcgcctggagctggatctctctccggtcgcgcacgcc 
gaggccagtagggagagaagatggtggttctccgcagcagcttggagctgcacaaccact 
ccgcggcctcggccacgggctccttggacctgtccagtgacttcctcagtctggagcaca 
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Figure 1 (Cont'd) 



tcggccggaggcggctccgctcggccggcgcggcgcagaagaaacccgcggcgaccacag 
ccaaagcgggcgatgggtcatcagttaaggaagttgaaacctaccaccggacacgtgctt 
taagatctttgagaaaagatgcacagaattcttcagattctagttttgagaagaatgtgg 
aaataacggagcaacttgctaatggcaggcattttacaaggcagttggccagacagcagg 
ctgataaaaaaaaagaagagcacagagaagacaaagtgattccagttactcggtcattga 
gggctagaaacatcgttcaaagtacagaacacttacatgaagataatggtgatgttgaag 
tgcgtcgaagttgtaggattagaagtcgttatagtggtgtaaaccagtccatgctgtttg 
acaaacttataactaacactgctgaagctgtacttcaaaaaatggatgacatgaagaaga 
tgcgtagacagcgaatgagagaacttgaagacttgggagtgtttaatgaaacagaagaaa 
gcaatcttaatatgtacacaagaggaaaacagaaagatattcaaagaactgatgaagaaa 
caactgataatcaagaaggcagtgtggagtcatctgaagagggtgaagaccaagaacatg 
aagatgatggtgaagatgaagatgatgaagatgatgatgatgatgacgatgatgatgatg 
atgatgatgatgaagatgatgaagatgaagaagatggagaagaagagaatcagaagcgat 
attatcttagacagagaaaagctactgtttactatcaggctccattggaaaaacctcgtc 
accagagaaagcccaacatattttatagtggcccagcttctcctgcaagaccaagatacc 
gattatcttccgcaggaccaagaagtccttactgtaaacgaatgaacaggcgaaggcatg 
caatccacagtagtgactcgacttcatcttcctcctctgaagatgaacagcactttgaga 
ggcggaggaaaaggagtcgtaatagggctatcaataggtgcctcccactaaattttcgga 
aagatgaattaaaaggcatttataaagatcgaatgaaaattggagcaagccttgccgatg 
ttgatccaatgcaactagattcttcagtacgatttgatagtgttggtggcctgtctaatc 
atatagcagctctaaaagagatggtggtgtttccattactttatccagaagtctttgaaa 
aatttaaaattcaacccccaagaggttgtttgttttatgggccacctggaactggaaaga 
ctctggttgccagagcacttgccaatgagtgcagtcaaggggataaaagagtagcatttt 
tcatgaggaaaggtgctgattgtctaagtaaatgggtaggagaatctgaaagacagctac 
gattgctgtttgatcaggcctatcagatgcgcccatcaattattttttttgacgaaattg 
atggtctggctccagtacggtcaagcaggcaagatcagattcacagttctattgtttcca 
ccctgctagctcttatggatggattggacagcagaggggaaattgtggtcattggtgcta 
cgaacaggctagattctatagatcctgctttacgaaggcctggtcgctttgatagagaat 
tcctctttagcctgcctgataaagaggctcgaaaagagattctaaagattcacaccaggg 
attggaatcccaaaccactggacacatttttagaagagctagcagaaaactgtgttggat 
actgtggagcagatattaaatcaatatgtgctgaagctgctttatgtgctttacgacgac 
gctacccacagatctataccactagtgagaaactgcagttggatctctcttcaattaata 
tctcagctaaggatttcgaggtagctatgcaaaagatgataccagcctcccaaagagctg 
tgacatcacctgggcaggcactgtccaccgttgtgaaaccactcctgcaaaacactgttg 
acaagattttagaagccctgcagagagtatttccacatgcagaattcagaacaaataaaa 
cattagactcagatatttcttgtcctctgctagaaagtgacttggcttacagtgatgatg 
atgttccatcagtttatgaaaatggactttctcagaaatcttctcataaggcaaaagaca 
attttaattttcttcatttgaatagaaatgcttgttaccaacctatgtcttttcgaccaa 
gaatattgatagtaggagaaccaggatttgggcaaggttctcacttggcaccagctgtca 
ttcatgctttggaaaagtttactgtatatacattagacattcctgttctttttggagtta 
gtactacatcccctgaagaaacatgtgcccaggtgattcgtgaagctaagagaacagcac 
caagtatagtgtatgttcctcatatccacgtgtggtgggaaatagttggaccgacactta 
aagccacatttaccacattattacagaatattccttcatttgctccagttttactacttg 
caacttctgacaaaccccattccgctttgccagaagaggtgcaagaattgtttatccgtg 
attatggagagatttttaatgtccagttaccggataaagaagaacggacaaaattttttg 
aagatttaattctaaaacaagctgctaagcctcctatatcaaaaaagaaagcagttttgc 
aggctttggaggtactcccagtagcaccaccacctgagccaagatcactgacagcagaag 
aagtgaaacgactagaagaacaagaagaagatacatttagagaactgaggattttcttaa 
gaaatgttacacataggcttgctattgacaagcgattccgagtgtttactaagcctgttg 
accctgatgaggttcctgattatgtcactgtaataaagcaaccaatggacctttcatctg 
taatcagtaaaattgatctacacaagtatctgactgtgaaagactatttgagagatattg 
atctaatctgtagtaatgccttagaatacaatccagatagagatcctggagatcgtctta 
ttaggcatagagcctgtgctttaagagatactgcctatgccataattaaagaagaacttg 
atgaagactttgagcagctctgtgaagaaattcaggaatctagaaagaaaagaggttgta 
gctcctccaaatatgccccgtcttactaccatgtgatgccaaagcaaaattccactcttg 
ttggtgataaaagatcagacccagagcagaatgaaaagctaaagacaccgagtactcctg 
tggcttgcagcactcctgctcagttgaagaggaaaattcgcaaaaagtcaaactggtact 
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Figur 1 (Cont'd) 



taggcaccataaaaaagcgaaggaagatttcacaggcaaaggatgatagccagaatgcca 
tagatcacaaaattgagagtgatacagaggaaactcaagacacaagtgtagatcataatg 
agaccggaaacacaggagagtcttcggtggaagaaaatgaaaaacagcaaaatgcctctg 
aaagcaaactggaattgagaaataattcaaatacttgtaatatagagaatgagcttgaag 
actctaggaagactacagcatgtacagaattgagagacaagattgcttgtaatggagatg 
cttctagctctcagataatacatatttctgatgaaaatgaaggaaaagaaatgtgtgttc 
tgcgaatgactcgagctagacgttcccaggtagaacagcagcagctcatcactgttgaaa 
aggctttggcaattctttctcagcctacaccctcacttgttgtggatcatgagcgattaa 
aaaatcttttgaagactgttgttaaaaaaagtcaaaactacaacatatttcagttggaaa 
atttgtatgcagtaatcagccaatgtatttatcggcatcgcaaggaccatgataaaacat 
cacttattcagaaaatggagcaagaggtagaaaacttcagttgttccagatgatgatgtc 
atggtatcgagtattctttatattcagttcctatttaagtcatttttgtcatgtccgcct 
aattgatgtagtatgaaaccctgcatctttaaggaaaagattaaaatagtaaaataaaag 
tatttaaactttcctgatatttatgtacatattaagataaatgtcatgtgtaagataact 
gataaatattggaactttgctagaacaagaccctgtagtaatagtaataatagttgaagt 
ttggccaactcttaataaagttattttggtaactaatgttttatggcacttaagaataat 
tagcagcgttaaattttgtttgtattaagcacttttaattttatccttcctaaaaatagt 
ttattgtatctgacaagaaacttacttaaccattgtgtccttcccatcttttttgtcatc 
tttgttttcttcaaatgccctcctcccatctgccttgagattcccttgtcttcacttaaa 
agccagagtgcaagtcatgatttgcgggagggctcttgaaccacttctggctgcaccaca 
attctgtacttgagtatcacagtcattgtttttgagacaaacatttttataattctaatt 
tgggttaataaagattttaaatatttcttggtttacttttgtaattatatacacaacaaa 
tgtattaataactaccttgttaaacaccttttaataagcatgcacttccatgcatacttg 
cttttggggagagtggggagaagacattctaataatcagtttgtgaaatagcttctgttg 
gaaaccttttgaggggaataaggaatggtcatctaaaatgagagattctggattttaatg 
cagttcaaagttgagctgtatttttgttgttgatttatctggattttttttaaagccttc 
taaaacccagtgaattcaataccttaattagtacatactatcttatgtaatgcataaagc 
aatgccagtcactgagaacatttaaatatatttatattcctggagatacacattctcatt 
tttgttggtttattataaattattcttctagatgcatcttttataactaggatttcattt 
tgtgtgtatagcttatgtaataaattttaaaggtgaaaactctcttaaaaaaa 

SEQ ID NO: 475. >Z44462_T4 # TY Transcript # LN 6084 # Source Gene: Z44462 
# Encoded protein: 24 4 462_P1 

gtcaaaacctgtcaaaggccaagtgctttatttccctccagtcaaggactgggcattttg 
catttgaatcctggatggcgggaagcagcaagcaggtgtcaacaggggtggggaggagac 
gccggaccccaccagccaatcagagcgcgcagaggcctcccttcccccgccgccgtcccg 
cgccaaaattccaaacgggctacgcaccgctcccaccgtctccttccgccgagctcagcg 
agccggaactatgtgaagaaaaagaagtcagcgttttccccaggactctccgctcaggtc 
ctctatactgtcggcctgtgttcctctccacgttggcccggcctccttcgctctacaaat 
gggaggagagtttcgagcctgctgttctatggtggtggcgtgggcgggacagacggcgga 
gggattctgggcgcgcgcgctgcggccgctggcgcggagcttgtggcgccagaattcgga 
gcgcggaagagccagagctgcgagcgcctggagctggatctctctccggtcgcgcacgcc 
gaggccagtagggagagaagatggtggttctccgcagcagcttggagctgcacaaccact 
ccgcggcctcggccacgggctccttggacctgtccagtgacttcctcagtctggagcaca 
tcggccggaggcggctccgctcggccggcgcggcgcagaagaaacccgcggcgaccacag 
ccaaagcgggcgatgggtcatcagttaaggaagttgaaacctaccaccggacacgtgctt 
taagatctttgagaaaagatgcacagaattcttcagattctagttttgagaagaatgtgg 
aaataacggagcaacttgctaatggcaggcattttacaaggcagttggccagacagcagg 
ctgataaaaaaaaagaagagcacagagaagacaaagtgattccagttactcggtcattga 
gggctagaaacatcgttcaaagtacagaacacttacatgaagataatggtgatgttgaag 
tgcgtcgaagttgtaggattagaagtcgttatagtggtgtaaaccagtccatgctgtttg 
acaaacttataactaacactgctgaagctgtacttcaaaaaatggatgacatgaagaaga 
tgcgtagacagcgaatgagagaacttgaagacttgggagtgtttaatgaaacagaagaaa 
gcaatcttaatatgtacacaagaggaaaacagaaagatattcaaagaactgatgaagaaa 
caactgataatcaagaaggcagtgtggagtcatctgaagagggtgaagaccaagaacatg 
aagatgatggtgaagatgaagatgatgaagatgatgatgatgatgacgatgatgatgatg 
atgatgatgatgaagatgatgaagatgaagaagatggagaagaagagaatcagaagcgat 
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attatcttagacagagaaaagctactgtttactatcaggctccattggaaaaacctcgtc 
accagagaaagcccaacatattttatagtggcccagcttctcctgcaagaccaagatacc 
gattatcttccgcaggaccaagaagtccttactgtaaacgaatgaacaggcgaaggcatg 
caatccacagtagtgactcgacttcatcttcctcctctgaagatgaacagcactttgaga 
ggcggaggaaaaggagtcgtaatagggctatcaataggtgcctcccactaaattttcgga 
aagatgaattaaaaggcatttataaagatcgaatgaaaattggagcaagccttgccgatg 
ttgatccaatgcaactagattcttcagtacgatttgatagtgttggtggcctgtctaatc 
atatagcagctctaaaagagatggtggtgtttccattactttatccagaagtctttgaaa 
aatttaaaattcaacccccaagaggttgtttgttttatgggccacctggaactggaaaga 
ctctggttgccagagcacttgccaatgagtgcagtcaaggggataaaagagtagcatttt 
tcatgaggaaaggtgctgattgtctaagtaaatgggtaggagaatctgaaagacagctac 
gattgctgtttgatcaggcctatcagatgcgcccatcaattattttttttgacgaaattg 
atggtctggctccagtacggtcaagcaggcaagatcagattcacagttctattgtttcca 
ccctgctagctcttatggatggattggacagcagaggggaaattgtggtcattggtgcta 
cgaacaggctagattctatagatcctgctttacgaaggcctggtcgctttgatagagaat 
tcctctttagcctgcctgataaagaggctcgaaaagagattctaaagattcacaccaggg 
attggaatcccaaaccactggacacatttttagaagagctagcagaaaactgtgttggat 
actgtggagcagatattaaatcaatatgtgctgaagctgctttatgtgctttacgacgac 
gctacccacagatctataccactagtgagaaactgcagttggatctctcttcaattaata 
tctcagctaaggatttcgaggtagctatgcaaaagatgataccagcctcccaaagagctg 
tgacatcacctgggcaggcactgtccaccgttgtgaaaccactcctgcaaaacactgttg 
acaagattttagaagccctgcagagagtatttccacatgcagaattcagaacaaataaaa 
cattagactcagatatttcttgtcctctgctagaaagtgacttggcttacagtgatgatg 
atgttccatcagtttatgaaaatggactttctcagaaatcttctcataaggcaaaagaca 
attttaattttcttcatttgaatagaaatgcttgttaccaacctatgtcttttcgaccaa 
gaatattgatagtaggagaaccaggatttgggcaaggttctcacttggcaccagctgtca 
ttcatgctttggaaaagtttactgtatatacattagacattcctgttctttttggagtta 
gtactacatcccctgaagaaacatgtgcccaggtgattcgtgaagctaagagaacagcac 
caagtatagtgtatgttcctcatatccacgtgtggtgggaaatagttggaccgacactta 
aagccacatttaccacattattacagaatattccttcatttgctccagttttactacttg 
caacttctgacaaaccccattccgctttgccagaagaggtgcaagaattgtttatccgtg 
attatggagagatttttaatgtccagttaccggataaagaagaacggacaaaattttttg 
aagatttaattctaaaacaagctgctaagcctcctatatcaaaaaagaaagcagttttgc 
aggctttggaggtactcccagtagcaccaccacctgagccaagatcactgacagcagaag 
aagtgaaacgactagaagaacaagaagaagatacatttagagaactgaggattttcttaa 
gaaatgttacacataggcttgctattgacaagcgattccgagtgtttactaagcctgttg 
accctgatgaggttcctgattatgtcactgtaataaagcaaccaatggacctttcatctg 
taatcagtaaaattgatctacacaagtatctgactgtgaaagactatttgagagatattg 
atctaatctgtagtaatgccttagaatacaatccagatagagatcctggagatcgtctta 
ttaggcatagagcctgtgctttaagagatactgcctatgccataattaaagaagaacttg 
atgaagactttgagcagctctgtgaagaaattcaggaatctagaaagaaaagaggttgta 
gctcctccaaatatgccccgtcttactaccatgtgatgccaaagcaaaattccactcttg 
ttggtgataaaagatcagacccagagcagaatgaaaagctaaagacaccgagtactcctg 
tggcttgcagcactcctgctcagttgaagaggaaaattcgcaaaaagtcaaactggtact 
taggcaccataaaaaagcgaaggaagatttcacaggcaaaggatgatagccagaatgcca 
tagatcacaaaattgagagtgatacagaggaaactcaagacacaagtgtagatcataatg 
agaccggaaacacaggagagtcttcggtggaagaaaatgaaaaacagcaaaatgcctctg 
aaagcaaactggaattgagaaataattcaaatacttgtaatatagagaatgagcttgaag 
actctaggaagactacagcatgtacagaattgagagacaagattgcttgtaatggagatg 
cttctagctctcagataatacatatttctgatgaaaatgaaggaaaagaaatgtgtgttc 
tgcgaatgactcgagctagacgttcccaggtagaacagcagcagctcatcactgttgaaa 
aggctttggcaattctttctcagcctacaccctcacttgttgtggatcatgagcgattaa 
aaaatcttttgaagactgttgttaaaaaaagtcaaaactacaacatatttcagttggaaa 
atttgtatgcagtaatcagccaatgtatttatcggcatcgcaaggaccatgataaaacat 
cacttattcagaaaatggagcaagaggtagaaaacttcagttgttccagatgatgatgtc 
atggtatcgagtattctttatattcagttcctatttaagtcatttttgtcatgtccgcct 
aattgatgtagtatgaaaccctgcatctttaaggaaaagattaaaatagtaaaataaaag 
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tatttaaactttcctgatatttatgtacatattaagataaatgtcatgtgtaagataact 
gataaatattggaactttgctagaacaagaccctgtagtaatagtaataatagttgaagt 
ttggccaactcttaataaagttattttggtaactaatgttttatggcacttaagaataat 
tagcagcgttaaattttgtttgtattaagcacttttaattttatccttcctaaaaatagt 
ttattgtatctgacaagaaacttacttaaccattgtgtccttcccatcttttttgtcatc 
tttgttttcttcaaatgccctcctcccatctgccttgagattcccttgtcttcacttaaa 
agccagagtgcaagtcatgatttgcgggagggctcttgaaccacttctggctgcaccaca 
attctgtacttgagtatcacagtcattgtttttgagacaaacatttttataattctaatt 
tgggttaataaagattttaaatatttcttggtttacttttgtaattatatacacaacaaa 
tgtattaataactaccttgttaaacaccttttaatagcacaaggtttttatatttgcaag 
ctgttgatatctttctaaaactgtttaggttatagtctattgatactttttatatacaat 
tttataaatataaatattataattttatattaatggtaccaaaaatacatttcttaaggt 
taaaagcatgcacttccatgcatacttgcttttggggagagtggggagaagacattctaa 
taatcagtttgtgaaatagcttctgttggaaaccttttgaggggaataaggaatggtcat 
ctaaaatgagagattctggattttaatgcagttcaaagttgagctgtatttttgttgttg 
atttatctggattttttttaaagccttctaaaacccagtgaattcaataccttaattagt 
acatactatcttatgtaatgcataaagcaatgcccaggtyccactgsagaaaccatttta 
aaatataatttaattatttccctgggargaatacccratttccctcatttttttkgkytg 
gggytttayttattaamaattaatttcyttyctaaaratggcaattcttttttataaacc 
ttagggattttcccttttttggggtggtattagcctttatgtaaattaaattttttaaag 
ggggaaaaaacttctcttttaaaa 

SEQ ID NO: 476. >Z44462_T5 # TY Transcript # LN 5014 « Source Gene: Z44462 
# Encoded protein: Z444 62_P9 

gtcaaaacctgtcaaaggccaagtgctttatttccctccagtcaaggactgggcattttg 
catttgaatcctggatggcgggaagcagcaagcaggtgtcaacaggggtggggaggagac 
gccggaccccaccagccaatcagagcgcgcagaggcctcccttcccccgccgccgtcccg 
cgccaaaattccaaacgggctacgcaccgctcccaccgtctccttccgccgagctcagcg 
agccggaactatgtgaagaaaaagaagtcagcgttttccccaggactctccgctcaggtc 
ctctatactgtcggcctgtgttcctctccacgttggcccggcctccttcgctctacaaat 
gggaggagagtttcgagcctgctgttctatggtggtggcgtgggcgggacagacggcgga 
gggattctgggcgcgcgcgctgcggccgctggcgcggagcttgtggcgccagaattcgga 
gcgcggaagagccagagctgcgagcgcctggagctggatctctctccggtcgcgcacgcc 
gaggccagtagggagagaagatggtggttctccgcagcagcttggagctgcacaaccact 
ccgcggcctcggccacgggctccttggacctgtccagtgacttcctcagtctggagcaca 
tcggccggaggcggctccgctcggccggcgcggcgcagaagaaacccgcggcgaccacag 
ccaaagcgggcgatgggtcatcagttaaggaagttgaaacctaccaccggacacgtgctt 
taagatctttgagaaaagatgcacagaattcttcagattctagttttgagaagaatgtgg 
aaataacggagcaacttgctaatggcaggcattttacaaggcagttggccagacagcagg 
ctgataaaaaaaaagaagagcacagagaagacaaagtgattccagttactcggtcattga 
gggctagaaacatcgttcaaagtacagaacacttacatgaagataatggtgatgttgaag 
tgcgtcgaagttgtaggattagaagtcgttatagtggtgtaaaccagtccatgctgtttg 
acaaacttataactaacactgctgaagctgtacttcaaaaaatggatgacatgaagaaga 
tgcgtagacagcgaatgagagaacttgaagacttgggagtgtttaatgaaacagaagaaa 
gcaatcttaatatgtacacaagaggaaaacagaaagatattcaaagaactgatgaagaaa 
caactgataatcaagaaggcagtgtggagtcatctgaagagggtgaagaccaagaacatg 
aagatgatggtgaagatgaagatgatgaagatgatgatgatgatgacgatgatgatgatg 
atgatgatgatgaagatgatgaagatgaagaagatggagaagaagagaatcagaagcgat 
attatcttagacagagaaaagctactgtttactatcaggctccattggaaaaacctcgtc 
accagagaaagcccaacatattttatagtggcccagcttctcctgcaagaccaagatacc 
gattatcttccgcaggaccaagaagtccttactgtaaacgaatgaacaggcgaaggcatg 
caatccacagtagtgactcgacttcatcttcctcctctgaagatgaacagcactttgaga 
ggcggaggaaaaggagtcgtaatagggctatcaataggtgcctcccactaaattttcgga 
aagatgaattaaaaggcatttataaagatcgaatgaaaattggagcaagccttgccgatg 
ttgatccaatgcaactagattcttcagtacgatttgatagtgttggtggcctgtctaatc 
atatagcagctctaaaagagatggtggtgtttccattactttatccagaagtctttgaaa 
aatttaaaattcaacccccaagaggttgtttgttttatgggccacctggaactggaaaga 
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ctctggttgccagagcacttgccaatgagtgcagtcaaggggataaaagagtagcatttt 
tcatgaggaaaggtgctgattgtctaagtaaatgggtaggagaatctgaaagacagctac 
gattgctgtttgatcaggcctatcagatgcgcccatcaattattttttttgacgaaattg 
atggtctggctccagtacggtcaagcaggcaagatcagattcacagttctattgtttcca 
ccctgctagctcttatggatggattggacagcagaggggaaattgtggtcattggtgcta 
cgaacaggctagattctatagatcctgctttacgaaggcctggtcgctttgatagagaat 
tcctctttagcctgcctgataaagaggctcgaaaagagattctaaagattcacaccaggg 
attggaatcccaaaccactggacacatttttagaagagctagcagaaaactgtgttggat 
actgtggagcagatattaaatcaatatgtgctgaagctgctttatgtgctttacgacgac 
gctacccacagatctataccactagtgagaaactgcagttggatctctcttcaattaata 
tctcagctaaggatttcgaggtagctatgcaaaagatgataccagcctcccaaagagctg 
tgacatcacctgggcaggcactgtccaccgttgtgaaaccactcctgcaaaacactgttg 
acaagattttagaagccctgcagagagtatttccacatgcagaattcagaacaaataaaa 
cattagactcagatatttcttgtcctctgctagaaagtgacttggcttacagtgatgatg 
atgttccatcagtttatgaaaatggactttctcagaaatcttctcataaggcaaaagaca 
attttaattttcttcatttgaatagaaatgcttgttaccaacctatgtcttttcgaccaa 
gaatattgatagtaggagaaccaggatttgggcaaggttctcacttggcaccagctgtca 
ttcatgctttggaaaagtttactgtatatacattagacattcctgttctttttggagtta 
gtactacatcccctgaagaaacatgtgcccaggtgattcgtgaagctaagagaacagcac 
caagtatagtgtatgttcctcatatccacgtgtggtgggaaatagttggaccgacactta 
aagccacatttaccacattattacagaatattccttcatttgctccagttttactacttg 
caacttctgacaaaccccattccgctttgccagaagaggtgcaagaattgtttatccgtg 
attatggagagatttttaatgtccagttaccggataaagaagaacggacaaaattttttg 
aagatttaattctaaaacaagctgctaagcctcctatatcaaaaaagaaagcagttttgc 
aggctttggaggtactcccagtagcaccaccacctgagccaagatcactgacagcagaag 
aagtgaaacgactagaagaacaagaagaagatacatttagagaactgaggattttcttaa 
gaaatgttacacataggcttgctattgacaagcgattccgagtgtttactaagcctgttg 
accctgatgaggttcctgattatgtcactgtaataaagcaaccaatggacctttcatctg 
taatcagtaaaattgatctacacaagtatctgactgtgaaagactatttgagagatattg 
atctaatctgtagtaatgccttagaatacaatccagatagagatcctggagatcgtctta 
ttaggcatagagcctgtgctttaagagatactgcctatgccataattaaagaagaacttg 
atgaagactttgagcagctctgtgaagaaattcaggaatctagaaagaaaagaggttgta 
gctcctccaaatatgccccgtcttactaccatgtgatgccaaagcaaaattccactcttg 
ttggtgataaaagatcagacccagagcagaatgaaaagctaaagacaccgagtactcctg 
tggcttgcagcactcctgctcagttgaagaggaaaattcgcaaaaagtcaaactggtact 
taggcaccataaaaaagcgaaggaagatttcacaggcaaaggatgatagccagaatgcca 
tagatcacaaaattgagagtgatacagaggaaactcaagacacaagtgtagatcataatg 
agaccggaaacacaggagagtcttcggtggaagaaaatgaaaaacagcaaaatgcctctg 
aaagcaaactggaattgagaaataattcaaatacttgtaatatagagaatgagcttgaag 
actctaggaagactacagcatgtacagaattgagagacaagattgcttgtaatggagatg 
cttctagctctcagataatacatatttctgatgaaaatgaaggaaaagaaatgtgtgttc 
tgcgaatgactcgagctagacgttcccaggtagaacagcagcagctcatcactgttgaaa 
aggctttggcaattctttctcagcctacaccctcacttgttgtggatcatgagcgattaa 
aaaatcttttgaagactgttgttaaaaaaagtcaaaactacaacatatttcagttggaaa 
atttgtatgcagtaatcagccaatgtatttatcggcatcgcaaggaccatgataaaacat 
cacttattcagaaaatggagcaagaggtagaaaacttcagttgttccagatgatgatgtc 
atggtatcgagtattctttatattcagttcctatttaagtcatttttgtcatgtccgcct 
aattgatgtagtatgaaaccctgcatctttaaggaaaagattaaaatagtaaaataaaag 
tatttaaactttcctgatatttatgtacatattaagataaatgtcatgtgtaagataact 
gataaatattggaactttgctagaacaagaccctgtagtaatagtaataatagttgaagt 
ttggccaactcttaataaagttattttggtaact 

SEQ ID NO: 477. >Z44462_T6 # TY Transcript # LN 5967 # Source Gene: Z44462 
# Encoded protein: Z4 44 62_P2 

gtcaaaacctgtcaaaggccaagtgctttatttccctccagtcaaggactgggcattttg 
catttgaatcctggatggcgggaagcagcaagcaggtgtcaacaggggtggggaggagac 
gccggaccccaccagccaatcagagcgcgcagaggcctcccttcccccgccgccgtcccg 
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cgccaaaattccaaacgggctacgcaccgctcccaccgtctccttccgccgagctcagcg 
agccggaactatgtgaagaaaaagaagtcagcgttttccccaggactctccgctcaggtc 
ctctatactgtcggcctgtgttcctctccacgttggcccggcctccttcgctctacaaat 
gggaggagagtttcgagcctgctgttctatggtggtggcgtgggcgggacagacggcgga 
gggattctgggcgcgcgcgctgcggccgctggcgcggagcttgtggcgccagaattcgga 
gcgcggaagagccagagctgcgagcgcctggagctggatctctctccggtcgcgcacgcc 
gaggccagtagggagagaagatggtggttctccgcagcagcttggagctgcacaaccact 
ccgcggcctcggccacgggctccttggacctgtccagtgacttcctcagtctggagcaca 
tcggccggaggcggctccgctcggccggcgcggcgcagaagaaacccgcggcgaccacag 
ccaaagcgggcgatgggtcatcagttaaggaagttgaaacctaccaccggacacgtgctt 
taagatctttgagaaaagatgcacagaattcttcagattctagttttgagaagaatgtgg 
aaataacggagcaacttgctaatggcaggcattttacaaggcagttggccagacagcagg 
ctgataaaaaaaaagaagagcacagagaagacaaagtgattccagttactcggtcattga 
gggctagaaacatcgttcaaagtacagaacacttacatgaagataatggtgatgttgaag 
tgcgtcgaagttgtaggattagaagtcgttatagtggtgtaaaccagtccatgctgtttg 
acaaacttataactaacactgctgaagctgtacttcaaaaaatggatgacatgaagaaga 
tgcgtagacagcgaatgagagaacttgaagacttgggagtgtttaatgaaacagaagaaa 
gcaatcttaatatgtacacaagaggaaaacagaaagatattcaaagaactgatgaagaaa 
caactgataatcaagaaggcagtgtggagtcatctgaagagggtgaagaccaagaacatg 
aagatgatggtgaagatgaagatgatgaagatgatgatgatgatgacgatgatgatgatg 
atgatgatgatgaagatgatgaagatgaagaagatggagaagaagagaatcagaagcgat 
attatcttagacagagaaaagctactgtttactatcaggctccattggaaaaacctcgtc 
accagagaaagcccaacatattttatagtggcccagcttctcctgcaagaccaagatacc 
gattatcttccgcaggaccaagaagtccttactgtaaacgaatgaacaggcgaaggcatg 
caatccacagtagtgactcgacttcatcttcctcctctgaagatgaacagcactttgaga 
ggcggaggaaaaggagtcgtaatagggctatcaataggtgcctcccactaaattttcgga 
aagatgaattaaaaggcatttataaagatcgaatgaaaattggagcaagccttgccgatg 
ttgatccaatgcaactagattcttcagtacgatttgatagtgttggtggcctgtctaatc 
atatagcagctctaaaagagatggtggtgtttccattactttatccagaagtctttgaaa 
aatttaaaattcaacccccaagaggttgtttgttttatgggccacctggaactggaaaga 
ctctggttgccagagcacttgccaatgagtgcagtcaaggggataaaagagtagcatttt 
tcatgaggaaaggtgctgattgtctaagtaaatgggtaggagaatctgaaagacagctac 
gattgctgtttgatcaggcctatcagatgcgcccatcaattattttttttgacgaaattg 
atggtctggctccagtacggtcaagcaggcaagatcagattcacagttctattgtttcca 
ccctgctagctcttatggatggattggacagcagaggggaaattgtggtcattggtgcta 
cgaacaggctagattctatagatcctgctttacgaaggcctggtcgctttgatagagaat 
tcctctttagcctgcctgataaagaggctcgaaaagagattctaaagattcacaccaggg 
attggaatcccaaaccactggacacatttttagaagagctagcagaaaactgtgttggat 
actgtggagcagatattaaatcaatatgtgctgaagctgctttatgtgctttacgacgac 
gctacccacagatctataccactagtgagaaactgcagttggatctctcttcaattaata 
tctcagctaaggatttcgaggtagctatgcaaaagatgataccagcctcccaaagagctg 
tgacatcacctgggcaggcactgtccaccgttgtgaaaccactcctgcaaaacactgttg 
acaagattttagaagccctgcagagagtatttccacatgcagaattcagaacaaataaaa 
cattagactcagatatttcttgtcctctgctagaaagtgacttggcttacagtgatgatg 
atgttccatcagtttatgaaaatggactttctcagaaatcttctcataaggcaaaagaca 
attttaattttcttcatttgaatagaaatgcttgttaccaacctatgtcttttcgaccaa 
gaatattgatagtaggagaaccaggatttgggcaaggttctcacttggcaccagctgtca 
ttcatgctttggaaaagtttactgtatatacattagacattcctgttctttttggagtta 
gtactacatcccctgaagaaacatgtgcccaggtgattcgtgaagctaagagaacagcac 
caagtatagtgtatgttcctcatatccacgtgtggtgggaaatagttggaccgacactta 
aagccacatttaccacattattacagaatattccttcatttgctccagttttactacttg 
caacttctgacaaaccccattccgctttgccagaagaggtgcaagaattgtttatccgtg 
attatggagagatttttaatgtccagttaccggataaagaagaacggacaaaattttttg 
aagatttaattctaaaacaagctgctaagcctcctatatcaaaaaagaaagcagttttgc 
aggctttggaggtactcccagtagcaccaccacctgagccaagatcactgacagcagaag 
aagtgaaacgactagaagaacaagaagaagatacatttagagaactgaggattttcttaa 
gaaatgttacacataggcttgctattgacaagcgattccgagtgtttactaagcctgttg 
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accctgatgaggttcctgattatgtcactgtaataaagcaaccaatggacctttcatctg 

taatcagtaaaattgatctacacaagtatctgactgtgaaagactatttgagagatattg 

atctaatctgtagtaatgccttagaatacaatccagatagagatcctggagatcgtctta 

ttaggcatagagcctgtgctttaagagatactgcctatgccataattaaagaagaacttg 

atgaagactttgagcagctctgtgaagaaattcaggaatctagaaagaaaagaggttgta 

gctcctccaaatatgccccgtcttactaccatgtgatgccaaagcaaaattccactcttg 

ttggtgataaaagatcagacccagagcagaatgaaaagctaaagacaccgagtactcctg 

tggcttgcagcactcctgctcagttgaagaggaaaattcgcaaaaagtcaaactggtact 

taggcaccataaaaaagcgaaggaagatttcacaggcaaaggatgatagccagaatgcca 

tagatcacaaaattgagagtgatacagaggaaactcaagacacaagtgtagatcataatg 

agaccggaaacacaggagagtcttcggtggaagaaaatgaaaaacagcaaaatgcctctg 

aaagcaaactggaattgagaaataattcaaatacttgtaatatagagaatgagcttgaag 

actctaggaagactacagcatgtacagaattgagagacaagattgcttgtaatggagatg 

cttctagctctcagataatacatatttctgatgaaaatgaaggaaaagaaatgtgtgttc i 

tgcgaatgactcgagctagacgttcccaggtagaacagcagcagctcatcactgttgaaa 

aggctttggcaattctttctcagcctacaccctcacttgttgtggatcatgagcgattaa 

aaaatcttttgaagactgttgttaaaaaaagtcaaaactacaacatatttcagttggaaa 

atttgtatgcagtaatcagccaatgtatttatcggcatcgcaaggaccatgataaaacat 

cacttattcaggtaaactcttgagtggaaacagttcctatttaagtcatttttgtcatgt 

ccgcctaattgatgtagtatgaaaccctgcatctttaaggaaaagattaaaatagtaaaa 

taaaagtatttaaactttcctgatatttatgtacatattaagataaatgtcatgtgtaag 

ataactgataaatattggaactttgctagaacaagaccctgtagtaatagtaataatagt 

tgaagtttggccaactcttaataaagttattttggtaactaatgttttatggcacttaag 

aataattagcagcgttaaattttgtttgtattaagcacttttaattttatccttcctaaa 

aatagtttattgtatctgacaagaaacttacttaaccattgtgtccttcccatctttttt 

gtcatctttgttttcttcaaatgccctcctcccatctgccttgagattcccttgtcttca 

cttaaaagccagagtgcaagtcatgatttgcgggagggctcttgaaccacttctggctgc 

accacaattctgtacttgagtatcacagtcattgtttttgagacaaacatttttataatt 

ctaatttgggttaataaagattttaaatatttcttggtttacttttgtaattatatacac 

aacaaatgtattaataactaccttgttaaacaccttttaatagcacaaggtttttatatt 

tgcaagctgttgatatctttctaaaactgtttaggttatagtctattgatactttttata 

tacaattttataaatataaatattataattttatattaatggtaccaaaaatacatttct 

taaggttaaaagcatgcacttccatgcatacttgcttttggggagagtggggagaagaca 

ttctaataatcagtttgtgaaatagcttctgttggaaaccttttgaggggaataaggaat 

ggtcatctaaaatgagagattctggattttaatgcagttcaaagttgagctgtatttttg 

ttgttgatttatctggattttttttaaagccttctaaaacccagtgaattcaatacctta 

attagtacatactatcttatgtaatgcataaagcaatgccagtcactgagaacatttaaa 

tatatttatattcctggagatacacattctcatttttgttggtttattataaattattct 

tctagatgcatcttttataactaggatttcattttgtgtgtatagcttatgtaataaatt 

ttaaaggtgaaaactctcttaaaaaaa 

SEQ ID NO:478. >Z44462_T7 # TY Transcript # LN 5967 # Source Gene: 244462 
# Encoded protein: Z4 44 62_P3 

gtcaaaacctgtcaaaggccaagtgctttatttccctccagtcaaggactgggcattttg 
catttgaatcctggatggcgggaagcagcaagcaggtgtcaacaggggtggggaggagac 
gccggaccccaccagccaatcagagcgcgcagaggcctcccttcccccgccgccgtcccg 
cgccaaaattccaaacgggctacgcaccgctcccaccgtctccttccgccgagctcagcg 
agccggaactatgtgaagaaaaagaagtcagcgttttccccaggactctccgctcaggtc 
ctctatactgtcggcctgtgttcctctccacgttggcccggcctccttcgctctacaaat 
gggaggagagtttcgagcctgctgttctatggtggtggcgtgggcgggacagacggcgga 
gggattctgggcgcgcgcgctgcggccgctggcgcggagcttgtggcgccagaattcgga 
gcgcggaagagccagagctgcgagcgcctggagctggatctctctccggtcgcgcacgcc 
gaggccagtagggagagaagatggtggttctccgcagcagcttggagctgcacaaccact 
ccgcggcctcggccacgggctccttggacctgtccagtgacttcctcagtctggagcaca 
tcggccggaggcggctccgctcggccggcgcggcgcagaagaaacccgcggcgaccacag 
ccaaagcgggcgatgggtcatcagttaaggaagttgaaacctaccaccggacacgtgctt 
taagatctttgagaaaagatgcacagaattcttcagattctagttttgagaagaatgtgg 



347/365 



Figure 1 (Cont'd) 



aaataacggagcaacttgctaatggcaggcattttacaaggcagttggccagacagcagg * 

ctgataaaaaaaaagaagagcacagagaagacaaagtgattccagttactcggtcattga 

gggctagaaacatcgttcaaagtacagaacacttacatgaagataatggtgatgttgaag 

tgcgtcgaagttgtaggattagaagtcgttatagtggtgtaaaccagtccatgctgtttg 

acaaacttataactaacactgctgaagctgtacttcaaaaaatggatgacatgaagaaga 

tgcgtagacagcgaatgagagaacttgaagacttgggagtgtttaatgaaacagaagaaa 

gcaatcttaatatgtacacaagaggaaaacagaaagatattcaaagaactgatgaagaaa 

caactgataatcaagaaggcagtgtggagtcatctgaagagggtgaagaccaagaacatg 

aagatgatggtgaagatgaagatgatgaagatgatgatgatgatgacgatgatgatgatg 

atgatgatgatgaagatgatgaagatgaagaagatggagaagaagagaatcagaagcgat 

attatcttagacagagaaaagctactgtttactatcaggctccattggaaaaacctcgtc 

accagagaaagcccaacatattttatagtggcccagcttctcctgcaagaccaagatacc 

gattatcttccgcaggaccaagaagtccttactgtaaacgaatgaacaggcgaaggcatg 

caatccacagtagtgactcgacttcatcttcctcctctgaagatgaacagcactttgaga 

ggcggaggaaaaggagtcgtaatagggctatcaataggtgcctcccactaaattttcgga 

aagatgaattaaaaggcatttataaagatcgaatgaaaattggagcaagccttgccgatg 

ttgatccaatgcaactagattcttcagtacgatttgatagtgttggtggcctgtctaatc 

atatagcagctctaaaagagatggtggtgtttccattactttatccagaagtctttgaaa 

aatttaaaattcaacccccaagaggttgtttgttttatgggccacctggaactggaaaga 

ctctggttgccagagcacttgccaatgagtgcagtcaaggggataaaagagtagcatttt 

tcatgaggaaaggtgctgattgtctaagtaaatgggtaggagaatctgaaagacagctac 

gattgctgtttgatcaggcctatcagatgcgcccatcaattattttttttgacgaaattg 

atggtctggctccagtacggtcaagcaggcaagatcagattcacagttctattgtttcca 

ccctgctagctcttatggatggattggacagcagaggggaaattgtggtcattggtgcta 

cgaacaggctagattctatagatcctgctttacgaaggcctggtcgctttgatagagaat 

tcctctttagcctgcctgataaagaggctcgaaaagagattctaaagattcacaccaggg 

attggaatcccaaaccactggacacatttttagaagagctagcagaaaactgtgttggat 

actgtggagcagatattaaatcaatatgtgctgaagctgctttatgtgctttacgacgac 

gctacccacagatctataccactagtgagaaactgcagttggatctctcttcaattaata 

tctcagctaaggatttcgaggtagctatgcaaaagatgataccagcctcccaaagagctg 

tgacatcacctgggcaggcactgtccaccgttgtgaaaccactcctgcaaaacactgttg 

acaagattttagaagccctgcagagagtatttccacatgcagaattcagaacaaataaaa 

cattagactcagatatttcttgtcctctgctagaaagtgacttggcttacagtgatgatg 

atgttccatcagtttatgaaaatggactttctcagaaatcttctcataaggcaaaagaca 

attttaattttcttcatttgaatagaaatgcttgttaccaacctatgtcttttcgaccaa 

gaatattgatagtaggagaaccaggatttgggcaaggttctcacttggcaccagctgtca 

ttcatgctttggaaaagtttactgtatatacattagacattcctgttctttttggagtta 

gtactacatcccctgaagaaacatgtgcccaggtgattcgtgaagctaagagaacagcac 

caagtatagtgtatgttcctcatatccacgtgtggtgggaaatagttggaccgacactta 

aagccacatttaccacattattacagaatattccttcatttgctccagttttactacttg 

caacttctgacaaaccccattccgctttgccagaagaggtgcaagaattgtttatccgtg 

attatggagagatttttaatgtccagttaccggataaagaagaacggacaaaattttttg 

aagatttaattctaaaacaagctgctaagcctcctatatcaaaaaagaaagcagttttgc 

aggctttggaggtactcccagtagcaccaccacctgagccaagatcactgacagcagaag 

aagtgaaacgactagaagaacaagaagaagatacatttagagaactgaggattttcttaa 

gaaatgttacacataggcttgctattgacaagcgattccgagtgtttactaagcctgttg 

accctgatgaggttcctgattatgtcactgtaataaagcaaccaatggacctttcatctg 

taatcagtctaatctgtagtaatgccttagaatacaatccagatagagatcctggagatc 

gtcttattaggcatagagcctgtgctttaagagatactgcctatgccataattaaagaag 

aacttgatgaagactttgagcagctctgtgaagaaattcaggaatctagaaagaaaagag 

gttgtagctcctccaaatatgccccgtcttactaccatgtgatgccaaagcaaaattcca 

ctcttgttggtgataaaagatcagacccagagcagaatgaaaagctaaagacaccgagta 

ctcctgtggcttgcagcactcctgctcagttgaagaggaaaattcgcaaaaagtcaaact 

ggtacttaggcaccataaaaaagcgaaggaagatttcacaggcaaaggatgatagccaga 

atgccatagatcacaaaattgagagtgatacagaggaaactcaagacacaagtgtagatc 

ataatgagaccggaaacacaggagagtcttcggtggaagaaaatgaaaaacagcaaaatg 

cctctgaaagcaaactggaattgagaaataattcaaatacttgtaatatagagaatgagc 
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Figure 1 (Cont'd) 



ttgaagactctaggaagactacagcatgtacagaattgagagacaagattgcttgtaatg 
gagatgcttctagctctcagataatacatatttctgatgaaaatgaaggaaaagaaatgt 
gtgttctgcgaatgactcgagctagacgttcccaggtagaacagcagcagctcatcactg 
ttgaaaaggctttggcaattctttctcagcctacaccctcacttgttgtggatcatgagc 
gattaaaaaatcttttgaagactgttgttaaaaaaagtcaaaactacaacatatttcagt 
tggaaaatttgtatgcagtaatcagccaatgtatttatcggcatcgcaaggaccatgata 
aaacatcacttattcagaaaatggagcaagaggtagaaaacttcagttgttccagatgat 
gatgtcatggtatcgagtattctttatattcagttcctatttaagtcatttttgtcatgt 
ccgcctaattgatgtagtatgaaaccctgcatctttaaggaaaagattaaaatagtaaaa 
taaaagtatttaaactttcctgatatttatgtacatattaagataaatgtcatgtgtaag 
ataactgataaatattggaactttgctagaacaagaccctgtagtaatagtaataatagt 
tgaagtttggccaactcttaataaagttattttggtaactaatgttttatggcacttaag 
aataattagcagcgttaaattttgtttgtattaagcacttttaattttatccttcctaaa 
aatagtttattgtatctgacaagaaacttacttaaccattgtgtccttcccatctttttt 
gtcatctttgttttcttcaaatgccctcctcccatctgccttgagattcccttgtcttca 
cttaaaagccagagtgcaagtcatgatttgcgggagggctcttgaaccacttctggctgc 
accacaattctgtacttgagtatcacagtcattgtttttgagacaaacatttttataatt 
ctaatttgggttaataaagattttaaatatttcttggtttacttttgtaattatatacac 
aacaaatgtattaataactaccttgttaaacaccttttaatagcacaaggtttttatatt 
tgcaagctgttgatatctttctaaaactgtttaggttatagtctattgatactttttata 
tacaattttataaatataaatattataattttatattaatggtaccaaaaatacatttct 
taaggttaaaagcatgcacttccatgcatacttgcttttggggagagtggggagaagaca 
ttctaataatcagtttgtgaaatagcttctgttggaaaccttttgaggggaataaggaat 
ggtcatctaaaatgagagattctggattttaatgcagttcaaagttgagctgtatttttg 
ttgttgatttatctggattttttttaaagccttctaaaacccagtgaattcaatacctta 
attagtacatactatcttatgtaatgcataaagcaatgccagtcactgagaacatttaaa 
tatatttatattcctggagatacacattctcatttttgttggtttattataaattattct 
tctagatgcatcttttataactaggatttcattttgtgtgtatagcttatgtaataaatt 
ttaaaggtgaaaactctcttaaaaaaa 

SEQ ID NO:479. >Z44462_T8 # TY Transcript # LN 6086 # Source Gene: Z44462 
# Encoded protein: Z44 4 62_P1 

gtcaaaacctgtcaaaggccaagtgctttatttccctccagtcaaggactgggcattttg 
catttgaatcctggatggcgggaagcagcaagcaggtgtcaacaggggtggggaggagac 
gccggaccccaccagccaatcagagcgcgcagaggcctcccttcccccgccgccgtcccg 
cgccaaaattccaaacgggctacgcaccgctcccaccgtctccttccgccgagctcagcg 
agccggaactatgtgaagaaaaagaagtcagcgttttccccaggactctccgctcaggtc 
ctctatactgtcggcctgtgttcctctccacgttggcccggcctccttcgctctacaaat 
gggaggagagtttcgagcctgctgttctatggtggtggcgtgggcgggacagacggcgga 
gggattctgggcgcgcgcgctgcggccgctggcgcggagcttgtggcgccagaattcgga 
gcgcggaagagccagagctgcgagcgcctggagctggatctctctccggtcgcgcacgcc 
gaggccagtagggagagaagatggtggttctccgcagcagcttggagctgcacaaccact 
ccgcggcctcggccacgggctccttggacctgtccagtgacttcctcagtctggagcaca 
tcggccggaggcggctccgctcggccggcgcggcgcagaagaaacccgcggcgaccacag 
ccaaagcgggcgatgggtcatcagttaaggaagttgaaacctaccaccggacacgtgctt 
taagatctttgagaaaagatgcacagaattcttcagattctagttttgagaagaatgtgg 
aaataacggagcaacttgctaatggcaggcattttacaaggcagttggccagacagcagg 
ctgataaaaaaaaagaagagcacagagaagacaaagtgattccagttactcggtcattga 
gggctagaaacatcgttcaaagtacagaacacttacatgaagataatggtgatgttgaag 
tgcgtcgaagttgtaggattagaagtcgttatagtggtgtaaaccagtccatgctgtttg 
acaaacttataactaacactgctgaagctgtacttcaaaaaatggatgacatgaagaaga 
tgcgtagacagcgaatgagagaacttgaagacttgggagtgtttaatgaaacagaagaaa 
gcaatcttaatatgtacacaagaggaaaacagaaagatattcaaagaactgatgaagaaa 
caactgataatcaagaaggcagtgtggagtcatctgaagagggtgaagaccaagaacatg 
aagatgatggtgaagatgaagatgatgaagatgatgatgatgatgacgatgatgatgatg 
atgatgatgatgaagatgatgaagatgaagaagatggagaagaagagaatcagaagcgat 
attatcttagacagagaaaagctactgtttactatcaggctccattggaaaaacctcgtc 
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Figur 1 (Cont'd) 



accagagaaagcccaacatattttatagtggcccagcttctcctgcaagaccaagatacc 
gattatcttccgcaggaccaagaagtccttactgtaaacgaatgaacaggcgaaggcatg 
caatccacagtagtgactcgacttcatcttcctcctctgaagatgaacagcactttgaga 
ggcggaggaaaaggagtcgtaatagggctatcaataggtgcctcccactaaattttcgga 
aagatgaattaaaaggcatttataaagatcgaatgaaaattggagcaagccttgccgatg 
ttgatccaatgcaactagattcttcagtacgatttgatagtgttggtggcctgtctaatc 
atatagcagctctaaaagagatggtggtgtttccattactttatccagaagtctttgaaa 
aatttaaaattcaacccccaagaggttgtttgttttatgggccacctggaactggaaaga 
ctctggttgccagagcacttgccaatgagtgcagtcaaggggataaaagagtagcatttt 
tcatgaggaaaggtgctgattgtctaagtaaatgggtaggagaatctgaaagacagctac 
gattgctgtttgatcaggcctatcagatgcgcccatcaattattttttttgacgaaattg 
atggtctggctccagtacggtcaagcaggcaagatcagattcacagttctattgtttcca 
ccctgctagctcttatggatggattggacagcagaggggaaattgtggtcattggtgcta 
cgaacaggctagattctatagatcctgctttacgaaggcctggtcgctttgatagagaat 
tcctctttagcctgcctgataaagaggctcgaaaagagattctaaagattcacaccaggg 
attggaatcccaaaccactggacacatttttagaagagctagcagaaaactgtgttggat 
actgtggagcagatattaaatcaatatgtgctgaagctgctttatgtgctttacgacgac 
gctacccacagatctataccactagtgagaaactgcagttggatctctcttcaattaata 
tctcagctaaggatttcgaggtagctatgcaaaagatgataccagcctcccaaagagctg 
tgacatcacctgggcaggcactgtccaccgttgtgaaaccactcctgcaaaacactgttg 
acaagattttagaagccctgcagagagtatttccacatgcagaattcagaacaaataaaa 
cattagactcagatatttcttgtcctctgctagaaagtgacttggcttacagtgatgatg 
atgttccatcagtttatgaaaatggactttctcagaaatcttctcataaggcaaaagaca 
attttaattttcttcatttgaatagaaatgcttgttaccaacctatgtcttttcgaccaa 
gaatattgatagtaggagaaccaggatttgggcaaggttctcacttggcaccagctgtca 
ttcatgctttggaaaagtttactgtatatacattagacattcctgttctttttggagtta 
gtactacatcccctgaagaaacatgtgcccaggtgattcgtgaagctaagagaacagcac 
caagtatagtgtatgttcctcatatccacgtgtggtgggaaatagttggaccgacactta 
aagccacatttaccacattattacagaatattccttcatttgctccagttttactacttg 
caacttctgacaaaccccattccgctttgccagaagaggtgcaagaattgtttatccgtg 
attatggagagatttttaatgtccagttaccggataaagaagaacggacaaaattttttg 
aagatttaattctaaaacaagctgctaagcctcctatatcaaaaaagaaagcagttttgc 
aggctttggaggtactcccagtagcaccaccacctgagccaagatcactgacagcagaag 
aagtgaaacgactagaagaacaagaagaagatacatttagagaactgaggattttcttaa 
gaaatgttacacataggcttgctattgacaagcgattccgagtgtttactaagcctgttg 
accctgatgaggttcctgattatgtcactgtaataaagcaaccaatggacctttcatctg 
taatcagtaaaattgatctacacaagtatctgactgtgaaagactatttgagagatattg 
atctaatctgtagtaatgccttagaatacaatccagatagagatcctggagatcgtctta 
ttaggcatagagcctgtgctttaagagatactgcctatgccataattaaagaagaacttg 
atgaagactttgagcagctctgtgaagaaattcaggaatctagaaagaaaagaggttgta 
gctcctccaaatatgccccgtcttactaccatgtgatgccaaagcaaaattccactcttg 
ttggtgataaaagatcagacccagagcagaatgaaaagctaaagacaccgagtactcctg 
tggcttgcagcactcctgctcagttgaagaggaaaattcgcaaaaagtcaaactggtact 
taggcaccataaaaaagcgaaggaagatttcacaggcaaaggatgatagccagaatgcca 
tagatcacaaaattgagagtgatacagaggaaactcaagacacaagtgtagatcataatg 
agaccggaaacacaggagagtcttcggtggaagaaaatgaaaaacagcaaaatgcctctg 
aaagcaaactggaattgagaaataattcaaatacttgtaatatagagaatgagcttgaag 
actctaggaagactacagcatgtacagaattgagagacaagattgcttgtaatggagatg 
cttctagctctcagataatacatatttctgatgaaaatgaaggaaaagaaatgtgtgttc 
tgcgaatgactcgagctagacgttcccaggtagaacagcagcagctcatcactgttgaaa 
aggctttggcaattctttctcagcctacaccctcacttgttgtggatcatgagcgattaa 
aaaatcttttgaagactgttgttaaaaaaagtcaaaactacaacatatttcagttggaaa 
atttgtatgcagtaatcagccaatgtatttatcggcatcgcaaggaccatgataaaacat 
cacttattcagaaaatggagcaagaggtagaaaacttcagttgttccagatgatgatgtc 
atggtatcgagtattctttatattcagttcctatttaagtcatttttgtcatgtccgcct 
aattgatgtagtatgaaaccctgcatctttaaggaaaagattaaaatagtaaaataaaag 
tatttaaactttccttgantaatttaatggtagcattatttaaggattaaaattgtccat 
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Figur 1 (Cont'd) 



tggtgcttaaagaataaagcttgacttaaaagtaagtttggcggaaaacctttttggggc 
cttaaggaaaaaaccaaaaaggaaaacccccttgggtaagtagtaatagtaataatagtt 
gaagtttggccaactcttaataaagttattttggtaactaatgttttatggcacttaaga 
ataattagcagcgttaaattttgtttgtattaagcacttttaattttatccttcctaaaa 
atagtttattgtatctgacaagaaacttacttaaccattgtgtccttcccatcttttttg 
tcatctttgttttcttcaaatgccctcctcccatctgccttgagattcccttgtcttcac 
ttaaaagccagagtgcaagtcatgatttgcgggagggctcttgaaccacttctggctgca 
ccacaattctgtacttgagtatcacagtcattgtttttgagacaaacatttttataattc 
taatttgggttaataaagattttaaatatttcttggtttacttttgtaattatatacaca 
acaaatgtattaataactaccttgttaaacaccttttaatagcacaaggtttttatattt 
gcaagctgttgatatctttctaaaactgtttaggttatagtctattgatactttttatat 
acaattttataaatataaatattataattttatattaatggtaccaaaaatacatttctt 
aaggttaaaagcatgcacttccatgcatacttgcttttggggagagtggggagaagacat 
tctaataatcagtttgtgaaatagcttctgttggaaaccttttgaggggaataaggaatg 
gtcatctaaaatgagagattctggattttaatgcagttcaaagttgagctgtatttttgt 
tgttgatttatctggattttttttaaagccttctaaaacccagtgaattcaataccttaa 
ttagtacatactatcttatgtaatgcataaagcaatgccagtcactgagaacatttaaat 
atatttatattcctggagatacacattctcatttttgttggtttattataaattattctt 
ctagatgcatcttttataactaggatttcattttgtgtgtatagcttatgtaataaattt 
taaaggtgaaaactctcttaaaaaaa 

SEQ ID NO: 480 . >Z44462JT9 # TY Transcript # LN 5529 # Source Gene: Z44462 
# Encoded protein: Z444 62_P9 

cttgttatgaactataaaccagtaaatcttatattagctgcttcatagacattgctgata 
tttgtttatcatgaaatggtttttctagatactaacttgtagtttcttggggagatacaa 
aatgattctgcacaaaaagcagtttttactaacttgtaatcagagaatactctttggagg 
aatttaaattggagttttgtaaagaaaaacttagatggctttcttaatttttcttaaagg 
atgggtcatcagttaaggaagttgaaacctaccaccggacacgtgctttaagatctttga 
gaaaagatgcacagaattcttcagattctagttttgagaagaatgtggaaataacggagc 
aacttgctaatggcaggcattttacaaggcagttggccagacagcaggctgataaaaaaa 
aagaagagcacagagaagacaaagtgattccagttactcggtcattgagggctagaaaca 
tcgttcaaagtacagaacacttacatgaagataatggtgatgttgaagtgcgtcgaagtt 
gtaggattagaagtcgttatagtggtgtaaaccagtccatgctgtttgacaaacttataa 
ctaacactgctgaagctgtacttcaaaaaatggatgacatgaagaagatgcgtagacagc 
gaatgagagaacttgaagacttgggagtgtttaatgaaacagaagaaagcaatcttaata 
tgtacacaagaggaaaacagaaagatattcaaagaactgatgaagaaacaactgataatc 
aagaaggcagtgtggagtcatctgaagagggtgaagaccaagaacatgaagatgatggtg 
aagatgaagatgatgaagatgatgatgatgatgacgatgatgatgatgatgatgatgatg 
aagatgatgaagatgaagaagatggagaagaagagaatcagaagcgatattatcttagac 
agagaaaagctactgtttactatcaggctccattggaaaaacctcgtcaccagagaaagc 
ccaacatattttatagtggcccagcttctcctgcaagaccaagataccgattatcttccg 
caggaccaagaagtccttactgtaaacgaatgaacaggcgaaggcatgcaatccacagta 
gtgactcgacttcatcttcctcctctgaagatgaacagcactttgagaggcggaggaaaa 
ggagtcgtaatagggctatcaataggtgcctcccactaaattttcggaaagatgaattaa 
aaggcatttataaagatcgaatgaaaattggagcaagccttgccgatgttgatccaatgc 
aactagattcttcagtacgatttgatagtgttggtggcctgtctaatcatatagcagctc 
taaaagagatggtggtgtttccattactttatccagaagtctttgaaaaatttaaaattc 
aacccccaagaggttgtttgttttatgggccacctggaactggaaagactctggttgcca 
gagcacttgccaatgagtgcagtcaaggggataaaagagtagcatttttcatgaggaaag 
gtgctgattgtctaagtaaatgggtaggagaatctgaaagacagctacgattgctgtttg 
atcaggcctatcagatgcgcccatcaattattttttttgacgaaattgatggtctggctc 
cagtacggtcaagcaggcaagatcagattcacagttctattgtttccaccctgctagctc 
ttatggatggattggacagcagaggggaaattgtggtcattggtgctacgaacaggctag 
attctatagatcctgctttacgaaggcctggtcgctttgatagagaattcctctttagcc 
tgcctgataaagaggctcgaaaagagattctaaagattcacaccagggattggaatccca 
aaccactggacacatttttagaagagctagcagaaaactgtgttggatactgtggagcag 
atattaaatcaatatgtgctgaagctgctttatgtgctttacgacgacgctacccacaga 
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Figure 1 (Cont'd) 



tctataccactagtgagaaactgcagttggatctctcttcaattaatatctcagctaagg 
atttcgaggtagctatgcaaaagatgataccagcctcccaaagagctgtgacatcacctg 
ggcaggcactgtccaccgttgtgaaaccactcctgcaaaacactgttgacaagattttag 
aagccctgcagagagtatttccacatgcagaattcagaacaaataaaacattagactcag 
atatttcttgtcctctgctagaaagtgacttggcttacagtgatgatgatgttccatcag 
tttatgaaaatggactttctcagaaatcttctcataaggcaaaagacaattttaattttc 
ttcatttgaatagaaatgcttgttaccaacctatgtcttttcgaccaagaatattgatag 
taggagaaccaggatttgggcaaggttctcacttggcaccagctgtcattcatgctttgg 
aaaagtttactgtatatacattagacattcctgttctttttggagttagtactacatccc 
ctgaagaaacatgtgcccaggtgattcgtgaagctaagagaacagcaccaagtatagtgt 
atgttcctcatatccacgtgtggtgggaaatagttggaccgacacttaaagccacattta 
ccacattattacagaatattccttcatttgctccagttttactacttgcaacttctgaca 
aaccccattccgctttgccagaagaggtgcaagaattgtttatccgtgattatggagaga 
tttttaatgtccagttaccggataaagaagaacggacaaaattttttgaagatttaattc 
taaaacaagctgctaagcctcctatatcaaaaaagaaagcagttttgcaggctttggagg 
tactcccagtagcaccaccacctgagccaagatcactgacagcagaagaagtgaaacgac 
tagaagaacaagaagaagatacatttagagaactgaggattttcttaagaaatgttacac 
ataggcttgctattgacaagcgattccgagtgtttactaagcctgttgaccctgatgagg 
ttcctgattatgtcactgtaataaagcaaccaatggacctttcatctgtaatcagtaaaa 
ttgatctacacaagtatctgactgtgaaagactatttgagagatattgatctaatctgta 
gtaatgccttagaatacaatccagatagagatcctggagatcgtcttattaggcatagag 
cctgtgctttaagagatactgcctatgccataattaaagaagaacttgatgaagactttg 
agcagctctgtgaagaaattcaggaatctagaaagaaaagaggttgtagctcctccaaat 
atgccccgtcttactaccatgtgatgccaaagcaaaattccactcttgttggtgataaaa 
gatcagacccagagcagaatgaaaagctaaagacaccgagtactcctgtggcttgcagca 
ctcctgctcagttgaagaggaaaattcgcaaaaagtcaaactggtacttaggcaccataa 
aaaagcgaaggaagatttcacaggcaaaggatgatagccagaatgccatagatcacaaaa 
ttgagagtgatacagaggaaactcaagacacaagtgtagatcataatgagaccggaaaca 
caggagagtcttcggtggaagaaaatgaaaaacagcaaaatgcctctgaaagcaaactgg 
aattgagaaataattcaaatacttgtaatatagagaatgagcttgaagactctaggaaga 
ctacageatgtacagaattgagagacaagattgcttgtaatggagatgcttctagctctc 
agataatacatatttctgatgaaaatgaaggaaaagaaatgtgtgttctgcgaatgactc 
gagctagacgttcccaggtagaacagcagcagctcatcactgttgaaaaggctttggcaa 
ttctttctcagcctacaccctcacttgttgtggatcatgagcgattaaaaaatcttttga 
agactgttgttaaaaaaagtcaaaactacaacatatttcagttggaaaatttgtatgcag 
taatcagccaatgtatttatcggcatcgcaaggaccatgataaaacatcacttattcaga 
aaatggagcaagaggtagaaaacttcagttgttccagatgatgatgtcatggtatcgagt 
attctttatattcagttcctatttaagtcatttttgtcatgtccgcctaattgatgtagt 
atgaaaccctgcatctttaaggaaaagattaaaatagtaaaataaaagtatttaaacttt 
cctgatatttatgtacatattaagataaatgtcatgtgtaagataactgataaatattgg 
aactttgctagaacaagaccctgtagtaatagtaataatagttgaagtttggccaactct 
taataaagttattttggtaactaatgttttatggcacttaagaataattagcagcgttaa 
attttgtttgtattaagcacttttaattttatccttcctaaaaatagtttattgtatctg 
acaagaaacttacttaaccattgtgtccttcccatcttttttgtcatctttgttttcttc 
aaatgccctcctcccatctgccttgagattcccttgtcttcacttaaaagccagagtgca 
agtcatgatttgcgggagggctcttgaaccacttctggctgcaccacaattctgtacttg 
agtatcacagtcattgtttttgagacaaacatttttataattctaatttgggttaataaa 
gattttaaatatttcttggtttacttttgtaattatatacacaacaaatgtattaataac 
taccttgttaaacaccttttaatagcacaaggtttttatatttgcaagctgttgatatct 
ttctaaaactgtttaggttatagtctattgatactttttatatacaattttataaatata 
aatattataattttatattaatggtaccaaaaatacatttcttaaggttaaaagcatgca 
cttccatgcatacttgcttttggggagagtggggagaagacattctaataatcagtttgt 
gaaatagcttctgttggaaaccttttgaggggaataaggaatggtcatctaaaatgagag 
attctggattttaatgcagttcaaagttgagctgtatttttgttgttgatttatctggat 
tttttttaaagccttctaaaacccagtgaattcaataccttaattagtacatactatctt 
atgtaatgcataaagcaatgccagtcactgagaacatttaaatatatttatattcctgga 
gatacacattctcatttttgttggtttattataaattattcttctagatgcatcttttat 
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Figure 1 (Cont'd) 



aactaggatttcattttgtgtgtatagcttatgtaataaattttaaaggtgaaaactctc 
ttaaaaaaa 

SEQ ID NO: 481. >Z44462_T10 # TY Transcript # LN 4673 # Source Gene: Z44462 
# Encoded protein: Z4 4 4 62_P9 

gaggtggcgaaggggctagcgtcaggggctggcggccgtgatgcgctttgggtggaaagc 
cgcgtgaagggagcacggcgcagccagctcctgtaagtgactggaaccaggaacagagga 
cagtgagtcgctctcgtccctccaggtctgctgagaacagacccagtccctgaggaagga 
gaagatgttgctgccacgatcagtgccacagagaccctctcggaagaggagcaggaagag 
ctaagaagagaacttgcaaaggcgaaggcatgcaatccacagtagtgactcgacttcatc 
ttcctcctctgaagatgaacagcactttgagaggcggaggaaaaggagtcgtaatagggc 
tatcaataggtgcctcccactaaattttcggaaagatgaattaaaaggcatttataaaga 
tcgaatgaaaattggagcaagccttgccgatgttgatccaatgcaactagattcttcagt 
acgatttgatagtgttggtggcctgtctaatcatatagcagctctaaaagagatggtggt 
gtttccattactttatccagaagtctttgaaaaatttaaaattcaacccccaagaggttg 
tttgttttatgggccacctggaactggaaagactctggttgccagagcacttgccaatga 
gtgcagtcaaggggataaaagagtagcatttttcatgaggaaaggtgctgattgtctaag 
taaatgggtaggagaatctgaaagacagctacgattgctgtttgatcaggcctatcagat 
gcgcccatcaattattttttttgacgaaattgatggtctggctccagtacggtcaagcag 
gcaagatcagattcacagttctattgtttccaccctgctagctcttatggatggattgga 
cagcagaggggaaattgtggtcattggtgctacgaacaggctagattctatagatcctgc 
tttacgaaggcctggtcgctttgatagagaattcctctttagcctgcctgataaagaggc 
tcgaaaagagattctaaagattcacaccagggattggaatcccaaaccactggacacatt 
tttagaagagctagcagaaaactgtgttggatactgtggagcagatattaaatcaatatg 
tgctgaagctgctttatgtgctttacgacgacgctacccacagatctataccactagtga 
gaaactgcagttggatctctcttcaattaatatctcagctaaggatttcgaggtagctat 
gcaaaagatgataccagcctcccaaagagctgtgacatcacctgggcaggcactgtccac 
cgttgtgaaaccactcctgcaaaacactgttgacaagattttagaagccctgcagagagt 
atttccacatgcagaattcagaacaaataaaacattagactcagatatttcttgtcctct 
gctagaaagtgacttggcttacagtgatgatgatgttccatcagtttatgaaaatggact 
ttctcagaaatcttctcataaggcaaaagacaattttaattttcttcatttgaatagaaa 
tgcttgttaccaacctatgtcttttcgaccaagaatattgatagtaggagaaccaggatt 
tgggcaaggttctcacttggcaccagctgtcattcatgctttggaaaagtttactgtata 
tacattagacattcctgttctttttggagttagtactacatcccctgaagaaacatgtgc 
ccaggtgattcgtgaagctaagagaacagcaccaagtatagtgtatgttcctcatatcca 
cgtgtggtgggaaatagttggaccgacacttaaagccacatttaccacattattacagaa 
tattccttcatttgctccagttttactacttgcaacttctgacaaaccccattccgcttt 
gccagaagaggtgcaagaattgtttatccgtgattatggagagatttttaatgtccagtt 
accggataaagaagaacggacaaaattttttgaagatttaattctaaaacaagctgctaa 
gcctcctatatcaaaaaagaaagcagttttgcaggctttggaggtactcccagtagcacc 
accacctgagccaagatcactgacagcagaagaagtgaaacgactagaagaacaagaaga 
agatacatttagagaactgaggattttcttaagaaatgttacacataggcttgctattga 
caagcgattccgagtgtttactaagcctgttgaccctgatgaggttcctgattatgtcac 
tgtaataaagcaaccaatggacctttcatctgtaatcagtaaaattgatctacacaagta 
tctgactgtgaaagactatttgagagatattgatctaatctgtagtaatgccttagaata 
caatccagatagagatcctggagatcgtcttattaggcatagagcctgtgctttaagaga 
tactgcctatgccataattaaagaagaacttgatgaagactttgagcagctctgtgaaga 
aattcaggaatctagaaagaaaagaggttgtagctcctccaaatatgccccgtcttacta 
ccatgtgatgccaaagcaaaattccactcttgttggtgataaaagatcagacccagagca 
gaatgaaaagctaaagacaccgagtactcctgtggcttgcagcactcctgctcagttgaa 
gaggaaaattcgcaaaaagtcaaactggtacttaggcaccataaaaaagcgaaggaagat 
ttcacaggcaaaggatgatagccagaatgccatagatcacaaaattgagagtgatacaga 
ggaaactcaagacacaagtgtagatcataatgagaccggaaacacaggagagtcttcggt 
ggaagaaaatgaaaaacagcaaaatgcctctgaaagcaaactggaattgagaaataattc 
aaatacttgtaatatagagaatgagcttgaagactctaggaagactacagcatgtacaga 
attgagagacaagattgcttgtaatggagatgcttctagctctcagataatacatatttc 
tgatgaaaatgaaggaaaagaaatgtgtgttctgcgaatgactcgagctagacgttccca 
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Figure 1 (Cont'd) 



ggtagaacagcagcagctcatcactgttgaaaaggctttggcaattctttctcagcctac 
accctcacttgttgtggatcatgagcgattaaaaaatcttttgaagactgttgttaaaaa 
aagtcaaaactacaacatatttcagttggaaaatttgtatgcagtaatcagccaatgtat 
ttatcggcatcgcaaggaccatgataaaacatcacttattcagaaaatggagcaagaggt 
agaaaacttcagttgttccagatgatgatgtcatggtatcgagtattctttatattcagt 
tcctatttaagtcatttttgtcatgtccgcctaattgatgtagtatgaaaccctgcatct 
ttaaggaaaagattaaaatagtaaaataaaagtatttaaactttcctgatatttatgtac 
atattaagataaatgtcatgtgtaagataactgataaatattggaactttgctagaacaa 
gaccctgtagtaatagtaataatagttgaagtttggccaactcttaataaagttattttg 
gtaactaatgttttatggcacttaagaataattagcagcgttaaattttgtttgtattaa 
gcacttttaattttatccttcctaaaaatagtttattgtatctgacaagaaacttactta 
accattgtgtccttcccatcttttttgtcatctttgttttcttcaaatgccctcctccca 
tctgccttgagattcccttgtcttcacttaaaagccagagtgcaagtcatgatttgcggg 
agggctcttgaaccacttctggctgcaccacaattctgtacttgagtatcacagtcattg 
tttttgagacaaacatttttataattctaatttgggttaataaagattttaaatatttct 
tggtttacttttgtaattatatacacaacaaatgtattaataactaccttgttaaacacc 
ttttaatagcacaaggtttttatatttgcaagctgttgatatctttctaaaactgtttag 
gttatagtctattgatactttttatatacaattttataaatataaatattataattttat 
attaatggtaccaaaaatacatttcttaaggttaaaagcatgcacttccatgcatacttg 
cttttggggagagtggggagaagacattctaataatcagtttgtgaaatagcttctgttg 
gaaaccttttgaggggaataaggaatggtcatctaaaatgagagattctggattttaatg 
cagttcaaagttgagctgtatttttgttgttgatttatctggattttttttaaagccttc 
taaaacccagtgaattcaataccttaattagtacatactatcttatgtaatgcataaagc 
aatgccagtcactgagaacatttaaatatatttatattcctggagatacacattctcatt 
tttgttggtttattataaat tat tcttctagatgcatcttttataactaggatt teat tt 
tgtgtgtatagcttatgtaataaattttaaaggtgaaaactctcttaaaaaaa 

SEQ ID NO:482. >Z44462_T11 # TY Transcript # LN 4116 # Source Gene: Z44462 
# Encoded protein: Z44 4 62_P9 

tttgatgttatatattgtttgagaactcttaagttcctttgtgaaattatgctttataaa 
atgtcttctaggattcatttgtggggggtctaacaaccgtctgttttccattgatttcaa 
tttctgtgggccatttgggatgttattttatagatttttttaatccattggaaatagtga 
ggtgccctgagaaaatttattcacatgtaatgcgttaaatcttacaaaagaagaaatatt 
actgaatgatatagactgtgttcatagtattaatctcttctctcttatttttaaaattta 
gttctattgtttccaccctgctagctcttatggatggattggacagcagaggggaaattg 
tggtcattggtgctacgaacaggctagattctatagatcctgctttacgaaggcctggtc 
gctttgatagagaattcctctttagcctgcctgataaagaggctcgaaaagagattctaa 
agattcacaccagggattggaatcccaaaccactggacacatttttagaagagctagcag 
aaaactgtgttggatactgtggagcagatattaaatcaatatgtgctgaagctgctttat 
gtgctttacgacgacgctacccacagatctataccactagtgagaaactgcagttggatc 
tctcttcaattaatatctcagctaaggatttcgaggtagctatgcaaaagatgataccag 
cctcccaaagagctgtgacatcacctgggcaggcactgtccaccgttgtgaaaccactcc 
tgcaaaacactgttgacaagattttagaagccctgcagagagtatttccacatgcagaat 
tcagaacaaataaaacattagactcagatatttcttgtcctctgctagaaagtgacttgg 
cttacagtgatgatgatgttccatcagtttatgaaaatggactttctcagaaatcttctc 
ataaggcaaaagacaattttaattttcttcatttgaatagaaatgcttgttaccaaccta 
tgtcttttcgaccaagaatattgatagtaggagaaccaggatttgggcaaggttctcact 
tggcaccagctgtcattcatgctttggaaaagtttactgtatatacattagacattcctg 
ttctttttggagttagtactacatcccctgaagaaacatgtgcccaggtgattcgtgaag 
ctaagagaacagcaccaagtatagtgtatgttcctcatatccacgtgtggtgggaaatag 
ttggaccgacacttaaagccacatttaccacattattacagaatattccttcatttgctc 
cagttttactacttgcaacttctgacaaaccccattccgctttgccagaagaggtgcaag 
aattgtttatccgtgattatggagagatttttaatgtccagttaccggataaagaagaac 
ggacaaaattttttgaagatttaattctaaaacaagctgctaagcctcctatatcaaaaa 
agaaagcagttttgcaggctttggaggtactcccagtagcaccaccacctgagccaagat 
cactgacagcagaagaagtgaaacgactagaagaacaagaagaagatacatttagagaac 
tgaggattttcttaagaaatgttacacataggcttgctattgacaagcgattccgagtgt 
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Figure 1 (Cont'd) 



ttactaagcctgttgaccctgatgaggttcctgattatgtcactgtaataaagcaaccaa 
tggacctttcatctgtaatcagtaaaattgatctacacaagtatctgactgtgaaagact 
atttgagagatattgatctaatctgtagtaatgccttagaatacaatccagatagagatc 
ctggagatcgtcttattaggcatagagcctgtgctttaagagatactgcctatgccataa 
ttaaagaagaacttgatgaagactttgagcagctctgtgaagaaattcaggaatctagaa 
agaaaagaggttgtagctcctccaaatatgccccgtcttactaccatgtgatgccaaagc 
aaaattccactcttgttggtgataaaagatcagacccagagcagaatgaaaagctaaaga 
caccgagtactcctgtggcttgcagcactcctgctcagttgaagaggaaaattcgcaaaa 
agtcaaactggtacttaggcaccataaaaaagcgaaggaagatttcacaggcaaaggatg 
atagccagaatgccatagatcacaaaattgagagtgatacagaggaaactcaagacacaa 
gtgtagatcataatgagaccggaaacacaggagagtcttcggtggaagaaaatgaaaaac 
agcaaaatgcctctgaaagcaaactggaattgagaaataattcaaatacttgtaatatag 
agaatgagcttgaagactctaggaagactacagcatgtacagaattgagagacaagattg 
cttgtaatggagatgcttctagctctcagataatacatatttctgatgaaaatgaaggaa 
aagaaatgtgtgttctgcgaatgactcgagctagacgttcccaggtagaacagcagcagc 
tcatcactgttgaaaaggctttggcaattctttctcagcctacaccctcacttgttgtgg 
atcatgagcgattaaaaaatcttttgaagactgttgttaaaaaaagtcaaaactacaaca 
tatttcagttggaaaatttgtatgcagtaatcagccaatgtatttatcggcatcgcaagg 
accatgataaaacatcacttattcagaaaatggagcaagaggtagaaaacttcagttgtt 
ccagatgatgatgtcatggtatcgagtattctttatattcagttcctatttaagtcattt 
ttgtcatgtccgcctaattgatgtagtatgaaaccctgcatctttaaggaaaagattaaa 
atagtaaaataaaagtatttaaactttcctgatatttatgtacatattaagataaatgtc 
atgtgtaagataactgataaatattggaactttgctagaacaagaccctgtagtaatagt 
aataatagttgaagtttggccaactcttaataaagttattttggtaactaatgttttatg 
gcacttaagaataattagcagcgttaaattttgtttgtattaagcacttttaattttatc 
cttcctaaaaatagtttattgtatctgacaagaaacttacttaaccattgtgtccttccc 
atcttttttgtcatctttgttttcttcaaatgccctcctcccatctgccttgagattccc 
ttgtcttcacttaaaagccagagtgcaagtcatgatttgcgggagggctcttgaaccact 
tctggctgcaccacaattctgtacttgagtatcacagtcattgtttttgagacaaacatt 
tttataattctaatttgggttaataaagattttaaatatttcttggtttacttttgtaat 
tatatacacaacaaatgtattaataactaccttgttaaacaccttttaatagcacaaggt 
ttttatatttgcaagctgttgatatctttctaaaactgtttaggttatagtctattgata 
ctttttatatacaattttataaatataaatattataattttatattaatggtaccaaaaa 
tacatttcttaaggttaaaagcatgcacttccatgcatacttgcttttggggagagtggg 
gagaagacattctaataatcagtttgtgaaatagcttctgttggaaaccttttgagggga 
ataaggaatggtcatctaaaatgagagattctggattttaatgcagttcaaagttgagct 
gtatttttgttgttgatttatctggattttttttaaagccttctaaaacccagtgaattc 
aataccttaattagtacatactatcttatgtaatgcataaagcaatgccagtcactgaga 
acatttaaatatatttatattcctggagatacacattctcatttttgttggtttattata 
aattattcttctagatgcatcttttataactaggatttcattttgtgtgtatagcttatg 
taataaattttaaaggtgaaaactctcttaaaaaaa 

SEQ ID NO: 483. >Z44462_T12 # TY Transcript # LN 4421 # Source Gene: Z44462 
# Encoded protein: 24 44 62_P7 

gtcaaaacctgtcaaaggccaagtgctttatttccctccagtcaaggactgggcattttg 
catttgaatcctggatggcgggaagcagcaagcaggtgtcaacaggggtggggaggagac 
gccggaccccaccagccaatcagagcgcgcagaggcctcccttcccccgccgccgtcccg 
cgccaaaattccaaacgggctacgcaccgctcccaccgtctccttccgccgagctcagcg 
agccggaactatgtgaagaaaaagaagtcagcgttttccccaggactctccgctcaggtc 
ctctatactgtcggcctgtgttcctctccacgttggcccggcctccttcgctctacaaat 
gggaggagagtttcgagcctgctgttctatggtggtggcgtgggcgggacagacggcgga 
gggattctgggcgcgcgcgctgcggccgctggcgcggagcttgtggcgccagaattcgga 
gcgcggaagagccagagctgcgagcgcctggagctggatctctctccggtcgcgcacgcc 
gaggccagtagggagagaagatggtggttctccgcagcagcttggagctgcacaaccact 
ccgcggcctcggccacgggctccttggacctgtccagtgacttcctcagtctggagcaca 
tcggccggaggcggctccgctcggccggcgcggcgcagaagaaacccgcggcgaccacag 
ccaaagcgggcgatgggtcatcagttaaggaagttgaaacctaccaccggacacgtgctt 
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Figur 1 (Cont'd) 



taagatctttgagaaaagatgcacagaattcttcagattctagttttgagaagaatgtgg 
aaataacggagcaacttgctaatggcaggcattttacaaggcagttggccagacagcagg 
ctgataaaaaaaaagaagagcacagagaagacaaagtgattccagttactcggtcattga 
gggctagaaacatcgttcaaagtacagaacacttacatgaagataatggtgatgttgaag 
tgcgtcgaagttgtaggattagaagtcgttatagtggtgtaaaccagtccatgctgtttg 
acaaacttataactaacactgctgaagctgtacttcaaaaaatggatgacatgaagaaga 
tgcgtagacagcgaatgagagaacttgaagacttgggagtgtttaatgaaacagaagaaa 
gcaatcttaatatgtacacaagaggaaaacagaaagatattcaaagaactgatgaagaaa 
caactgataatcaagaaggcagtgtggagtcatctgaagagggtgaagaccaagaacatg 
aagatgatggtgaagatgaagatgatgaagatgatgatgatgatgacgatgatgatgatg 
atgatgatgatgaagatgatgaagatgaagaagatggagaagaagagaatcagaagcgat 
attatcttagacagagaaaagctactgtttactatcaggctccattggaaaaacctcgtc 
accagagaaagcccaacatattttatagtggcccagcttctcctgcaagaccaagatacc 
gattatcttccgcaggaccaagaagtccttactgtaaacgaatgaacaggcgaaggcatg 
caatccacagtagtgactcgacttcatcttcctcctctgaagatgaacagcactttgaga 
ggcggaggaaaaggagtcgtaatagggctatcaataggtgcctcccactaaattttcgga 
aagatgaattaaaaggcatttataaagatcgaatgaaaattggagcaagccttgccgatg 
ttgatccaatgcaactagattcttcagtacgatttgatagtgttggtggcctgtctaatc 
atatagcagctctaaaagagatggtggtgtttccattactttatccagaagtctttgaaa 
aatttaaaattcaacccccaagaggttgtttgttttatgggccacctggaactggaaaga 
ctctggttgccagagcacttgccaatgagtgcagtcaaggggataaaagagtagcatttt 
tcatgaggaaaggtgctgattgtctaagtaaatgggtaggagaatctgaaagacagctac 
gattgctgtttgatcaggcctatcagatgcgcccatcaattattttttttgacgaaattg 
atggtctggctccagtacggtcaagcaggcaagatcagattcacagttctattgtttcca 
ccctgctagctcttatggatggattggacagcagaggggaaattgtggtcattggtgcta 
cgaacaggctagattctatagatcctgctttacgaaggcctggtcgctttgatagagaat 
tcctctttagcctgcctgataaagaggctcgaaaagagattctaaagattcacaccaggg 
attggaatcccaaaccactggacacatttttagaagagctagcagaaaactgtgttggat 
actgtggagcagatattaaatcaatatgtgctgaagctgctttatgtgctttacgacgac 
gctacccacagatctataccactagtgagaaactgcagttggatctctcttcaattaata 
tctcagctaaggatttcgaggtagctatgcaaaagatgataccagcctcccaaagagctg 
tgacatcacctgggcaggcactgtccaccgttgtgaaaccactcctgcaaaacactgttg 
acaagattttagaagccctgcagagagtatttccacatgcagaattcagaacaaataaaa 
cattagactcagatatttcttgtcctctgctagaaagtgacttggcttacagtgatgatg 
atgttccatcagtttatgaaaatggactttctcagaaatcttctcataaggcaaaagaca 
attttaattttcttcatttgaatagaaatgcttgttaccaacctatgtcttttcgaccaa 
gaatattgatagtaggagaaccaggatttgggcaaggttctcacttggcaccagctgtca 
ttcatgctttggaaaagtttactgtatatacattagacattcctgttctttttggagtta 
gtactacatcccctgaagaaacatgtgcccaggtgattcgtgaagctaagagaacagcac 
caagtatagtgtatgttcctcatatccacgtgtggtgggaaatagttggaccgacactta 
aagccacatttaccacattattacagaatattccttcatttgctccagttttactacttg 
caacttctgacaaaccccattccgctttgccagaagaggtgcaagaattgtttatccgtg 
attatggagagatttttaatgtccagttaccggataaagaagaacggacaaaattttttg 
aagatttaattctaaaacaagctgctaagcctcctatatcaaaaaagaaagcagttttgc 
aggctttggaggtactcccagtagcaccaccacctgagccaagatcactgacagcagaag 
aagtgaaacgactagaagaacaagaagaagatacatttagagaactgaggattttcttaa 
gaaatgttacacataggcttgctattgacaagcgattccgagtgtttactaagcctgttg 
accctgatgaggttcctgattatgtcactgtaataaagcaaccaatggacctttcatctg 
taatcagtaaaattgatctacacaagtatctgactgtgaaagactatttgagagatattg 
atctaatctgtagtaatgccttagaatacaatccagatagagatcctggagatcgtctta 
ttaggcatagagcctgtgctttaagagatactgcctatgccataattaaagaagaacttg 
atgaagactttgagcagctctgtgaagaaattcaggaatctagaaagaaaagaggttgta 
gctcctccaaatatgccccgtcttactaccatgtgatgccaaagcaaaattccactcttg 
ttggtgataaaagatcagacccagagcagaatgaaaagctaaagacaccgagtactcctg 
tggcttgcagcactcctggtaagtactcaagtagttttcatttatgaaattataacttta 
ttcttttctaatgccattaactttctcccttattttcctttatttgtgttaatgcagtga 
attacattgattttgttactattaagacaactttgcattcttgaaataaatccaagttta 
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Figur 1 (Cont'd) 



taatgtaccacttgcaacccagaagtcaccatctaattgtggactgtcttgatttgaaac 
tgatgtctgtaaattctggtctatttcttactcccccttattcttagaaggtaactccac 
gttacctattcaaagcctgtagggttttctaaggcctgccttccttggcagacttctaat 
tccaatttttgttctttaagcccctttgtcagactgcttag 

SEQ ID NO: 484. >Z44462_T13 # TY Transcript # LN 3800 # Source Gene: Z44462 
# Encoded protein: Z44462_P8 

gtcaaaacctgtcaaaggccaagtgctttatttccctccagtcaaggactgggcattttg 
catttgaatcctggatggcgggaagcagcaagcaggtgtcaacaggggtggggaggagac 
gccggaccccaccagccaatcagagcgcgcagaggcctcccttcccccgccgccgtcccg 
cgccaaaattccaaacgggctacgcaccgctcccaccgtctccttccgccgagctcagcg 
agccggaactatgtgaagaaaaagaagtcagcgttttccccaggactctccgctcaggtc 
ctctatactgtcggcctgtgttcctctccacgttggcccggcctccttcgctctacaaat 
gggaggagagtttcgagcctgctgttctatggtggtggcgtgggcgggacagacggcgga 
gggattctgggcgcgcgcgctgcggccgctggcgcggagcttgtggcgccagaattcgga 
gcgcggaagagccagagctgcgagcgcctggagctggatctctctccggtcgcgcacgcc 
gaggccagtagggagagaagatggtggttctccgcagcagcttggagctgcacaaccact 
ccgcggcctcggccacgggctccttggacctgtccagtgacttcctcagtctggagcaca 
tcggccggaggcggctccgctcggccggcgcggcgcagaagaaacccgcggcgaccacag 
ccaaagcgggcgatgggtcatcagttaaggaagttgaaacctaccaccggacacgtgctt 
taagatctttgagaaaagatgcacagaattcttcagattctagttttgagaagaatgtgg 
aaataacggagcaacttgctaatggcaggcattttacaaggcagttggccagacagcagg 
ctgataaaaaaaaagaagagcacagagaagacaaagtgattccagttactcggtcattga 
gggctagaaacatcgttcaaagtacagaacacttacatgaagataatggtgatgttgaag 
tgcgtcgaagttgtaggattagaagtcgttatagtggtgtaaaccagtccatgctgtttg 
acaaacttataactaacactgctgaagctgtacttcaaaaaatggatgacatgaagaaga 
tgcgtagacagcgaatgagagaacttgaagacttgggagtgtttaatgaaacagaagaaa 
gcaatcttaatatgtacacaagaggaaaacagaaagatattcaaagaactgatgaagaaa 
caactgataatcaagaaggcagtgtggagtcatctgaagagggtgaagaccaagaacatg 
aagatgatggtgaagatgaagatgatgaagatgatgatgatgatgacgatgatgatgatg 
atgatgatgatgaagatgatgaagatgaagaagatggagaagaagagaatcagaagcgat 
attatcttagacagagaaaagctactgtttactatcaggctccattggaaaaacctcgtc 
accagagaaagcccaacatattttatagtggcccagcttctcctgcaagaccaagatacc 
gattatcttccgcaggaccaagaagtccttactgtaaacgaatgaacaggcgaaggcatg 
caatccacagtagtgactcgacttcatcttcctcctctgaagatgaacagcactttgaga 
ggcggaggaaaaggagtcgtaatagggctatcaataggtgcctcccactaaattttcgga 
aagatgaattaaaaggcatttataaagatcgaatgaaaattggagcaagccttgccgatg 
ttgatccaatgcaactagattcttcagtacgatttgatagtgttggtggcctgtctaatc 
atatagcagctctaaaagagatggtggtgtttccattactttatccagaagtctttgaaa 
aatttaaaattcaacccccaagaggttgtttgttttatgggccacctggaactggaaaga 
ctctggttgccagagcacttgccaatgagtgcagtcaaggggataaaagagtagcatttt 
tcatgaggaaaggtgctgattgtctaagtaaatgggtaggagaatctgaaagacagctac 
gattgctgtttgatcaggcctatcagatgcgcccatcaattattttttttgacgaaattg 
atggtctggctccagtacggtcaagcaggcaagatcagattcacagttctattgtttcca 
ccctgctagctcttatggatggattggacagcagaggggaaattgtggtcattggtgcta 
cgaacaggctagattctatagatcctgctttacgaaggcctggtcgctttgatagagaat 
tcctctttagcctgcctgataaagaggctcgaaaagagattctaaagattcacaccaggg 
attggaatcccaaaccactggacacatttttagaagagctagcagaaaactgtgttggat 
actgtggagcagatattaaatcaatatgtgctgaagctgctttatgtgctttacgacgac 
gctacccacagatctataccactagtgagaaactgcagttggatctctcttcaattaata 
tctcagctaaggatttcgaggtagctatgcaaaagatgataccagcctcccaaagagctg 
tgacatcacctgggcaggcactgtccaccgttgtgaaaccactcctgcaaaacactgttg 
acaagattttagaagccctgcagagagtatttccacatgcagaattcagaacaaataaaa 
cattagactcagatatttcttgtcctctgctagaaagtgacttggcttacagtgatgatg 
atgttccatcagtttatgaaaatggactttctcagaaatcttctcataaggcaaaagaca 
attttaattttcttcatttgaatagaaatgcttgttaccaacctatgtcttttcgaccaa 
gaatattgatagtaggagaaccaggatttgggcaaggttctcacttggcaccagctgtca 
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Figur 1 (Cont'd) 



ttcatgctttggaaaagtttactgtatatacattagacattcctgttctttttggagtta 
gtactacatcccctgaagaaacatgtgcccaggtgattcgtgaagctaagagaacagcac 
caagtatagtgtatgttcctcatatccacgtgtggtgggaaatagttggaccgacactta 
aagccacatttaccacattattacagaatattccttcatttgctccagttttactacttg 
caacttctgacaaaccccattccgctttgccagaagaggtaatttatgtggggcggtatc 
attatactttcaaaaagagttgtttctaccacaataatcataacttattgaaaaaattca 
aatgctgtagaagtgaataaaaggataagttttcttatcaaatataccattggtaactat 
ttggtatatgtgctcattgggcaagatggtatatgtatatgtgtatgtatatattaagga 
tacataaattttttctcttgaaaaatggtattatatgccgggtgcagtggctcacgcctg 
taatcccagcactttgggaggccgaggtgggtggatcatgaggtcaggagattgagacca 
tcctggctaacagggtgaaaccctgtccctactaaaaatacaaaaaaaattagctggacg 
tggtggctggcgcctgcagtcccagctactcgggagactgaagcaggagaatggcatgaa 
ccctggaggcagagcttgcagtgagccgagtgagccactgcactccagcctgggcgacag 
agcgagactctgtctcaaaa 

SEQ ID NO: 485. >Z44462_T14 # TY Transcript I LN 2670 # Source Gene: Z44462 
# Encoded protein: Z44 4 62_P9 

aggaaagaatataggctgtactggagaatacagctgcccaagaataagaaacaattactg 
gaggatacagctgcccaaggaatagaaacaaatacaaaacaaaaacagccttaaatcaag 
tgaaaggttttatgttaaatatattatcataatctcttacgtatttattctctatctcct 
atattgcctggcatcgaaacttgactcacatgttatgtttagagttaattttgtgatttg 
atcatttgcttgaattttaggttcctgattatgtcactgtaataaagcaaccaatggacc 
tttcatctgtaatcagtaaaattgatctacacaagtatctgactgtgaaagactatttga 
gagatattgatctaatctgtagtaatgccttagaatacaatccagatagagatcctggag 
atcgtcttattaggcatagagcctgtgctttaagagatactgcctatgccataattaaag 
aagaacttgatgaagactttgagcagctctgtgaagaaattcaggaatctagaaagaaaa 
gaggttgtagctcctccaaatatgccccgtcttactaccatgtgatgccaaagcaaaatt 
ccactcttgttggtgataaaagatcagacccagagcagaatgaaaagctaaagacaccga 
gtactcctgtggcttgcagcactcctgctcagttgaagaggaaaattcgcaaaaagtcaa 
actggtacttaggcaccataaaaaagcgaaggaagatttcacaggcaaaggatgatagcc 
agaatgccatagatcacaaaattgagagtgatacagaggaaactcaagacacaagtgtag 
atcataatgagaccggaaacacaggagagtcttcggtggaagaaaatgaaaaacagcaaa 
atgcctctgaaagcaaactggaattgagaaataattcaaatacttgtaatatagagaatg 
agcttgaagactctaggaagactacagcatgtacagaattgagagacaagattgcttgta 
atggagatgcttctagctctcagataatacatatttctgatgaaaatgaaggaaaagaaa 
tgtgtgttctgcgaatgactcgagctagacgttcccaggtagaacagcagcagctcatca 
ctgttgaaaaggctttggcaattctttctcagcctacaccctcacttgttgtggatcatg 
agcgattaaaaaatcttttgaagactgttgttaaaaaaagtcaaaactacaacatatttc 
agttggaaaatttgtatgcagtaatcagccaatgtatttatcggcatcgcaaggaccatg 
ataaaacatcacttattcagaaaatggagcaagaggtagaaaacttcagttgttccagat 
gatgatgtcatggtatcgagtattctttatattcagttcctatttaagtcatttttgtca 
tgtccgcctaattgatgtagtatgaaaccctgcatctttaaggaaaagattaaaatagta 
aaataaaagtatttaaactttcctgatatttatgtacatattaagataaatgtcatgtgt 
aagataactgataaatattggaactttgctagaacaagaccctgtagtaatagtaataat 
agttgaagtttggccaactcttaataaagttattttggtaactaatgttttatggcactt 
aagaataattagcagcgttaaattttgtttgtattaagcacttttaattttatccttcct 
aaaaatagtttattgtatctgacaagaaacttacttaaccattgtgtccttcccatcttt 
tttgtcatctttgttttcttcaaatgccctcctcccatctgccttgagattcccttgtct 
tcacttaaaagccagagtgcaagtcatgatttgcgggagggctcttgaaccacttctggc 
tgcaccacaattctgtacttgagtatcacagtcattgtttttgagacaaacatttttata 
attctaatttgggttaataaagattttaaatatttcttggtttacttttgtaattatata 
cacaacaaatgtattaataactaccttgttaaacaccttttaatagcacaaggtttttat 
atttgcaagctgttgatatctttctaaaactgtttaggttatagtctattgatacttttt 
atatacaattttataaatataaatattataattttatattaatggtaccaaaaatacatt 
tcttaaggttaaaagcatgcacttccatgcatacttgcttttggggagagtggggagaag 
acattctaataatcagtttgtgaaatagcttctgttggaaaccttttgaggggaataagg 
aatggtcatctaaaatgagagattctggattttaatgcagttcaaagttgagctgtattt 
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Figur 1 (Cont'd) 



ttgttgttgatttatctggattttttttaaagccttctaaaacccagtgaattcaatacc 
ttaattagtacatactatcttatgtaatgcataaagcaatgccagtcactgagaacattt 
aaatatatttatattcctggagatacacattctcatttttgttggtttattataaattat 
tcttctagatgcatcttttataactaggatttcattttgtgtgtatagcttatgtaataa 
attttaaaggtgaaaactctcttaaaaaaa 

SEQ ID NO: 486. >Z44462JT15 # TY Transcript # LN 2141 # Source Gene: Z44462 

# Encoded protein: Z44462_P10 

cttctcacagtatgctctattcttgtttctgtattttaatcatacaaaatactgtggttt 
ctcactatttatcctttcatatatcagggtttttcactctttgaatttcagaaaatggtt 
gaatgttctttcatcatttttctttaaatttcttgaagctcagttgaagaggaaaattcg 
caaaaagtcaaactggtacttaggcaccataaaaaagcgaaggaagatttcacaggcaaa 
ggatgatagccagaatgccatagatcacaaaattgagagtgatacagaggaaactcaaga 
cacaagtgtagatcataatgagaccggaaacacaggagagtcttcggtggaagaaaatga 
aaaacagcaaaatgcctctgaaagcaaactggaattgagaaataattcaaatacttgtaa 
tatagagaatgagcttgaagactctaggaagactacagcatgtacagaattgagagacaa 
gattgcttgtaatggagatgcttctagctctcagataatacatatttctgatgaaaatga 
aggaaaagaaatgtgtgttctgcgaatgactcgagctagacgttcccaggtagaacagca 
gcagctcatcactgttgaaaaggctttggcaattctttctcagcctacaccctcacttgt 
tgtggatcatgagcgattaaaaaatcttttgaagactgttgttaaaaaaagtcaaaacta 
caacatatttcagttggaaaatttgtatgcagtaatcagccaatgtatttatcggcatcg 
caaggaccatgataaaacatcacttattcagaaaatggagcaagaggtagaaaacttcag 
ttgttccagatgatgatgtcatggtatcgagtattctttatattcagttcctatttaagt 
catttttgtcatgtccgcctaattgatgtagtatgaaaccctgcatctttaaggaaaaga 
ttaaaatagtaaaataaaagtatttaaactttcctgatatttatgtacatattaagataa 
atgtcatgtgtaagataactgataaatattggaactttgctagaacaagaccctgtagta 
atagtaataatagttgaagtttggccaactcttaataaagttattttggtaactaatgtt 
ttatggcacttaagaataattagcagcgttaaattttgtttgtattaagcacttttaatt 
ttatccttcctaaaaatagtttattgtatctgacaagaaacttacttaaccattgtgtcc 
ttcccatcttttttgtcatctttgttttcttcaaatgccctcctcccatctgccttgaga 
ttcccttgtcttcacttaaaagccagagtgcaagtcatgatttgcgggagggctcttgaa 
ccacttctggctgcaccacaattctgtacttgagtatcacagtcattgtttttgagacaa 
acatttttataattctaatttgggttaataaagattttaaatatttcttggtttactttt 
gtaattatatacacaacaaatgtattaataactaccttgttaaacaccttttaatagcac 
aaggtttttatatttgcaagctgttgatatctttctaaaactgtttaggttatagtctat 
tgatactttttatatacaattttataaatataaatattataattttatattaatggtacc 
aaaaatacatttcttaaggttaaaagcatgcacttccatgcatacttgcttttggggaga 
gtggggagaagacattctaataatcagtttgtgaaatagcttctgttggaaaccttttga 
ggggaataaggaatggtcatctaaaatgagagattctggattttaatgcagttcaaagtt 
gagctgtatttttgttgttgatttatctggattttttttaaagccttctaaaacccagtg 
aattcaataccttaattagtacatactatcttatgtaatgcataaagcaatgccagtcac 
tgagaacatttaaatatatttatattcctggagatacacattctcatttttgttggttta 
ttataaattattcttctagatgcatcttttataactaggatttcattttgtgtgtatagc 
ttatgtaataaattttaaaggtgaaaactctcttaaaaaaa 

SEQ ID NO:487. >Z44462_T16 # TY Transcript # LN 3009 # Source Gene: Z44462 

# Encoded protein: Z44 4 62_P11 

gtcaaaacctgtcaaaggccaagtgctttatttccctccagtcaaggactgggcattttg 
catttgaatcctggatggcgggaagcagcaagcaggtgtcaacaggggtggggaggagac 
gccggaccccaccagccaatcagagcgcgcagaggcctcccttcccccgccgccgtcccg 
cgccaaaattccaaacgggctacgcaccgctcccaccgtctccttccgccgagctcagcg 
agccggaactatgtgaagaaaaagaagtcagcgttttccccaggactctccgctcaggtc 
ctctatactgtcggcctgtgttcctctccacgttggcccggcctccttcgctctacaaat 
gggaggagagtttcgagcctgctgttctatggtggtggcgtgggcgggacagacggcgga 
gggattctgggcgcgcgcgctgcggccgctggcgcggagcttgtggcgccagaattcgga 
gcgcggaagagccagagctgcgagcgcctggagctggatctctctccggtcgcgcacgcc 
gaggccagtagggagagaagatggtggttctccgcagcagcttggagctgcacaaccact 
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Figur 1 (Cont'd) 



ccgcggcctcggccacgggctccttggacctgtccagtgacttcctcagtctggagcaca 
tcggccggaggcggctccgctcggccggcgcggcgcagaagaaacccgcggcgaccacag 
ccaaagcgggcgatgggtcatcagttaaggaagttgaaacctaccaccggacacgtgctt 
taagatctttgagaaaagatgcacagaattcttcagattctagttttgagaagaatgtgg 
aaataacggagcaacttgctaatggcaggcattttacaaggcagttggccagacagcagg 
ctgataaaaaaaaagaagagcacagagaagacaaagtgattccagttactcggtcattga 
gggctagaaacatcgttcaaagtacagaacacttacatgaagataatggtgatgttgaag 
tgcgtcgaagttgtaggattagaagtcgttatagtggtgtaaaccagtccatgctgtttg 
acaaacttataactaacactgctgaagctgtacttcaaaaaatggatgacatgaagaaga 
tgcgtagacagcgaatgagagaacttgaagacttgggagtgtttaatgaaacagaagaaa 
gcaatcttaatatgtacacaagaggaaaacagaaagatattcaaagaactgatgaagaaa 
caactgataatcaagaaggcagtgtggagtcatctgaagagggtgaagaccaagaacatg 
aagatgatggtgaagatgaagatgatgaagatgatgatgatgatgacgatgatgatgatg 
atgatgatgatgaagatgatgaagatgaagaagatggagaagaagagaatcagaagcgat 
attatcttagacagagaaaagctactgtttactatcaggctccattggaaaaacctcgtc 
accagagaaagcccaacatattttatagtggcccagcttctcctgcaagaccaagatacc 
gattatcttccgcaggaccaagaagtccttactgtaaacgaatgaacaggcgaaggcatg 
caatccacagtagtgactcgacttcatcttcctcctctgaagatgaacagcactttgaga 
ggcggaggaaaaggagtcgtaatagggctatcaataggtgcctcccactaaattttcgga 
aagatgaattaaaaggcatttataaagatcgaatgaaaattggagcaagccttgccgatg 
ttgatccaatgcaactagattcttcagtacgatttgatagtgttggtggcctgtctaatc 
atatagcagctctaaaagagatggtggtgtttccattactttatccagaagtctttgaaa 
aatttaaaattcaacccccaagaggttgtttgttttatgggccacctggaactggaaaga 
ctctggttgccagagcacttgccaatgagtgcagtcaaggggataaaagagtagcatttt 
tcatgaggaaaggtgctgattgtctaagtaaatgggtaggagaatctgaaagacagctac 
gattgctgtttgatcaggcctatcagatgcgcccatcaattattttttttgacgaaattg 
atggtctggctccagtacggtcaagcaggcaagatcagattcacagttctattgtttcca 
ccctgctagctcttatggatggattggacagcagaggggaaattgtggtcattggtgcta 
cgaacaggctagattctatagatcctgctttacgaaggcctggtcgctttgatagagaat 
tcctctttagcctgcctgataaagaggctcgaaaagagattctaaagattcacaccaggg 
attggaatcccaaaccactggacacatttttagaagagctagcagaaaactgtgttggat 
actgtggagcagatattaaatcaatatgtgctgaagctgctttatgtgctttacgacgac 
gctacccacagatctataccactagtgagaaactgcagttggatctctcttcaattaata 
tctcagctaaggatttcgaggtagctatgcaaaagatgataccagcctcccaaagagctg 
tgacatcacctgggcaggcactgtccaccgttgtgaaaccactcctgcaaaacactgttg 
acaagattttagaagccctgcagagagtatttccacatgcagaattcagaacaaataaaa 
cattagactcaggtataaaacttgattgaccatttaaagttagtttttataatcagtaaa 
gtgaaatgtttcatacataccagaaatatttatttgtaaaagaatctcttgaaaagaatt 
tggtaatgctttacctgggatatttgaaggaaatttactccatggattgcccttaattta 
acttattgtaagtttggggtgcatttttacatttacttattaattcatatctacacaaat 
gactagaaa 

SEQ ID NO:488. >Z44462_T17 # TY Transcript # LN 2495 # Source Gene: Z44462 
# Encoded protein: Z44 4 62_P12 

gtcaaaacctgtcaaaggccaagtgctttatttccctccagtcaaggactgggcattttg 
catttgaatcctggatggcgggaagcagcaagcaggtgtcaacaggggtggggaggagac 
gccggaccccaccagccaatcagagcgcgcagaggcctcccttcccccgccgccgtcccg 
cgccaaaattccaaacgggctacgcaccgctcccaccgtctccttccgccgagctcagcg 
agccggaactatgtgaagaaaaagaagtcagcgttttccccaggactctccgctcaggtc 
ctctatactgtcggcctgtgttcctctccacgttggcccggcctccttcgctctacaaat 
gggaggagagtttcgagcctgctgttctatggtggtggcgtgggcgggacagacggcgga 
gggattctgggcgcgcgcgctgcggccgctggcgcggagcttgtggcgccagaattcgga 
gcgcggaagagccagagctgcgagcgcctggagctggatctctctccggtcgcgcacgcc 
gaggccagtagggagagaagatggtggttctccgcagcagcttggagctgcacaaccact 
ccgcggcctcggccacgggctccttggacctgtccagtgacttcctcagtctggagcaca 
tcggccggaggcggctccgctcggccggcgcggcgcagaagaaacccgcggcgaccacag 
ccaaagcgggcgatgggtcatcagttaaggaagttgaaacctaccaccggacacgtgctt 
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taagatctttgagaaaagatgcacagaattcttcagattctagttttgagaagaatgtgg 
aaataacggagcaacttgctaatggcaggcattttacaaggcagttggccagacagcagg 
ctgataaaaaaaaagaagagcacagagaagacaaagtgattccagttactcggtcattga 
gggctagaaacatcgttcaaagtacagaacacttacatgaagataatggtgatgttgaag 
tgcgtcgaagttgtaggattagaagtcgttatagtggtgtaaaccagtccatgctgtttg 
acaaacttataactaacactgctgaagctgtacttcaaaaaatggatgacatgaagaaga 
tgcgtagacagcgaatgagagaacttgaagacttgggagtgtttaatgaaacagaagaaa 
gcaatcttaatatgtacacaagaggaaaacagaaagatattcaaagaactgatgaagaaa 
caactgataatcaagaaggcagtgtggagtcatctgaagagggtgaagaccaagaacatg 
aagatgatggtgaagatgaagatgatgaagatgatgatgatgatgacgatgatgatgatg 
atgatgatgatgaagatgatgaagatgaagaagatggagaagaagagaatcagaagcgat 
attatcttagacagagaaaagctactgtttactatcaggctccattggaaaaacctcgtc 
accagagaaagcccaacatattttatagtggcccagcttctcctgcaagaccaagatacc 
gattatcttccgcaggaccaagaagtccttactgtaaacgaatgaacaggcgaaggcatg 
caatccacagtagtgactcgacttcatcttcctcctctgaagatgaacagcactttgaga 
ggcggaggaaaaggagtcgtaatagggctatcaataggtgcctcccactaaattttcgga 
aagatgaattaaaaggcatttataaagatcgaatgaaaattggagcaagccttgccgatg 
ttgatccaatgcaactagattcttcagtacgatttgatagtgttggtggcctgtctaatc 
atatagcagctctaaaagagatggtggtgtttccattactttatccagaagtctttgaaa 
aatttaaaattcaacccccaagaggttgtttgttttatgggccacctggaactggaaaga 
ctctggttgccagagcacttgccaatgagtgcagtcaaggggataaaagagtagcatttt 
tcatgaggaaaggtgctgattgtctaagtaaatgggtaggagaatctgaaagacagctac 
gattgctgtttgatcaggcctatcagatgcgcccatcaattattttttttgacgaaattg 
atggtctggctccagtacggtcaagcaggcaagatcagattcacaggtaaaacttgcttg 
atggcacttcaacgtttcttcctatttcctgagtggtattgatgtgtgagagatagaatg 
ttaaatggtgaaagaaactttataagctagtagttcctttttttggtagtagaaccccag 
tatatagtacaaaaataaaaaatgtggatggttgcagtggctcatacctgtattcctggc 
actttgggaggccaaggcaggcagatcacttgaggtcaagagtttgagaccagcctggcc 
aacatggtgaaatcctgtctctactaaaaaaaaaa 

SEQ ID NO:489. >Z44462_T18 #• TY Transcript 8 LN 1423 « Source Gene: Z44462 
# Encoded protein: Z4 4 4 62_P13 

gtcaaaacctgtcaaaggccaagtgctttatttccctccagtcaaggactgggcattttg 
catttgaatcctggatggcgggaagcagcaagcaggtgtcaacaggggtggggaggagac 
gccggaccccaccagccaatcagagcgcgcagaggcctcccttcccccgccgccgtcccg 
cgccaaaattccaaacgggctacgcaccgctcccaccgtctccttccgccgagctcagcg 
agccggaactatgtgaagaaaaagaagtcagcgttttccccaggactctccgctcaggtc 
ctctatactgtcggcctgtgttcctctccacgttggcccggcctccttcgctctacaaat 
gggaggagagtttcgagcctgctgttctatggtggtggcgtgggcgggacagacggcgga 
gggattctgggcgcgcgcgctgcggccgctggcgcggagcttgtggcgccagaattcgga 
gcgcggaagagccagagctgcgagcgcctggagctggatctctctccggtcgcgcacgcc 
gaggccagtagggagagaagatggtggttctccgcagcagcttggagctgcacaaccact 
ccgcggcctcggccacgggctccttggacctgtccagtgacttcctcagtctggagcaca 
tcggccggaggcggctccgctcggccggcgcggcgcagaagaaacccgcggcgaccacag 
ccaaagcgggcgatgggtcatcagttaaggaagttgaaacctaccaccggacacgtgctt 
taagatctttgagaaaagatgcacagaattcttcagattctagttttgagaagaatgtgg 
aaataacggagcaacttgctaatggcaggcattttacaaggcagttggccagacagcagg 
ctgataaaaaaaaagaagagcacagagaagacaaagtgattccagttactcggtcattga 
gggctagaaacatcgttcaaagtacagaacacttacatgaagataatggtgatgttgaag 
tgcgtcgaagttgtaggattagaagtcgttatagtggtgtaaaccagtccatgctgtttg 
acaaacttataactaacactgctgaagctgtacttcaaaaaatggatgacatgaagaaga 
tgcgtagacagcgaatgagagaacttgaagacttgggagtgtttaatgaaacagaagaag 
taaatatattcttccattaagggaatgatttaatatccagtttaatttttttatagtacc 
taatccatttttcattaaatatctactttttccagcatttatttgatcagtcattagtta 
ctgctggaaatgtaagagtttatggctgtaaatatcttaattttaaacaagattggagga 
tccaacttaatttttttaataaatagattccaaatgttaatgg 
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SEQ ID NO: 490. >Z44462_T19 # TY Transcript # LN 876 ft Source Gene: 244462 

# Encoded protein: Z44462_P14 

gtcaaaacctgtcaaaggccaagtgctttatttccctccagtcaaggactgggcattttg 
catttgaatcctggatggcgggaagcagcaagcaggtgtcaacaggggtggggaggagac 
gccggaccccaccagccaatcagagcgcgcagaggcctcccttcccccgccgccgtcccg 
cgccaaaattccaaacgggctacgcaccgctcccaccgtctccttccgccgagctcagcg 
agccggaactatgtgaagaaaaagaagtcagcgttttccccaggactctccgctcaggtc 
ctctatactgtcggcctgtgttcctctccacgttggcccggcctccttcgctctacaaat 
gggaggagagtttcgagcctgctgttctatggtggtggcgtgggcgggacagacggcgga 
gggattctgggcgcgcgcgctgcggccgctggcgcggagcttgtggcgccagaattcgga 
gcgcggaagagccagagctgcgagcgcctggagctggatctctctccggtcgcgcacgcc 
gaggccagtagggagagaagatggtggttctccgcagcagcttggagctgcacaaccact 
ccgcggcctcggccacgggctccttggacctgtccagtgacttcctcagtctggagcaca 
tcggccggaggcggctccgctcggccggcgcggcgcagaagaaacccgcggcgaccacag 
ccaaagcgggcgtgagtacgggcctgaggcgggctcccagggcggttcccggggggactg 
ccggcagcccggcgctcggaggggcctgggggtgggcggcgccggtcagggtgacagttg 
tcgggggagaagtgtggtcgaggacgagtaaaatgg 

SEQ ID NO:491. >Z44462_P1 # TY Protein # CC #LN 1390 # Source Gene: 

Z44 462 # Encoding Transcript: 1 

MWLRSSLELHNHSAASATGSLDLSSDFLSLEHIGRRRLRSAGAAQKKPAATTAKAGDGS 
SVKEVETYHRTRALRSLRKDAQNSSDSSFEKNVEITEQLANGRHFTRQLARQQADKKKEE 
HREDKVIPVTRSLRARNIVQSTEHLHEDNGDVEVRRSCRIRSRYSGVNQSMLFDKLITNT 
AEAVLQKMDDMKKMRRQRMRELEDLGVFNETEESNLNMYTRGKQKDIQRTDEETTDNQEG 
SVESSEEGEDQEHEDDGEDEDDEDDDDDDDDDDDDDDEDDEDEEDGEEENQKRYYLRQRK 
ATVYYQAPLEKPRHQRKPNIFYSGPASPARPRYRLSSAGPRSPYCKRMNRRRHAIHSSDS 
TSSSSSEDEQHFERRRKRSRNRAINRCLPLNFRKDELKGIYKDRMKIGASLADVDPMQLD 
SSVRFDSVGGLSNHIAALKEMVVFPLLYPEVFEKFKIQPPRGCLFYGPPGTGKTLVARAL 
ANECSQGDKRVAFFMRKGADCLSKWVGESERQLRLLFDQAYQMRPSIIFFDEIDGLAPVR 
SSRQDQIHSSIVSTLLALMDGLDSRGEIWIGATNRLDSIDPALRRPGRFDREFLFSLPD 
KEARKEILKIHTRDWNPKPLDTFLEELAENCVGYCGADIKSICAEAALCALRRRYPQIYT 
TSEKLQLDLSSINISAKDFEVAMQKMIPASQRAVTSPGQALSTVVKPLLQNTVDKILEAL 
QRVFPHAEFRTNKTLDSDISCPLLESDLAYSDDDVPSVYENGLSQKSSHKAKDNFNFLHL 
NRNACYQPMSFRPRILIVGEPGFGQGSHLAPAVIHALEKFTVYTLDIPVLFGVSTTSPEE 
TCAQVIREAKRTAPSIVYVPHIHVWWEIVGPTLKATFTTLLQNIPSFAPVLLLATSDKPH 
SALPEEVQELFIRDYGEIFNVQLPDKEERTKFFEDLILKQAAKPPISKKKAVLQALEVLP 
VAPPPEPRSLTAEEVKRLEEQEEDTFRELRI FLRNVTHRLAI DKRFRVFTKPVDPDEVPD 
YVTVIKQPMDLSSVISKIDLHKYLTVKDYLRDIDLICSNALEYNPDRDPGDRLIRHRACA 
LRDTAYAIIKEELDEDFEQLCEEIQESRKKRGCSSSKYAPSYYHVMPKQNSTLVGDKRSD 
PEQNEKLBCT PSTPVACSTPAQLKRKI RKKSNWYLGTI KKRRKI SQAKDDSQNAI DHKI ES 
DTEETQDTSVDHNETGNTGESSVEENEKQQNASESKLELRNNSNTCNIENELEDSRKTTA 
CTELRDKIACNGDASSSQIIHISDENEGKEMCVLRMTRARRSQVEQQQLITVEKALAILS 
QPTPSLVVDHERLKNLLKTWKKSQNYNIFQLENLYAVISQCIYRHRKDHDKTSLIQKME 
QEVENFSCSR 

SEQ ID NO: 492. >Z44462_P9 # TY Protein # CC #LN 362 # Source Gene: Z44462 

# Encoding Transcript: 5 

MDLSSVISKIDLHKYLTVKDYLRDIDLICSNALEYNPDRDPGDRLIRHRACALRDTAYAI 
IKEELDEDFEQLCEEIQESRKKRGCSSSKYAPSYYHVMPKQNSTLVGDKRSDPEQNEKLK 
T PSTPVACSTPAQLKRKI RKKSNWYLGTI KKRRKI SQAKDDSQNAI DHKI ESDTEETQDT 
SVDHNETGNTGESSVEENEKQQNASESKLELRNNSNTCNIENELEDSRKTTACTELRDKI 
ACNGDASSSQIIHISDENEGKEMCVLRMTRARRSQVEQQQLITVEKALAILSQPTPSLW 
DHERLKNLLKTWKKSQNYNIFQLENLYAVISQCIYRHRKDH DKTSLIQKMEQEVENFSC 
SR 

SEQ ID NO:493. >Z44462_P2 # TY Protein # CC #LN 1380 # Source Gene: 

Z44462 8 Encoding Transcript: 6 
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MWLRSSLELHNHSAASATGSLDLSSDFLSLEHIGRRRLRSAGAAQKKPAATTAKAGDGS 
SVKEVETYHRTRALRSLRKDAQNSSDSSFEKNVEITEQLANGRHFTRQLARQQADKKKEE 
HREDKVIPVTRSLRARNIVQSTEHLHEDNGDVEVRRSCRIRSRYSGVNQSMLFDKLITNT 
AEAVLQKMDDMKKMRRQRMRELEDLGVFNETEESNLNMYTRGKQKDIQRTDEETTDNQEG 
SVESSEEGEDQEHEDDGEDEDDEDDDDDDDDDDDDDDEDDEDEEDGEEENQKRYYLRQRK 
ATVYYQAPLEKPRHQRKPNIFYSGPASPARPRYRLSSAGPRSPYCKRMNRRRHAIHSSDS 
TSSSSSEDEQHFERRRKRSRNRAINRCLPLNFRKDELKGIYKDRMKIGASLADVDPMQLD 
SSVRFDSVGGLSNHIAALKEMWFPLLYPEVFEKFKIQPPRGCLFYGPPGTGKTLVARAL 
ANECSQGDKRVAFFMRKGADCLS KWVGESERQLRLLFDQAYQMRPS 1 1 FFDE I DGLAPVR 
SSRQDQIHSSIVSTLLALMDGLDSRGEIWIGATNRLDSIDPALRRPGRFDREFLFSLPD 
KEARKE I LKIHTRDWN PKPLDTFLEELAENCVG YCGADI KS ICAEAALCALRRRY PQI YT 
T SEKLQLDLSS INI SAKDFEVAMQKMIPASQRAVTSPGQALSTWK PL LQNTVDK ILEAL 
QRVFPHAEFRTNKTLDSDISCPLLESDLAYSDDDVPSVYENGLSQKSSHKAKDNFNFLHL 
NRNACYQPMSFRPRILIVGEPGFGQGSHLAPAVIHALEKFTVYTLDIPVLFGVSTTSPEE 
TCAQVI REAKRTAPSI VY VPH I H VWWE I VGPTLKAT FTTLLQN I PS FAPVLLLATS DKPH 
SALPEEVQELFIRDYGEIFNVQLPDKEERTKFFEDLILKQAAKPPISKKKAVLQALEVLP 
VAPPPEPRSLTAEEVKRLEEQEEDTFRELRI FLRNVTHRLAI DKRFRVFTKPVDPDEVPD 
YVTVIKQPMDLSSVISKIDLHKYLTVKDYLRDIDLICSNALEYNPDRDPGDRLIRHRACA 
LRDTAYAIIKEELDEDFEQLCEEIQESRKKRGCSSSKYAPSYYHVMPKQNSTLVGDKRSD 
PEQNEKLKT PST PVACST PAQLKRKI RKKSNW YLGT I KKRRKI SQAKDDSQNAI DH KI ES 
DTEETQDTSVDHNETGNTGESSVEENEKQQNASESKLELRNNSNTCNIENELEDSRKTTA 
CTELRDKIACNGDASSSQIIHISDENEGKEMCVLRMTRARRSQVEQQQLITVEKALAILS 
QPTPSLVVDHERLKNLLKTWKKSQNYNIFQLENLYAVISQCIYRHRKDHDKTSLIQVNS 

SEQ ID NO:494. >Z44462_P3 # TY Protein # CC #LN 1372 # Source Gene: 

Z444 62 # Encoding Transcript: 7 

MWLRSSLELHNHSAASATGSLDLSSDFLSLEHIGRRRLRSAGAAQKKPAATTAKAGDGS 
SVKEVETYHRTRALRSLRKDAQNSSDSSFEKNVEITEQLANGRHFTRQLARQQADKKKEE 
HREDKVIPVTRSLRARNIVQSTEHLHEDNGDVEVRRSCRIRSRYSGVNQSMLFDKLITNT 
AEAVLQKMDDMKKMRRQRMRELEDLGVFNETEESNLNMYTRGKQKDIQRTDEETTDNQEG 
SVESSEEGEDQEHEDDGEDEDDEDDDDDDDDDDDDDDEDDEDEEDGEEENQKRYYLRQRK 
ATVYYQAPLEKPRHQRKPNIFYSGPASPARPRYRLSSAGPRSPYCKRMNRRRHAIHSSDS 
TSSSSSEDEQHFERRRKRSRNRAINRCLPLNFRKDELKGIYKDRMKIGASLADVDPMQLD 
SSVRFDSVGGLSNHIAALKEMWFPLLYPEVFEKFKIQPPRGCLFYGPPGTGKTLVARAL 
ANECSQGDKRVAFFMRKGADCLS KWVGESERQLRLLFDQAYQMRPS I I FFDE I DGLAPVR 
SSRQDQIHSSIVSTLLALMDGLDSRGEIVVIGATNRLDSIDPALRRPGRFDREFLFSLPD 
KEARKEILKIHTRDWNPKPLDTFLEELAENCVGYCGADIKSICAEAALCALRRRYPQIYT 
TSEKLQLDLSSINI SAKDFEVAMQKMI PASQRAVTSPGQALSTWKPLLQNTVDKI LEAL 
QRVFPHAEFRTNKTLDSDISCPLLESDLAYSDDDVPSVYENGLSQKSSHKAKDNFNFLHL 
NRNACYQPMSFRPRILIVGEPGFGQGSHLAPAVIHALEKFTVYTLDIPVLFGVSTTSPEE 
TCAQVI REAKRTAPSIVYVPH I HVWWEIVGPTLKATFTTLLQNI PS FAPVLLLATS DKPH 
SALPEEVQELFIRDYGEIFNVQLPDKEERTKFFEDLILKQAAKPPISKKKAVLQALEVLP 
VAPPPEPRSLTAEEVKRLEEQEEDTFRELRI FLRNVTHRLAI DKRFRVFTKPVDPDEVPD 
YVTVIKQPMDLSSVISLICSNALEYNPDRDPGDRLIRHRACALRDTAYAIIKEELDEDFE 
QLCEEIQESRKKRGCSSSKYAPSYYHVMPKQNSTLVGDKRSDPEQNEKLKTPSTPVACST 
PAQLKRKI RKKSNWYLGT I KKRRKI SQAKDDSQNAI DHKIESDTEETQDTSVDHNETGNT 
GESSVEENEKQQNASESKLELRNNSNTCNIENELEDSRKTTACTELRDKIACNGDASSSQ 
IIHISDENEGKEMCVLRMTRARRSQVEQQQLITVEKALAILSQPTPSLWDHERLKNLLK 
TVVKKSQNYNIFQLENLYAVISQCIYRHRKDHDKTSLIQKMEQEVENFSCSR 

SEQ ID NO: 495. >Z44462_P7 # TY Protein # CC #LN 1168 # Source Gene: 

Z444 62 # Encoding Transcript: 12 

MWLRSSLELHNHSAASATGSLDLSSDFLSLEHIGRRRLRSAGAAQKKPAATTAKAGDGS 
SVKEVETYHRTRALRSLRKDAQNSSDSSFEKNVEITEQLANGRHFTRQLARQQADKKKEE 
HREDKVIPVTRSLRARNIVQSTEHLHEDNGDVEVRRSCRIRSRYSGVNQSMLFDKLITNT 
AEAVLQKMDDMKKMRRQRMRELEDLGVFNETEESNLNMYTRGKQKDIQRTDEETTDNQEG 
SVESSEEGEDQEHEDDGEDEDDEDDDDDDDDDDDDDDEDDEDEEDGEEENQKRYYLRQRK 
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ATVYYQAPLEKPRHQRKPNIFYSGPASPARPRYRLSSAGPRSPYCKRMNRRRHAIHSSDS 
TSSSSSEDEQHFERRRKRSRNRAINRCLPLNFRKDELKGIYKDRMKIGASLADVDPMQLD 
SSVRFDSVGGLSNHIAALKEMWFPLLYPEVFEKFKIQPPRGCLFYGPPGTGKTLVARAL 
ANECSQGDKRVAFFMRKGADCLSKWVGESERQLRLLFDQAYQMRPSIIFFDEIDGLAPVR 
SSRQDQIHSSIVSTLLALMDGLDSRGEIWIGATNRLDSIDPALRRPGRFDREFLFSLPD 
KEARKE I L KI HT RDWN PK PL DT FLEE LAENCVG YCGAD I KS I C AEAALCALRRRY PQ I YT 
TSEKLQLDLS S I NI SAKDFEVAMQKM I PASQRAVTS PGQALST WKPLLQNTVDKI LEAL 
QRVFPHAEFRTNKTLDSDISCPLLESDLAYSDDDVPSVYENGLSQKSSHKAKDNFNFLHL 
NRNACYQPMSFRPRILIVGEPGFGQGSHLAPAVIHALEKFTVYTLDIPVLFGVSTTSPEE 
TCAQVIREAKRTAPSIVYVPHIHVWWEIVGPTLKATFTTLLQNIPSFAPVLLLATSDKPH 
SALPEEVQELFIRDYGEIFNVQLPDKEERTKFFEDLILKQAAKPPISKKKAVLQALEVLP 
VAPPPEPRSLTAEEVKRLEEQEEDTFRELRIFLRNVTHRLAIDKRFRVFTKPVDPDEVPD 
YVTVIKQPMDLSSVISKIDLHKYLTVKDYLRDIDLICSNALEYNPDRDPGDRLIRHRACA 
LRDTAYAIIKEELDEDFEQLCEEIQESRKKRGCSSSKYAPSYYHVMPKQNSTLVGDKRSD 
PEQNEKLKTPSTPVACSTPGKYSSS FHL 

SEQ ID NO:496. >Z44462_P8 # TY Protein # CC #LN 939 # Source Gene: Z44462 
# Encoding Transcript: 13 

MWLRSSLELHNHSAASATGSLDLSSDFLSLEHIGRRRLRSAGAAQKKPAATTAKAGDGS 
SVKEVETYHRTRALRSLRKDAQNSSDSSFEKNVEITEQLANGRHFTRQLARQQADKKKEE 
HREDKVIPVTRSLRARNIVQSTEHLHEDNGDVEVRRSCRIRSRYSGVNQSMLFDKLITNT 
AEAVLQKMDDMKKMRRQRMRELEDLGVFNETEESNLNMYTRGKQKDIQRTDEETTDNQEG 
SVESSEEGEDQEHEDDGEDEDDEDDDDDDDDDDDDDDEDDEDEEDGEEENQKRYYLRQRK 
ATVYYQAPLEKPRHQRKPNIFYSGPASPARPRYRLSSAGPRSPYCKRMNRRRHAIHSSDS 
TSSSSSEDEQHFERRRKRSRNRAINRCLPLNFRKDELKGIYKDRMKIGASLADVDPMQLD 
SSVRFDSVGGLSNHIAALKEMWFPLLYPEVFEKFKIQPPRGCLFYGPPGTGKTLVARAL 
ANECSQGDKRVAFFMRKGADCLSKWVGESERQLRLLFDQAYQMRPS 1 1 FFDEI DGLAPVR 
SSRQDQIHSSIVSTLLALMDGLDSRGEIVVIGATNRLDSIDPALRRPGRFDREFLFSLPD 
KEARKEILKIHTRDWNPKPLDTFLEELAENCVGYCGADIKSICAEAALCALRRRYPQIYT 
TSEKLQLDLSS I NI SAKDFEVAMQKMI PASQRAVTS PGQALST WKPLLQNTVDKI LEAL 
QRVFPHAEFRTNKTLDSDISCPLLESDLAYSDDDVPSVYENGLSQKSSHKAKDNFNFLHL 
NRNACYQPMSFRPRILIVGEPGFGQGSHLAPAVIHALEKFTVYTLDIPVLFGVSTTSPEE 
TCAQVIREAKRTAPSIVYVPHIHVWWEIVGPTLKATFTTLLQNIPSFAPVLLLATSDKPH 
SALPEEVIYVGRYHYTFKKSCFYHNNHNLLKKFKCCRSE 

SEQ ID NO:497. >Z44462_P10 # TY Protein # CC #LN 283 # Source Gene: 

Z44462 # Encoding Transcript: 15 

FSQYALFLFLYFNHTKYCGFSLFILSYIRVFHSLNFRKWLNVLSSFFFKFLEAQLKRKIR 
KKSNWYLGTIKKRRKISQAKDDSQNAIDHKIESDTEETQDTSVDHNETGNTGESSVEENE 
KQQNASESKLELRNNSNTCN I ENELEDS RKTTACTELRDKI ACNGDASSSQI I H I S DENE 
GKEMCVLRMTRARRSQVEQQQLITVEKALAILSQPTPSLWDHERLKNLLKTWKKSQNY 
NIFQLENLYAVISQCIYRHRKDHDKTSLIQKMEQEVENFSCSR 

SEQ ID NO:498. >Z44462_P11 # TY Protein # CC #LN 742 # Source Gene: 

Z44462 # Encoding Transcript: 16 

MWLRSSLELHNHSAASATGSLDLSSDFLSLEHIGRRRLRSAGAAQKKPAATTAKAGDGS 
SVKEVETYHRTRALRSLRKDAQNSSDSSFEKNVEITEQLANGRHFTRQLARQQADKKKEE 
HREDKVIPVTRSLRARNIVQSTEHLHEDNGDVEVRRSCRIRSRYSGVNQSMLFDKLITNT 
AEAVLQKMDDMKKMRRQRMRELEDLGVFNETEESNLNMYTRGKQKDIQRTDEETTDNQEG 
SVESSEEGEDQEHEDDGEDEDDEDDDDDDDDDDDDDDEDDEDEEDGEEENQKRYYLRQRK 
ATVYYQAPLEKPRHQRKPNIFYSGPASPARPRYRLSSAGPRSPYCKRMNRRRHAIHSSDS 
TSSSSSEDEQHFERRRKRSRNRAINRCLPLNFRKDELKGIYKDRMKIGASLADVDPMQLD 
SSVRFDSVGGLSNHIAALKEMWFPLLYPEVFEKFKIQPPRGCLFYGPPGTGKTLVARAL 
ANECSQGDKRVAFFMRKGADCLSKWVGESERQLRLLFDQAYQMRPSI I FFDEI DGLAPVR 
SSRQDQIHSSIVSTLLALMDGLDSRGEIWIGATNRLDSIDPALRRPGRFDREFLFSLPD 
KEARKEILKIHTRDWNPKPLDTFLEELAENCVGYCGADIKSICAEAALCALRRRYPQIYT 
TSEKLQLDLSS INI SAKDFEVAMQKMI PASQRAVTS PGQALST WKPLLQNTVDKILEAL 
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Figur 1 (Cont'd) 



QRVFPHAEFRTNKTLDSGIKLD 



SEQ ID NO: 499. >Z44462_P12 # TY Protein # CC #LN 549 S Source Gene: 

Z44462 # Encoding Transcript: 17 

MWLRSSLELHNHSAASATGSLDLSSDFLSLEHIGRRRLRSAGMQKKPAATTAKAGDGS 
SVKEVETYHRTRALRSLRKDAQNSSDSSFEKNVEITEQLANGRHFTRQLARQQADKKKEE 
HREDKVIPVTRSLRARNIVQSTEHLHEDNGDVEVRRSCRIRSRYSGVNQSMLFDKLITNT 
AEAVLQKMDDMKKMRRQRMRELEDLGVFNETEESNLNMYTRGKQKDIQRTDEETTDNQEG 
SVESSEEGEDQEHEDDGEDEDDEDDDDDDDDDDDDDDEDDEDEEDGEEENQKRYYLRQRK 
ATVYYQAPLEKPRHQRKPNIFYSGPAS PARPRYRLSSAGPRSPYCKRMNRRRHAIHSSDS 
TSSSSSEDEQHFERRRKRSRNRAINRCLPLNFRKDELKGIYKDRMKIGASLADVDPMQLD 
SSVRFDSVGGLSNHIAALKEMWFPLLYPEVFEKFKIQPPRGCLFYGPPGTGKTLVARAL 
ANECSQGDKRVAFFMRKGADCLS KWVGESERQLRLLFDQAYQMRPS 1 1 FFDEI DGLAPVR 
SSRQDQIHR 



SEQ ID NO: 500. >Z444 62_P13 # TY Protein # CC #LN 219 # Source Gene: 

Z44462 # Encoding Transcript: 18 

MVVLRSSLELHNHSAASATGSLDLSSDFLSLEHIGRRRLRSAGAAQKKPAATTAKAGDGS 
SVKEVETYHRTRALRSLRKDAQNSSDSSFEKNVEITEQLANGRHFTRQLARQQADKKKEE 
HREDKVIPVTRSLRARNIVQSTEHLHEDNGDVEVRRSCRIRSRYSGVNQSMLFDKLITNT 
AEAVLQKMDDMKKMRRQRMRELEDLGVFNETEEVNIFFH 



SEQ ID NO: 501. >Z44462_P14 # TY Protein # CC #LN 243 # Source Gene: 

Z44462 # Encoding Transcript: 19 

QNLSKAKCFISLQSRTGHFAFESWMAGSSKQVSTGVGRRRRTPPANQSAQRPPFPRRRPA 
PKFQTGYAPLPPSPSAELSEPELCEEKEVSVFPRTLRSGPLYCRPVFLSTLARPPSLYKW 
EESFEPAVLGGGVGGTDGGGILGARAAAAGAELVAPEFGARKSQSCERLELDLSPVAHAE 
ASRERRWWFSAAAWSCTTTPRPRPRAPWTCPVTSSVWSTSAGGGSARPARRRRNPRRPQP 
KRA 
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